Speech to text service, also available to other apps by Stypox · Pull Request #109 · Stypox/dicio-android

Stypox · 2022-12-13T11:19:53Z

Speech to text service

This PR implements a Speech To Text service available to apps, fixing #54. Here is a preview of the feature, after pressing on the microphone button in Google Maps:

It is possible to also open the service from Dicio's navigation drawer, allowing the user to take dictation, copy to clipboard and share, fixing #33.

Testing APK

app-debug.zip

Technical details

This PR supersedes #100 by @nebkrid. #100 implemented the service as a skill, while this PR implements it as its own activity. The research done in #100 was really helpful though! I also kept the TODOs left behind there for later: for example, the result intent from the activity might contain multiple speech interpretations each with some different accuracy, and while Vosk does provide such information, it is currently not added to the result intent for simplicity.

Implemented export of Speech-To-Text functionality for other Apps, which can call this by startActivityForResult with an Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH)

Extra RecognizerIntent.EXTRA_PROMPT is implemented

This PR includes #111, thanks to @nebkrid again :-)

prompt message shows up as hint (if none is provided, default is still "Say something...")

Auto finish preference setting added: Reason: Vosk is good, but at least in German it is not perfect. Therefore it is easier (and faster: avoid waiting for loading vosk model again) if user gets the possibility to confirm or speak anew before reporting the result back to requesting app.

added the TODO from Implementing Speech-To-Text Service for other Apps #100 for optional (and seldom used, if ever) extras like EXTRA_BIASING_STRINGS, EXTRA_LANGUAGE for future reference and remind, which extras may be helpful for vosk recognition to improve the results

This PR also fixes a random crash when cleaning up Vosk, and sets the theme color used in e.g. button texts to a sensible value.

Added prompt message + preference for Auto-finish

sudomain · 2023-05-24T15:49:03Z

Is there an example of starting this activity using am? I've tried many variations of the following, but to no avail:

$ am start -a RecognizerIntent.ACTION_RECOGNIZE_SPEECH -e RecognizerIntent.EXTRA_PROMPT test
Starting: Intent { act=RecognizerIntent.ACTION_RECOGNIZE_SPEECH (has extras) }
Error: Activity not started, unable to resolve Intent { act=RecognizerIntent.ACTION_RECOGNIZE_SPEECH flg=0x10000000 (has extras) }

RokeJulianLockhart · 2023-05-25T18:06:42Z

@sudomain, that's best asked at https://github.com/Stypox/dicio-android/discussions/new?category=q-a

nebkrid · 2023-05-25T20:38:03Z

@sudomain I have no experience with am, but guessing from Error: Activity not started, unable to resolve Intent { act=RecognizerIntent.ACTION_RECOGNIZE_SPEECH flg=0x10000000 (has extras) }: May you have to use directly the string "android.speech.action.RECOGNIZE_SPEECH" (like in the activity's manifest definition)? This is the actual value of RecognizerIntent.ACTION_RECOGNIZE_SPEECH

rkagerer · 2024-12-03T07:13:22Z

Rather than a wakeword, I'd like to set up Dicio to start listening when I hit the Bixby button on my S10+. I've installed Button Mapper Pro and run the required ADB steps to allow it to control that button. Could someone help me figure out what fields to enter below to trigger the correct Dicio intent?

Stypox · 2024-12-03T08:13:06Z

The activity you need to start is https://github.com/Stypox/dicio-android/blob/master/app%2Fsrc%2Fmain%2Fkotlin%2Forg%2Fstypox%2Fdicio%2Fio%2Finput%2Fstt_popup%2FSttPopupActivity.kt, so I think you just need to put org.stypox.dicio.io.input.stt_popup.SttPopupActivity under "package"?

Stypox added 4 commits December 12, 2022 14:58

Use same theme as settings in whole app

21a0fb0

Make BaseActivity behavior overridable

7967dbd

[Vosk] Fix cleanup crashing app

dd3b26d

Add Speech To Text service, also available to apps

4552969

This was referenced Dec 13, 2022

Implementing Speech-To-Text Service for other Apps #100

Closed

Use Dicio as system STT / voice recognition service #54

Closed

[Feature Request]: Take dictation #33

Closed

This was linked to issues Dec 13, 2022

Use Dicio as system STT / voice recognition service #54

Closed

[Feature Request]: Take dictation #33

Closed

Stypox mentioned this pull request Dec 13, 2022

feature request: start the Dicio listening service in background via intent #79

Open

nebkrid and others added 4 commits December 13, 2022 21:47

Implement Prompt Message showing as hint if available

c082696

Added preference setting for Auto-Finish

9db8c99

Merge pull request #111 from nebkrid/stt-service

7bb5d26

Added prompt message + preference for Auto-finish

Improve code in SttServiceActivity

5929289

Stypox merged commit 2ab3251 into master Dec 20, 2022

Stypox deleted the stt-service branch December 20, 2022 15:10

nebkrid mentioned this pull request Jan 15, 2023

Instant availablity of speech input after opening #151

Open

MakeMeCookie mentioned this pull request Jul 6, 2023

modify a function. #180

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speech to text service, also available to other apps#109

Speech to text service, also available to other apps#109
Stypox merged 8 commits intomasterfrom
stt-service

Stypox commented Dec 13, 2022 •

edited

Loading

Uh oh!

sudomain commented May 24, 2023

Uh oh!

RokeJulianLockhart commented May 25, 2023

Uh oh!

nebkrid commented May 25, 2023

Uh oh!

rkagerer commented Dec 3, 2024 •

edited

Loading

Uh oh!

Stypox commented Dec 3, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

Stypox commented Dec 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!