
Read Aloud Integration Method? #801

Closed
ihateprogramming88 opened this issue Jul 15, 2024 · 14 comments
Labels
enhancement New feature or request

Comments

@ihateprogramming88

This isn't really a feature request; I'd like to work on the read-aloud feature myself. Is there a particular way you want it implemented?

@ihateprogramming88 ihateprogramming88 added the enhancement New feature or request label Jul 15, 2024
@Jackson-Nickk
Contributor

Please use a good neural-network-based engine so the text isn't read in a robotic, annoying voice. There are numerous open-source read-aloud engines out there that you can use.

@ihateprogramming88
Author

Does anyone have any recommendations for neural network engines that can be used?

@Jackson-Nickk
Contributor

A simple Google search brought me to this:

https://github.com/espeak-ng/espeak-ng

They say it's lightweight with a low footprint. It even has an Android implementation for testing, which I tried, and it was OK. Its development doesn't seem to be dead, as the last commit was 8 days ago.

But those are just my search results, and I am not a developer myself. Please look into this project and search for other TTS models to find the most suitable one.

@Jackson-Nickk
Contributor

This one also seems interesting:
https://github.com/coqui-ai/TTS

@Jackson-Nickk
Contributor

Jackson-Nickk commented Jul 15, 2024

Android has its own built-in text-to-speech API, and Google's engine for it is called Speech Recognition & Synthesis. Third-party apps can use the built-in API to get accessibility features, but it is not as good as other neural models. It also relies on what Android provides, which means that on a custom Android ROM that has removed it, the ReadYou app cannot produce TTS on its own.

@Jackson-Nickk
Contributor

This one also has good results:

https://github.com/rendchevi/nix-tts

But its development seems dead.

@Jackson-Nickk
Contributor

Mozilla has its own TTS engine too:

https://github.com/mozilla/TTS

It's also good, but I don't know if it's lightweight or suitable for this job. Pocket's text-to-speech feature is good, but I couldn't find any evidence of whether Pocket uses Mozilla's TTS model.

This one also hasn't seen any development in 3 years. I don't know how that would affect you or the implementation. But you can always train the model yourself, I guess.

@candideu

Feeder is another FOSS RSS reader that does TTS read-aloud pretty well, but I'm not sure which library it uses, and there's only one voice: https://github.com/spacecowboy/Feeder

Another engine to look into is Piper TTS: https://github.com/rhasspy/piper

My only other suggestion, from a user experience perspective, would be to highlight the paragraph being read and to let users click on a paragraph to go back to it. This way users can replay or skip to the sections of their choice.

Here are some examples from other reading apps:

Read Aloud

image

AblePlayer

https://ableplayer.github.io/ableplayer/demos/video3.html

Hyperaudio

https://hyperaudio.github.io/hyperaudio-lite-editor/#hypertranscript=22.1,22.3
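
To make the paragraph suggestion a bit more concrete, here is a rough sketch of the bookkeeping it would need, written in plain Python so it runs anywhere. It is not ReadYou code and is not tied to any TTS engine: `ReadAloudQueue` and all of its method names are made up for illustration, and the real app would have to wire this to whatever engine and UI it ends up using. The idea is just to keep one index for the paragraph being read (the one to highlight), advance it when the engine finishes a paragraph, and reset it when the user taps a paragraph.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ReadAloudQueue:
    """Hypothetical helper: tracks which paragraph is currently being read."""

    paragraphs: List[str]
    current: int = 0  # index of the paragraph the UI should highlight

    @classmethod
    def from_text(cls, text: str) -> "ReadAloudQueue":
        # Assumption: paragraphs are separated by one or more blank lines.
        parts = [p.strip() for p in re.split(r"\n\s*\n", text)]
        return cls([p for p in parts if p])

    def is_finished(self) -> bool:
        return self.current >= len(self.paragraphs)

    def text_to_speak(self) -> Optional[str]:
        # The text to hand to the TTS engine next, or None at the end.
        return None if self.is_finished() else self.paragraphs[self.current]

    def on_paragraph_done(self) -> Optional[str]:
        # Call when the engine reports it finished the current paragraph.
        if not self.is_finished():
            self.current += 1
        return self.text_to_speak()

    def jump_to(self, index: int) -> str:
        # Call when the user taps a paragraph to replay or skip to it.
        if not 0 <= index < len(self.paragraphs):
            raise IndexError(f"no paragraph at index {index}")
        self.current = index
        return self.paragraphs[index]
```

With something like this, the view only needs to read `current` to know what to highlight, and a tap on paragraph `i` becomes `jump_to(i)` followed by speaking the returned text.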

@ihateprogramming88
Author

Great, thanks for the input. I'll do some research and decide what is best suited. Hopefully I'll get started soon.

@Ashinch
Owner

Ashinch commented Aug 9, 2024

Temporary solution: use the read-aloud ability built into WebView.

@Ashinch
Owner

Ashinch commented Aug 9, 2024

#568

@Ashinch Ashinch closed this as completed Aug 9, 2024
@ihateprogramming88
Author

Awesome, nice stuff. I have downloaded the latest build; how do I use read aloud? I can see bionic reading is working nicely. I just can't figure out how to use read aloud, and WebView is enabled. @Ashinch

@Ashinch
Owner

Ashinch commented Aug 12, 2024

> Awesome, nice stuff, I have downloaded the latest build, how do I use read aloud. I can see bionic reading is working nicely. Just can't figure out how to use read aloud, WebView is enabled. @Ashinch

In WebView, you need to select some text and expand the text selection toolbar, then tap the "Read aloud" button.

I'm looking for a more convenient way to trigger it.

@ihateprogramming88
Author

I can't seem to get this to work on GrapheneOS. Do you need any extra applications installed? I have Google speech synthesis installed, but when I select text in WebView, no "Read aloud" option pops up, at least on the latest nightly build.


4 participants