Updates to demo to include audio part #26
Love this PR! So nice to see the demo working end-to-end, and I like how you split the audio into speaker chunks so you don't have to wait a long time for one huge audio file. Smart! And great to have the API docs referenced in the UI.
FYI the demo ran fast and smoothly on my 3-year-old laptop with no GPU and 16 GB of RAM; peak RAM usage was 8 GB! I was a bit worried about that, since both the text_to_text and the text_to_speech models are loaded simultaneously instead of sequentially (which would let us offload text_to_text).
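For reference, here is a minimal sketch of what the sequential alternative could look like. It is only an illustration: `run_sequentially`, `load_text_model`, and `load_speech_model` are hypothetical names, not functions from this repo, and the loaders are assumed to return plain callables. The idea is to finish the whole script first, drop the text model, and only then load the speech model.

```python
import gc
from typing import Callable, List


def run_sequentially(
    load_text_model: Callable[[], Callable[[str], str]],
    load_speech_model: Callable[[], Callable[[str], bytes]],
    document: str,
) -> List[bytes]:
    """Generate the script, free the text model, then synthesize each line."""
    # Hypothetical loader: returns a callable that turns a document into a script.
    text_model = load_text_model()
    script = text_model(document)

    # Drop the only reference to the text model and ask the garbage collector
    # to reclaim it before the second model is loaded.
    del text_model
    gc.collect()

    # Hypothetical loader: returns a callable that turns one line into audio bytes.
    speech_model = load_speech_model()
    return [speech_model(line) for line in script.splitlines() if line.strip()]
```

The trade-off is that no audio can be produced until the full script exists, so this would likely give up the chunk-by-chunk responsiveness of the current demo in exchange for a lower memory peak.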
Indeed, there is an issue with parler not pronouncing certain words well; I had missed that in my local testing. But I think we can keep this as it is, merge #26 into #19 and then #19 into main, and change the TTS model starting next week.
What's changing
parler_tts installing issues.
How to test it
Inside codespaces, run:
Additional notes for reviewers