Trying to fine-tune for the Brazilian Portuguese language #184
Unanswered
abhisirka2001
asked this question in
Q&A
Replies: 5 comments 23 replies
-
Please let us know if you get it.
3 replies
-
I hope you manage it!
13 replies
-
What are the steps to train on another language, and how much RAM is needed?
2 replies
-
On Tue, Nov 19, 2024, 5:44 AM, Falkker wrote:
As a Brazilian, I’d say the voice sounds very real and natural, but it’s
still a bit confusing. Even as a native speaker, it can be a little hard to
understand. Some words are accurate but lack the accent.
Hey, I trained the model on more datasets, and here is one sample. Please
let me know if the words and accent are better now. I hope the model is good
at learning a new accent and a new language.
https://drive.google.com/file/d/1Adz4EXNa0yC-ZBs_-V3u_ndb-Zv582FV/view?usp=sharing
Thanks
Hey friend! Any update on this one? If you have any new file to check just
let me know. All the best.
Did you manage to train it in pt-BR, friend?
5 replies
-
Yes, because I didn't train a separate tokenizer for Brazilian Portuguese
and kept the original vocab file of F5-TTS, since it contained the
majority of the tokens. You can further fine-tune my model with a new vocab,
using less data, to get good results.
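One way to see how much of a pt-BR dataset the original vocab file actually covers is to count the transcript characters that have no entry in it. The following is only a minimal sketch: it assumes a character-level vocab with one token per line, and the helper names (`load_vocab`, `missing_characters`) and the toy data are hypothetical, not part of F5-TTS.

```python
import unicodedata
from collections import Counter


def load_vocab(lines):
    """Return the set of tokens, assuming one token per line (an assumption)."""
    tokens = set()
    for line in lines:
        # Strip only the newline so a line holding a single space survives.
        token = line.rstrip("\n")
        if token != "":
            tokens.add(unicodedata.normalize("NFC", token))
    return tokens


def missing_characters(vocab, transcripts):
    """Count transcript characters that have no entry in the vocab."""
    counts = Counter()
    for text in transcripts:
        # NFC keeps accented letters such as "ç" as single code points.
        for ch in unicodedata.normalize("NFC", text):
            if ch not in vocab:
                counts[ch] += 1
    return counts


# Toy data standing in for a real vocab file and real pt-BR transcripts.
toy_vocab = load_vocab([" \n", "a\n", "c\n", "o\n", "v\n", "ê\n"])
toy_transcripts = ["você", "ação"]
missing = missing_characters(toy_vocab, toy_transcripts)
print(missing.most_common())
```

With real data, an open file handle (for example `open("vocab.txt", encoding="utf-8")`, path hypothetical) can be passed to `load_vocab` directly. Characters that show up often in the result are the ones a new or extended vocab would need to include.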
On Wed, Nov 20, 2024, 2:05 AM, krc1983 wrote:
Hi, @abhisirka2001 <https://github.com/abhisirka2001>, I'm getting poor
results with your model. The voice is OK, but the pronunciation is
quite garbled. I believe it's because the Portuguese vocab.txt file is
missing. Can you tell me where to get it? Thanks!
0 replies
-
I'm fine-tuning the F5 base model for Brazilian Portuguese using 160 hours of audio data with a custom tokenizer. After 100 epochs on a single RTX 4090, I've encountered the following issues:
When using an English audio prompt and reference text, the output captures the accent but doesn't produce the expected words. You can listen to the output here.
When using a Brazilian Portuguese audio prompt and reference text, the model generates an empty audio file.
What should I consider or adjust to improve the model's performance from here? Any advice on resolving these issues would be appreciated!