-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a documentation page for data quality required for fine-tuning #598
Comments
In the later version, we plan to remove the fine-tune part. Instead, we'll add a series of tools to enhance your reference audio's quality. |
But what if I need to add a new language or a dialect that the model usually doesn't handle? We need to fine-tune the model to accomplish such a task, right? |
True, If you want to fine-tune for a new language (though the next version will support most of spoken languages in the world) ,you may need about 2K hours of low quality data, and about 100h (the more, the better) high quality data (44.1khz with high accuracy label). |
Thanks a lot for your response, that's really helpful, I have one last question: does the data need to be cut to a certain length? |
Yes, we recommend you to cut them into 30s / per segment. |
Thank you very much for your helpful and fast responses |
Thank you for your hard work on this project I was wondering if it's possible to provide a rough estimate for when the next model might be available? Even a ballpark estimate would be greatly appreciated. |
This issue is stale because it has been open for 30 days with no activity. |
Self Checks
1. Is this request related to a challenge you're experiencing? Tell me about your story.
I'm trying to fine-tune the model to be able to pronounce Egyptian dialect.
I currently have a number of long videos -between 6 to 8 hours- that contain Egyptian books and the corresponding audio for different people reading those books, I'm cutting those audios into segments on silence and matching the segments to the text from the books, but I'm lacking some information to do so, such as:
2. Additional context or comments
No response
3. Can you help us with this feature?
The text was updated successfully, but these errors were encountered: