-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Experimental Result? #3
Comments
bác check lại transcription lúc chạy benchmark thử xem, khả năng là nó ko ra tiếng việt whisper bị 1 cái là khi đã fine tune thì cái language detection bị giảm khá nặng, e cũng bị như vậy nên lúc nào cũng phải ép cái |
Lúc gọi model mình cũng đã config như vầy lúc train con v3 khoảng 50h data thì ổn nhưng lên 125h thì nó bị vầy mình cũng không rõ tại sao.
nhưng infer nó vẫn ra:
|
cú pháp infer khác mà nhỉ 🤔 hay là bị hallucination 🤔 |
À infer tôi vẫn ép là language="vi". Cùng một settings như vậy tôi chuyển qua V2 chạy thì ổn.
|
vậy chắc do hallucination r, v3 bị cái này khá phiền ngoài ra thì bạn check lại loss curve lúc training để cho chắc |
Hi @phineas-pta,
Recently, I experimented with fine-tuning Whisper using QLoRA. We tried using the Large v3 model, fine-tuning it with four datasets: CMV-17, VIVOS, Fleurs, and 100 hours of VinAI data. The results for the Large V3 model exceeded 100%. I'm curious if you encountered a similar issue? (I reran the experiments three times but the results remained the same). However, when switching to version v2, the results were more effective.
(100h VinAi data)
(100h VinAi data)
Despite fine-tuning with over 100 hours of data, the improvements seemed minimal. If you have experienced a similar situation, I hope you can share your solution.
Thank you very much.
The text was updated successfully, but these errors were encountered: