-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于p开头的发音经常被识别成t开头的发音的问题 #24
Comments
您观察太细了,可以分享一下您的视频吗? |
output.2.mp4这个是我找大夫录的音频,绝对是脾脏 |
您发现没有,视频里“脾脏“的发音听起来很像是”提脏“ |
你可以关掉视频来听,非常清楚就能听出是脾脏 |
可能这两个发音确实是有接近,但我生成的视频里,如果是男声,还没有成功合成出“p”的动作的……包括皮、派等,都没有闭口的动作 |
嗯……最开始是用了合成的音频,所以还不确定是音频模型的问题还是口型的问题,后来这段是录的,然后也出现了,就提了这个issue…… |
是不是需要吐字更清楚一些,然后就可以规避掉这个问题捏 |
建议试试,我听到的更像是”提脏“ |
比如派出,口型生成出来会变成”太出“,没有p这个闭口的动作;
或者脾脏,口型看起来是”提脏“
The text was updated successfully, but these errors were encountered: