Sounds beginning with "p" are often recognized as sounds beginning with "t" #24

Open
wangqiaoch opened this issue Nov 20, 2024 · 8 comments

Comments

@wangqiaoch

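For example, with 派出 (pàichū), the generated mouth shape comes out as “太出” (tàichū): the lips never close for the "p".
Or with 脾脏 (pízàng, "spleen"), the mouth shape looks like “提脏” (tízàng).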

@DBDXSS
Member

DBDXSS commented Nov 20, 2024

You have a very sharp eye. Could you share your video?

@wangqiaoch
Author

You have a very sharp eye. Could you share your video?

output.2.mp4

This is audio I asked a doctor to record; the word is definitely 脾脏 (pízàng).

@DBDXSS
Member

DBDXSS commented Nov 21, 2024

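With 脾脏 (pízàng), the mouth shape looks like “提脏” (tízàng)

Have you noticed that in the video, the pronunciation of “脾脏” itself sounds a lot like “提脏”?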

@wangqiaoch
Author

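With 脾脏 (pízàng), the mouth shape looks like “提脏” (tízàng)

Have you noticed that in the video, the pronunciation of “脾脏” itself sounds a lot like “提脏”?

Try listening with the video turned off; you can hear very clearly that it is 脾脏.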

@wangqiaoch
Author

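With 脾脏 (pízàng), the mouth shape looks like “提脏” (tízàng)

Have you noticed that in the video, the pronunciation of “脾脏” itself sounds a lot like “提脏”?

The two sounds may well be close, but in the videos I have generated with a male voice, the "p" mouth movement has never once been synthesized successfully. That includes 皮 (pí), 派 (pài), and others: none of them shows the lips closing.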

@wangqiaoch
Author

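With 脾脏 (pízàng), the mouth shape looks like “提脏” (tízàng)

Have you noticed that in the video, the pronunciation of “脾脏” itself sounds a lot like “提脏”?

Hmm... At first I was using synthesized audio, so I could not tell whether the problem was in the audio model or in the lip-sync. This later clip was recorded by a person, the problem showed up again, and so I opened this issue.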

@wangqiaoch
Author

wangqiaoch commented Nov 21, 2024

Would enunciating more clearly be enough to avoid this problem?

@DBDXSS
Member

DBDXSS commented Nov 21, 2024

Would enunciating more clearly be enough to avoid this problem?

I'd suggest giving that a try. What I hear sounds more like “提脏”.


2 participants