Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Voice Conversion' paper candidate 2409.17364 #651

Open
github-actions bot opened this issue Sep 27, 2024 · 0 comments
Open

'Voice Conversion' paper candidate 2409.17364 #651

github-actions bot opened this issue Sep 27, 2024 · 0 comments

Comments

@github-actions
Copy link
Contributor

Please check whether this paper is about 'Voice Conversion' or not.

article info.

  • title: Exploring synthetic data for cross-speaker style transfer in style representation based TTS

  • summary: Incorporating cross-speaker style transfer in text-to-speech (TTS) models is
    challenging due to the need to disentangle speaker and style information in
    audio. In low-resource expressive data scenarios, voice conversion (VC) can
    generate expressive speech for target speakers, which can then be used to train
    the TTS model. However, the quality and style transfer ability of the VC model
    are crucial for the overall TTS model quality. In this work, we explore the use
    of synthetic data generated by a VC model to assist the TTS model in
    cross-speaker style transfer tasks. Additionally, we employ pre-training of the
    style encoder using timbre perturbation and prototypical angular loss to
    mitigate speaker leakage. Our results show that using VC synthetic data can
    improve the naturalness and speaker similarity of TTS in cross-speaker
    scenarios. Furthermore, we extend this approach to a cross-language scenario,
    enhancing accent transfer.

  • id: http://arxiv.org/abs/2409.17364v1

judge

Write [vclab::confirmed] or [vclab::excluded] in comment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants