Cross-Lingual-Voice-Cloning
DISCLAIMER: Based on the paper *Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning*.
Differences from the original fork:
- cleaned up the TensorFlow requirement;
- added Russian language support;
- various code improvements.
The model needs two text files: one for training and one for validation. Each line of a file must follow this format:

`<path-to-wav-file>|<text-corresponding-to-speech-in-wav>|<speaker-no>|<lang-no>`

`<speaker-no>` runs from 0 to n-1, where n is the number of speakers, and `<lang-no>` runs from 0 to m-1, where m is the number of languages.
Language-id table:
- en : 0
- ru : 1
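
For example, with two speakers (ids 0 and 1) and the language ids above, filelist lines might look like the following; the wav paths and transcripts are placeholders, not files shipped with this repository:

```
/data/wavs/en_speaker0_0001.wav|The birch canoe slid on the smooth planks.|0|0
/data/wavs/ru_speaker1_0001.wav|Съешь ещё этих мягких французских булок.|1|1
```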
`hparams.training_files` and `hparams.validation_files` must point to the training and validation text files described above. `hparams.n_speakers` and `hparams.dim_yo` must be set to the number of speakers, and `hparams.n_langs` to the number of languages. A minimal sketch of these settings is shown below.
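
The sketch below assumes the NVIDIA Tacotron 2-style `create_hparams()` entry point that this fork builds on; the filelist paths are placeholders, and the same values can equally be edited directly in `hparams.py`:

```python
# Minimal sketch of the hparams changes (paths are placeholders).
from hparams import create_hparams

hparams = create_hparams()
hparams.training_files = "filelists/train_filelist.txt"    # hypothetical path
hparams.validation_files = "filelists/val_filelist.txt"    # hypothetical path
hparams.n_speakers = 2   # speaker ids run 0..n-1
hparams.dim_yo = 2       # must match the number of speakers
hparams.n_langs = 2      # en = 0, ru = 1
```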
To change the supported languages, add or remove Unicode characters in the `_letters` variable of `text/symbols.py`.
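
For instance, Russian support means the Cyrillic alphabet appears in `_letters`. The sketch below assumes the base Latin string mirrors the upstream Tacotron 2 symbols file; the Cyrillic block is what an en+ru setup would add:

```python
# text/symbols.py (sketch): Latin letters plus the Cyrillic alphabet for Russian.
_letters = (
    "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"
    "АБВГДЕЁЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ"
    "абвгдеёжзийклмнопрстуфхцчшщъыьэюя"
)
```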
For inference, run `clvc-infer-gh.ipynb` with the appropriate speaker and language numbers.
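
The outline below is a rough sketch of what the notebook does, assuming the NVIDIA Tacotron 2-style entry points (`create_hparams`, `Tacotron2`, `text_to_sequence`); the checkpoint path and the exact argument order of `model.inference()` are assumptions, so check the notebook for the real call:

```python
# Rough inference sketch; names and argument order are assumptions.
import torch
from hparams import create_hparams
from model import Tacotron2
from text import text_to_sequence

hparams = create_hparams()
model = Tacotron2(hparams).cuda().eval()
checkpoint = torch.load("outdir/checkpoint_50000", map_location="cuda")  # hypothetical path
model.load_state_dict(checkpoint["state_dict"])

# Encode the text; english_cleaners is the standard Tacotron 2 cleaner for English.
text = "Hello, this is a cross-lingual voice cloning test."
sequence = torch.LongTensor(text_to_sequence(text, ["english_cleaners"]))[None, :].cuda()

# Speaker 1 speaking English (lang 0); ids follow the filelist and table above.
speaker_id = torch.LongTensor([1]).cuda()
lang_id = torch.LongTensor([0]).cuda()

# The fork's inference() is assumed to take speaker and language ids
# alongside the text sequence.
with torch.no_grad():
    _, mel_outputs_postnet, _, _ = model.inference(sequence, speaker_id, lang_id)

# mel_outputs_postnet is a mel spectrogram; a separate vocoder (e.g. WaveGlow)
# is needed to turn it into a waveform.
```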