TTS Dataset Maker

Make your own text to speech dataset with this tool. Why should you make one? To replicate people's voices kinda like this and much more.
Fair Warning: It's way harder than you think and this will make it a little less harder

Resources(More to come):

Here is a sentdex video about voice cloning.

ScreenShots:

Download

Install the application in your computer. You can find it in the releases section.

Tutorial

The dataset folder will look like(Similar to LJ speech dataset):

Destination folder:
  -wavs   <===== folder containing the clips
  -metadata.csv <===== csv file containing the clip name and corresponding text

Pr's are welcome

Todo:

To Do:

I/O:

Better Responsive UI.
Add some way to begin from where it was left off.
Add timeline to wavesurfer.
Add keyboard shortcuts for the activities.
Add yt and audio link support.
Better Readme

Core additional features:

Add slow mo option for playback.
Remove silent parts from the clip.

Send me your queries @ [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TTS Dataset Maker

Resources(More to come):

Download

Pr's are welcome

Todo:

About

Releases 2

Packages

Contributors 2

Languages

License

danklabs/tts_dataset_maker

Folders and files

Latest commit

History

Repository files navigation

TTS Dataset Maker

Resources(More to come):

Download

Pr's are welcome

Todo:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages