Skip to content

danklabs/tts_dataset_maker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TDM

TTS Dataset Maker

Make your own text to speech dataset with this tool. Why should you make one? To replicate people's voices kinda like this and much more.
Fair Warning: It's way harder than you think and this will make it a little less harder

Resources(More to come):
  • Here is a sentdex video about voice cloning.

ScreenShots:
img

Install the application in your computer. You can find it in the releases section.

Tutorial

The dataset folder will look like(Similar to LJ speech dataset):

Destination folder:
  -wavs   <===== folder containing the clips
  -metadata.csv <===== csv file containing the clip name and corresponding text
Pr's are welcome
Todo:

To Do:

I/O:

  • Better Responsive UI.
  • Add some way to begin from where it was left off.
  • Add timeline to wavesurfer.
  • Add keyboard shortcuts for the activities.
  • Add yt and audio link support.
  • Better Readme

Core additional features:

  • Add slow mo option for playback.
  • Remove silent parts from the clip.

Send me your queries @ [email protected]