Skip to content

Latest commit

 

History

History
74 lines (55 loc) · 3.18 KB

SHARED.md

File metadata and controls

74 lines (55 loc) · 3.18 KB

Shared Model Cards

Prerequisites of using

  • This document is serving as a quick lookup table for the community training/finetuning result, with various language support.
  • The models in this repository are open source and are based on voluntary contributions from contributors.
  • The use of models must be conditioned on respect for the respective creators. The convenience brought comes from their efforts.

Welcome to share here

  • Have a pretrained/finetuned result: model checkpoint (pruned best to facilitate inference, i.e. leave only ema_model_state_dict) and corresponding vocab file (for tokenization).
  • Host a public huggingface model repository and upload the model related files.
  • Make a pull request adding a model card to the current page, i.e. src\f5_tts\infer\SHARED.md.

Supported Languages

Multilingual

F5-TTS Base @ pretrain @ zh & en

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS Base ckpt & vocab Emilia 95K zh&en cc-by-nc-4.0
MODEL_CKPT: hf://SWivid/F5-TTS/F5TTS_Base/model_1200000.safetensors
VOCAB_FILE: hf://SWivid/F5-TTS/F5TTS_Base/vocab.txt

Other infos, e.g. Author info, Github repo, Link to some sampled results, Usage instruction, Tutorial (Blog, Video, etc.) ...

Mandarin

Japanese

F5-TTS Base @ pretrain/finetune @ ja

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS Base ckpt & vocab Emilia 1.7k JA & Galgame Dataset 5.4k cc-by-nc-4.0
MODEL_CKPT: hf://Jmica/F5TTS/JA_8500000/model_8499660.pt
VOCAB_FILE: hf://Jmica/F5TTS/JA_8500000/vocab_updated.txt

English

French

French LibriVox @ finetune @ fr

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS French ckpt & vocab LibriVox cc-by-nc-4.0
MODEL_CKPT: hf://RASPIAUDIO/F5-French-MixedSpeakers-reduced/model_last_reduced.pt
VOCAB_FILE: hf://RASPIAUDIO/F5-French-MixedSpeakers-reduced/vocab.txt