Speech Recognition for Uyghur using deep learning

Training:

this model using CTC loss for training.

unzip results.7z and thuyg20_data.7z to the same folder where python source files located. then run:

python train.py

Recognition:

for recognition download only pretrained model(results.7z). then run:

python tonu.py test1.wav

result will be:

        Model loaded: results/UModel_last.pth
            Best CER: 7.21%
             Trained: 473 epochs
The model has 26,389,282 trainable parameters

======================
Recognizing file .\test2.wav
test2.wav -> bu öy eslide xotunining xush tebessumi oghlining omaq külküsi bilen güzel idi

This project using

A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Speech Recognition for Uyghur using deep learning

Files

README.md

Latest commit

History

README.md

File metadata and controls

Speech Recognition for Uyghur using deep learning