Replies: 3 comments
-
>>> lissyx |
Beta Was this translation helpful? Give feedback.
-
>>> baltasar |
Beta Was this translation helpful? Give feedback.
-
>>> baltasar |
Beta Was this translation helpful? Give feedback.
-
>>> baltasar
[January 7, 2021, 8:47pm]
Hello. I'm starting my journey with DeepSpeech.
Language: Polish slash
DeepSpeech: 9.3 slash
System: ubuntu 20.04 slash
Common Voice: pl_129h_2020-12-11
I managed to run the test training, then tried to train my model on
Polish Common Voice.
python3 bin/import_cv2.py --validate_label_locale /home/validate_label_pl.py --filter_alphabet /home/alphabet.txt /home/utomek/Polskids/cv-corpus-6.1-2020-12-11/pl
this is command i used
and here is the output:
Loading TSV file: /home/utomek/Polskids/cv-corpus-6.1-2020-12-11/pl/test.tsv
Importing mp3 files... slash
ERROR: Inexistent slash --validate_label_locale specified. Please check. slash
Process ForkPoolWorker-1: slash
Traceback (most recent call last): slash
File '/usr/lib/python3.6/multiprocessing/process.py', line 258, in
slash _bootstrap slash
self.run() slash
File '/usr/lib/python3.6/multiprocessing/process.py', line 93, in run slash
self. slash _target( slash *self. slash _args, slash DEEPSPEECH.cdx deepspeech.commands DEEPSPEECH.pages DEEPSPEECH.warc.gz discourse.mozilla.org html-to-markdown.sh shell-conver-html-to-split-posts.sh sorted-deepspeech-posts slash *self. slash _kwargs) slash
File '/usr/lib/python3.6/multiprocessing/pool.py', line 103, in worker slash
initializer( slash *initargs) slash
File 'bin/import_cv2.py', line 54, in init_worker slash
alphabet = Alphabet(params.filter_alphabet) if params.filter_alphabet
else None slash
File
'/home/utomek/tmp/deepspeech-train-venv/lib/python3.6/site-packages/ds_ctcdecoder/init.py',
line 47, in init slash
raise ValueError('Alphabet initialization failed with error code
0x{:X}'.format(err)) slash
ValueError: Alphabet initialization failed with error code 0x1 slash
ERROR: Inexistent slash --validate_label_locale specified. Please check. slash
Process ForkPoolWorker-2: slash
Traceback (most recent call last): slash
File '/usr/lib/python3.6/multiprocessing/process.py', line 258, in
slash _bootstrap slash
self.run() slash
File '/usr/lib/python3.6/multiprocessing/process.py', line 93, in run slash
self. slash _target( slash *self. slash _args, slash DEEPSPEECH.cdx deepspeech.commands DEEPSPEECH.pages DEEPSPEECH.warc.gz discourse.mozilla.org html-to-markdown.sh shell-conver-html-to-split-posts.sh sorted-deepspeech-posts slash *self. slash _kwargs) slash
File '/usr/lib/python3.6/multiprocessing/pool.py', line 103, in worker slash
initializer( slash *initargs) slash
File 'bin/import_cv2.py', line 54, in init_worker slash
alphabet = Alphabet(params.filter_alphabet) if params.filter_alphabet
else None slash
File
'/home/utomek/tmp/deepspeech-train-venv/lib/python3.6/site-packages/ds_ctcdecoder/init.py',
line 47, in init slash
raise ValueError('Alphabet initialization failed with error code
0x{:X}'.format(err)) slash
ValueError: Alphabet initialization failed with error code 0x1 slash
ERROR: Inexistent slash --validate_label_locale specified. Please check. slash
Process ForkPoolWorker-3:
I have polish alphabet file filled with polish letters and this is my
validate_label_pl.py
def validate_label(label):
if 'a' in label: # disallow labels with 'a'
return None
return label.lower() # lower case valid labels
Not sure why it says my file is 'Inexistent'. The alphabet.txt,
validate_label_pl.py and Common Voice files are located inside home
directory. Tried my best to follow documentation and discourse like this
discusion
might end with inconsistent
dataset](https://discourse.mozilla.org/t/no-validate-label-locale-specified-your-might-end-with-inconsistent-dataset/68140)
.clear-badge style='color: #FFFFFF' drop-close='true'
.box}
> Hello I am training my own model with portuguese dataset from Mozilla
> Comon Voice. I am using Colab for that. Running this line:
> !/content/DeepSpeech/bin/import_cv2.py slash --filter_alphabet
> /content/DeepSpeech/data/alphabet.txt '/content/drive/My
> Drive/pt-language' I got this issue: Loading TSV file:
> /content/drive/My Drive/pt-language/test.tsv Importing mp3 files...
> WARNING: No slash --validate_label_locale specified, your might end with
> inconsistent dataset. WARNING: No slash --validate_label_locale sp...
[This is an archived TTS discussion thread from discourse.mozilla.org/t/error-inexistent-validate-label-locale-specified]
Beta Was this translation helpful? Give feedback.
All reactions