Replies: 6 comments
-
>>> lissyx |
Beta Was this translation helpful? Give feedback.
-
>>> Tilman_Kamp |
Beta Was this translation helpful? Give feedback.
-
>>> tuttlebr |
Beta Was this translation helpful? Give feedback.
-
>>> tuttlebr |
Beta Was this translation helpful? Give feedback.
-
>>> geotou |
Beta Was this translation helpful? Give feedback.
-
>>> lissyx |
Beta Was this translation helpful? Give feedback.
-
>>> tuttlebr
[April 30, 2019, 3:25am]
Hello,
I am trying to run the import_cv2.py script. I have downloaded the
english commonvoice release and my environment is a docker container
from the repo.
The input
python3 bin/import_cv2.py --filter_alphabet data/alphabet.txt data/cv/en/ --normalize
slash(the alphabet file is also from the repo. same result with or without
slash --normalize)
The error
Saving new DeepSpeech-formatted CSV file to: data/cv/en/clips/train.csv
Traceback (most recent call last):
File 'bin/import_cv2.py', line 158, in
_preprocess_data(params.tsv_dir, audio_dir, label_filter)
File 'bin/import_cv2.py', line 43, in _preprocess_data
_maybe_convert_set(input_tsv, audio_dir, label_filter)
File 'bin/import_cv2.py', line 56, in _maybe_convert_set
for row in reader:
File '/usr/lib/python3.6/csv.py', line 111, in next
self.fieldnames
File '/usr/lib/python3.6/csv.py', line 98, in fieldnames
self._fieldnames = next(self.reader)
File '/usr/lib/python3.6/encodings/ascii.py', line 26, in decode
return codecs.ascii_decode(input, self.errors)[0]
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 8085: ordinal not in range(128)
I'm guessing this is some mp3 prefix or buffer but I'm not really
familiar with it. I searched the github repo as well as this forum and
found no prior issues. Perhaps I'm not running it correctly.
[This is an archived TTS discussion thread from discourse.mozilla.org/t/commonvoice-import-cv2-py-error]
Beta Was this translation helpful? Give feedback.
All reactions