Folders and files Name Name Last commit message
Last commit date
parent directory
View all files
dataset: opus
model: transformer
source language(s): bod brx brx_Latn cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans mya nan wuu yue_Hans yue_Hant
target language(s): bod brx brx_Latn cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans mya nan wuu yue_Hans yue_Hant
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-10-04.zip
test set translations: opus-2020-10-04.test.txt
test set scores: opus-2020-10-04.eval.txt
testset
BLEU
chr-F
Tatoeba-test.bod-eng.bod.eng
2.2
0.135
Tatoeba-test.brx-eng.brx.eng
2.8
0.134
Tatoeba-test.eng-bod.eng.bod
3.1
0.132
Tatoeba-test.eng-brx.eng.brx
0.5
0.071
Tatoeba-test.eng-mya.eng.mya
0.4
0.160
Tatoeba-test.eng-zho.eng.zho
28.1
0.239
Tatoeba-test.multi.multi
28.4
0.428
Tatoeba-test.mya-eng.mya.eng
15.6
0.361
Tatoeba-test.mya-zho.mya.zho
3.6
0.056
Tatoeba-test.zho-eng.zho.eng
29.8
0.495
Tatoeba-test.zho-mya.zho.mya
0.4
0.146
Tatoeba-test.zho-zho.zho.zho
13.1
0.099
dataset: opus
model: transformer
source language(s): bod brx cjy cmn eng gan hak hsn lzh mya nan wuu yue
target language(s): bod brx cjy cmn eng gan hak hsn lzh mya nan wuu yue
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>eng<< >>mya<< >>cmn_Hant<< >>cmn_Hans<< >>yue_Hant<< >>bod<< >>cmn<< >>yue_Hans<< >>nan<< >>wuu<< >>lzh<< >>lzh_Hans<<
download: opus-2021-02-16.zip
test set translations: opus-2021-02-16.test.txt
test set scores: opus-2021-02-16.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.multi-multi
26.3
0.400
10000
97851
0.925
dataset: opus
model: transformer
source language(s): bod brx cjy cmn eng gan hak hsn lzh mya nan wuu yue
target language(s): bod brx cjy cmn eng gan hak hsn lzh mya nan wuu yue
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>eng<< >>mya<< >>cmn_Hant<< >>cmn_Hans<< >>yue_Hant<< >>bod<< >>cmn<< >>yue_Hans<< >>nan<< >>wuu<< >>lzh<< >>lzh_Hans<<
download: opus-2021-02-17.zip
test set translations: opus-2021-02-17.test.txt
test set scores: opus-2021-02-17.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.multi-multi
26.3
0.400
10000
97851
0.925
tico19-test.eng-mya
1.9
0.174
2100
32295
1.000
dataset: opus
model: transformer
source language(s): cmn mya nan yue
target language(s): cmn mya nan yue
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>eng<< >>mya<< >>cmn_Hant<< >>cmn_Hans<< >>yue_Hant<< >>bod<< >>cmn<< >>yue_Hans<< >>nan<< >>wuu<< >>lzh<< >>lzh_Hans<<
download: opus-2021-02-18.zip
test set translations: opus-2021-02-18.test.txt
test set scores: opus-2021-02-18.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.cmn_Hans-mya
0.6
0.124
10
40
1.000
Tatoeba-test.cmn_Hans-yue_Hans
7.0
0.072
59
678
1.000
Tatoeba-test.cmn_Hans-yue_Hant
7.2
0.070
26
234
1.000
Tatoeba-test.cmn_Hant-mya
0.9
0.185
7
30
1.000
Tatoeba-test.cmn_Hant-yue_Hans
5.7
0.060
50
546
1.000
Tatoeba-test.cmn_Hant-yue_Hant
14.6
0.119
31
245
0.941
Tatoeba-test.multi-multi
26.3
0.400
10000
97851
0.925
Tatoeba-test.mya-cmn_Hans
2.1
0.039
10
150
0.931
Tatoeba-test.mya-cmn_Hant
6.8
0.083
7
90
0.765
Tatoeba-test.mya-zho
3.6
0.056
17
240
0.872
Tatoeba-test.nan-cmn_Hans
0.8
0.000
1
4
1.000
Tatoeba-test.nan-cmn_Hant
1.9
0.000
1
10
1.000
Tatoeba-test.yue_Hans-cmn_Hans
12.4
0.112
56
602
0.865
Tatoeba-test.yue_Hans-cmn_Hant
8.2
0.057
66
793
0.943
Tatoeba-test.yue_Hant-cmn_Hans
15.6
0.128
35
320
0.915
Tatoeba-test.yue_Hant-cmn_Hant
17.7
0.141
37
326
0.937
Tatoeba-test.zho-mya
0.4
0.146
17
70
1.000
Tatoeba-test.zho-zho
13.1
0.099
2500
29103
0.943
tico19-test.eng-mya
1.9
0.174
2100
32295
1.000
You can’t perform that action at this time.