- dataset: opus
- model: transformer-align
- source language(s): cmn_Bopo cmn_Hani cmn_Latn hak_Hani yue_Bopo yue_Hani
- target language(s): ind zsm_Latn
- model: transformer-align
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus-2020-06-17.zip
- test set translations: opus-2020-06-17.test.txt
- test set scores: opus-2020-06-17.eval.txt
testset | BLEU | chr-F |
---|---|---|
Tatoeba-test.zho.msa | 13.9 | 0.390 |