opus-2020-06-17.zip

dataset: opus
model: transformer-align
source language(s): cmn_Bopo cmn_Hani cmn_Latn hak_Hani yue_Bopo yue_Hani
target language(s): ind zsm_Latn
model: transformer-align
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
download: opus-2020-06-17.zip
test set translations: opus-2020-06-17.test.txt
test set scores: opus-2020-06-17.eval.txt

Benchmarks

testset	BLEU	chr-F
Tatoeba-test.zho.msa	13.9	0.390