Skip to content

Latest commit

 

History

History
19 lines (15 loc) · 882 Bytes

README.md

File metadata and controls

19 lines (15 loc) · 882 Bytes

opus-2020-06-17.zip

  • dataset: opus
  • model: transformer-align
  • source language(s): cmn_Bopo cmn_Hani cmn_Latn hak_Hani yue_Bopo yue_Hani
  • target language(s): ind zsm_Latn
  • model: transformer-align
  • pre-processing: normalization + SentencePiece (spm32k,spm32k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2020-06-17.zip
  • test set translations: opus-2020-06-17.test.txt
  • test set scores: opus-2020-06-17.eval.txt

Benchmarks

testset BLEU chr-F
Tatoeba-test.zho.msa 13.9 0.390