- dataset: opus
- model: transformer
- source language(s): cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans nan wuu yue_Hans yue_Hant
- target language(s): cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani hsn_Hani lzh lzh_Hans nan wuu yue_Hans yue_Hant
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence-initial language token is required in the form of `>>id<<` (id = valid target language ID)
- download: opus-2020-10-04.zip
- test set translations: opus-2020-10-04.test.txt
- test set scores: opus-2020-10-04.eval.txt

| testset | BLEU | chr-F |
|---|---|---|
| Tatoeba-test.eng-zho.eng.zho | 27.9 | 0.234 |
| Tatoeba-test.multi.multi | 28.8 | 0.433 |
| Tatoeba-test.zho-eng.zho.eng | 30.1 | 0.498 |
| Tatoeba-test.zho-zho.zho.zho | 14.1 | 0.102 |
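
The language-token requirement listed above can be sketched as a small helper that prefixes each source sentence before it is passed to the model. This is a minimal illustration, not part of the released model: the function name `add_language_token` and the constant `TARGET_LANGS` are hypothetical, and placing a single space between the token and the sentence is an assumption.

```python
# Target language IDs copied from the "target language(s)" entry above.
TARGET_LANGS = frozenset(
    "cjy_Hans cjy_Hant cmn cmn_Hans cmn_Hant eng gan hak hak_Hani "
    "hsn_Hani lzh lzh_Hans nan wuu yue_Hans yue_Hant".split()
)


def add_language_token(sentence: str, target_lang: str) -> str:
    """Prefix a source sentence with the >>id<< token for the target language.

    Hypothetical helper: the separating space after the token is an assumption.
    """
    if target_lang not in TARGET_LANGS:
        raise ValueError(f"unknown target language ID: {target_lang!r}")
    return f">>{target_lang}<< {sentence.strip()}"


if __name__ == "__main__":
    # English input to be translated into Simplified Mandarin.
    print(add_language_token("How are you?", "cmn_Hans"))
    # Chinese input to be translated into English.
    print(add_language_token("你好吗?", "eng"))
```

Validating the ID up front is a design choice in this sketch: it makes a mistyped ID fail before any text reaches the model.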