- dataset: opus4m+btTCv20210807
- model: transformer
- source language(s): kha khm vie
- target language(s): dws epo lfn
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels: >>dws_Latn<< >>epo<< >>lfn_Latn<<
- download: opus4m+btTCv20210807-2021-09-30.zip
- test set translations: opus4m+btTCv20210807-2021-09-30.test.txt
- test set scores: opus4m+btTCv20210807-2021-09-30.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
Tatoeba-test-v2021-08-07.kha-epo | 1.2 | 0.084 | 19 | 102 | 0.710 |
Tatoeba-test-v2021-08-07.khm-epo | 0.4 | 0.121 | 95 | 424 | 1.000 |
Tatoeba-test-v2021-08-07.multi-multi | 10.8 | 0.327 | 1906 | 14171 | 1.000 |
Tatoeba-test-v2021-08-07.vie-dws | 2.1 | 0.099 | 1 | 3 | 1.000 |
Tatoeba-test-v2021-08-07.vie-epo | 11.3 | 0.335 | 1790 | 13628 | 1.000 |
Tatoeba-test-v2021-08-07.vie-lfn | 8.1 | 0.112 | 1 | 5 | 1.000 |