- dataset: opus1m+bt
- model: transformer-align
- source language(s): eng
- target language(s): hye
- model: transformer-align
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels: >>axm<< >>hye<< >>hye_Latn<< >>hyw<< >>xcl<<
- download: opus1m+bt-2021-04-10.zip
- test set translations: opus1m+bt-2021-04-10.test.txt
- test set scores: opus1m+bt-2021-04-10.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
Tatoeba-test.eng-hye | 16.4 | 0.400 | 1121 | 5114 | 1.000 |
Tatoeba-test.eng-multi | 16.6 | 0.402 | 1121 | 5115 | 1.000 |
- dataset: opus4m+btTCv20210807
- model: transformer
- source language(s): eng
- target language(s): hye hyw xcl
- model: transformer
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels: >>axm<< >>hye<< >>hye_Latn<< >>hyw<< >>xcl<<
- download: opus4m+btTCv20210807-2021-09-30.zip
- test set translations: opus4m+btTCv20210807-2021-09-30.test.txt
- test set scores: opus4m+btTCv20210807-2021-09-30.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
Tatoeba-test-v2021-08-07.eng-multi | 18.8 | 0.404 | 1121 | 5115 | 1.000 |
Tatoeba-test-v2021-08-07.multi-multi | 18.8 | 0.404 | 1121 | 5115 | 1.000 |