- dataset: opusTCv20210807
- model: transformer-big
- source language(s): spa
- target language(s): bel bel_Latn orv_Cyrl rue rus ukr
- raw source language(s): spa
- raw target language(s): bel orv rue rus ukr
- model: transformer-big
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels:
- download: opusTCv20210807_transformer-big_2022-03-14.zip
- test set translations: opusTCv20210807_transformer-big_2022-03-14.test.txt
- test set scores: opusTCv20210807_transformer-big_2022-03-14.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
newstest2012.spa-rus | 23.9 | 0.52017 | 3003 | 64830 | 0.990 |
newstest2013.spa-rus | 26.1 | 0.53548 | 3000 | 58560 | 0.993 |
Tatoeba-test-v2021-08-07.spa-bel | 23.3 | 0.50542 | 205 | 1259 | 1.000 |
Tatoeba-test-v2021-08-07.spa-bel_Latn | 6.6 | 1.174 | 1 | 8 | 1.000 |
Tatoeba-test-v2021-08-07.spa-multi | 46.2 | 0.66158 | 10000 | 59935 | 0.994 |
Tatoeba-test-v2021-08-07.spa-orv | 0.9 | 0.13927 | 33 | 142 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rue | 1.6 | 0.18533 | 97 | 319 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rus | 47.1 | 0.67116 | 10506 | 69028 | 0.998 |
Tatoeba-test-v2021-08-07.spa-ukr | 39.0 | 0.61030 | 10115 | 54407 | 1.000 |
- dataset: opusTCv20210807
- model: transformer-big
- source language(s): spa
- target language(s): bel bel_Latn orv_Cyrl rue rus ukr
- raw source language(s): spa
- raw target language(s): bel orv rue rus ukr
- model: transformer-big
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels:
- download: opusTCv20210807_transformer-big_2022-03-17.zip
- test set translations: opusTCv20210807_transformer-big_2022-03-17.test.txt
- test set scores: opusTCv20210807_transformer-big_2022-03-17.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
newstest2012.spa-rus | 23.9 | 0.52017 | 3003 | 64830 | 0.990 |
newstest2013.spa-rus | 26.1 | 0.53548 | 3000 | 58560 | 0.993 |
Tatoeba-test-v2021-08-07.spa-bel | 23.3 | 0.50542 | 205 | 1259 | 1.000 |
Tatoeba-test-v2021-08-07.spa-bel_Latn | 6.6 | 1.174 | 1 | 8 | 1.000 |
Tatoeba-test-v2021-08-07.spa-multi | 46.5 | 0.66422 | 10000 | 59935 | 0.994 |
Tatoeba-test-v2021-08-07.spa-orv | 0.9 | 0.13927 | 33 | 142 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rue | 1.6 | 0.18533 | 97 | 319 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rus | 47.1 | 0.67116 | 10506 | 69028 | 0.998 |
Tatoeba-test-v2021-08-07.spa-ukr | 39.0 | 0.61030 | 10115 | 54407 | 1.000 |
- dataset: opusTCv20210807
- model: transformer-big
- source language(s): spa
- target language(s): bel bel_Latn orv_Cyrl rue rus ukr
- raw source language(s): spa
- raw target language(s): bel orv rue rus ukr
- model: transformer-big
- pre-processing: normalization + SentencePiece (spm32k,spm32k)
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - valid language labels:
- download: opusTCv20210807_transformer-big_2022-03-23.zip
- test set translations: opusTCv20210807_transformer-big_2022-03-23.test.txt
- test set scores: opusTCv20210807_transformer-big_2022-03-23.eval.txt
testset | BLEU | chr-F | #sent | #words | BP |
---|---|---|---|---|---|
newstest2012.spa-rus | 24.6 | 0.52441 | 3003 | 64830 | 0.985 |
newstest2013.spa-rus | 27.0 | 0.54250 | 3000 | 58560 | 0.984 |
Tatoeba-test-v2021-08-07.spa-bel | 27.4 | 0.54428 | 205 | 1259 | 0.998 |
Tatoeba-test-v2021-08-07.spa-bel_Latn | 6.6 | 1.174 | 1 | 8 | 1.000 |
Tatoeba-test-v2021-08-07.spa-multi | 46.5 | 0.66543 | 10000 | 59935 | 0.993 |
Tatoeba-test-v2021-08-07.spa-orv | 0.9 | 0.13646 | 33 | 142 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rue | 1.5 | 0.17914 | 97 | 319 | 1.000 |
Tatoeba-test-v2021-08-07.spa-rus | 48.5 | 0.68233 | 10506 | 69028 | 0.992 |
Tatoeba-test-v2021-08-07.spa-ukr | 42.1 | 0.63405 | 10115 | 54407 | 0.999 |