SOTAVerified

Cross-Lingual Bitext Mining

Cross-lingual bitext mining is the task of mining sentence pairs that are translations of each other from large text corpora.

Papers

Showing 16 of 6 papers

TitleStatusHype
Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages0
Majority Voting with Bidirectional Pre-translation For Bitext RetrievalCode0
Parallel Sentence Mining by Constrained DecodingCode1
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
Margin-based Parallel Corpus Mining with Multilingual Sentence EmbeddingsCode0
Improving Neural Machine Translation Models with Monolingual DataCode1
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Massively Multilingual Sentence EmbeddingsF1 score93.91Unverified
2Multilingual Sentence EmbeddingsF1 score92.89Unverified
3Monolingual training dataF1 score75.8Unverified
#ModelMetricClaimedVerifiedStatus
1Massively Multilingual Sentence EmbeddingsF1 score96.19Unverified
2Multilingual Sentence EmbeddingsF1 score95.58Unverified
3Monolingual training dataF1 score76.9Unverified
#ModelMetricClaimedVerifiedStatus
1Massively Multilingual Sentence EmbeddingsF1 score92.27Unverified
#ModelMetricClaimedVerifiedStatus
1Massively Multilingual Sentence EmbeddingsF1 score93.3Unverified