Cross-Lingual Bitext Mining
Cross-lingual bitext mining is the task of mining sentence pairs that are translations of each other from large text corpora.
Papers
Showing 1–6 of 6 papers
All datasetsBUCC French-to-EnglishBUCC German-to-EnglishBUCC Chinese-to-EnglishBUCC Russian-to-English
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Massively Multilingual Sentence Embeddings | F1 score | 93.91 | — | Unverified |
| 2 | Multilingual Sentence Embeddings | F1 score | 92.89 | — | Unverified |
| 3 | Monolingual training data | F1 score | 75.8 | — | Unverified |