Cross-Lingual Bitext Mining
Cross-lingual bitext mining is the task of mining sentence pairs that are translations of each other from large text corpora.
Papers
No papers found.
All datasetsBUCC French-to-EnglishBUCC German-to-EnglishBUCC Chinese-to-EnglishBUCC Russian-to-English
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Massively Multilingual Sentence Embeddings | F1 score | 93.91 | — | Unverified |
| 2 | Multilingual Sentence Embeddings | F1 score | 92.89 | — | Unverified |
| 3 | Monolingual training data | F1 score | 75.8 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Massively Multilingual Sentence Embeddings | F1 score | 96.19 | — | Unverified |
| 2 | Multilingual Sentence Embeddings | F1 score | 95.58 | — | Unverified |
| 3 | Monolingual training data | F1 score | 76.9 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Massively Multilingual Sentence Embeddings | F1 score | 92.27 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Massively Multilingual Sentence Embeddings | F1 score | 93.3 | — | Unverified |