Parallel Corpus Mining

Mining a corpus of bilingual sentence pairs that are translations of each other.

Papers

Showing 1–10 of 13 papers

| Title | Status | Hype |
|---|---|---|
| Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts | Code | 1 |
| USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation | Code | 1 |
| Parallel Sentence Mining by Constrained Decoding | Code | 1 |
| ParaCrawl: Web-Scale Acquisition of Parallel Corpora | Code | 1 |
| MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases | Code | 1 |
| Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation | Code | 1 |
| Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond | Code | 1 |
| Better Quality Estimation for Low Resource Corpus Mining | | 0 |
| Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining | | 0 |
| Unsupervised Parallel Corpus Mining on Web Data | | 0 |

No leaderboard results yet.