SOTAVerified

Parallel Corpus Mining

Mining a corpus of bilingual sentence pairs that are translations of each other.

Papers

Showing 110 of 13 papers

TitleStatusHype
Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture TranscriptsCode1
Better Quality Estimation for Low Resource Corpus Mining0
USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine TranslationCode1
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining0
Unsupervised Parallel Corpus Mining on Web Data0
ParaCrawl: Web-Scale Acquisition of Parallel CorporaCode1
Parallel Sentence Mining by Constrained DecodingCode1
MUSS: Multilingual Unsupervised Sentence Simplification by Mining ParaphrasesCode1
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures TranslationCode1
Hierarchical Document Encoder for Parallel Corpus Mining0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.