SOTAVerified

Parallel Corpus Mining

Mining a corpus of bilingual sentence pairs that are translations of each other.

Papers

Showing 113 of 13 papers

TitleStatusHype
Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture TranscriptsCode1
Better Quality Estimation for Low Resource Corpus Mining0
USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine TranslationCode1
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining0
Unsupervised Parallel Corpus Mining on Web Data0
ParaCrawl: Web-Scale Acquisition of Parallel CorporaCode1
Parallel Sentence Mining by Constrained DecodingCode1
MUSS: Multilingual Unsupervised Sentence Simplification by Mining ParaphrasesCode1
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures TranslationCode1
Hierarchical Document Encoder for Parallel Corpus Mining0
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
Margin-based Parallel Corpus Mining with Multilingual Sentence EmbeddingsCode0
Effective Parallel Corpus Mining using Bilingual Sentence Embeddings0
Show:102550

No leaderboard results yet.