SOTAVerified

Cross-Lingual Document Classification

Cross-lingual document classification refers to the task of using data and models available for one language for which ample such resources are available (e.g., English) to solve classification tasks in another, commonly low-resource, language.

Papers

Showing 125 of 25 papers

TitleStatusHype
Multilingual and cross-lingual document classification: A meta-learning approachCode1
ZeRO: Memory Optimizations Toward Training Trillion Parameter ModelsCode1
MultiFiT: Efficient Multi-lingual Language Model Fine-tuningCode1
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
A Corpus for Multilingual Document Classification in Eight LanguagesCode1
Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling0
Robust Cross-lingual Embeddings from Parallel SentencesCode0
Wasserstein distances for evaluating cross-lingual embeddings0
Bridging the domain gap in cross-lingual document classificationCode0
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification0
Variational learning across domains with triplet information0
NMT-based Cross-lingual Document Embeddings0
Multilingual Seq2seq Training with Similarity Loss for Cross-Lingual Document Classification0
A Multi-task Approach to Learning Multilingual Representations0
Variational learning across domains with triplet information0
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus0
Learning Monolingual Compositional Representations via Bilingual Supervision0
Learning Crosslingual Word Embeddings without Bilingual CorporaCode0
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment ClassificationCode0
Learning Cross-lingual Representations with Matrix Factorization0
Learning Cross-lingual Word Embeddings via Matrix Co-factorization0
BilBOWA: Fast Bilingual Distributed Representations without Word AlignmentsCode0
A Multiplicative Model for Learning Distributed Text-Based Attribute Representations0
Multilingual Models for Compositional Distributed SemanticsCode0
Multilingual Distributed Representations without Word AlignmentCode0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.05Unverified
2MultiFiT, pseudoAccuracy89.42Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.95Unverified
4BiLSTM (UN)Accuracy74.52Unverified
5BiLSTM (Europarl)Accuracy72.83Unverified
6MultiCCA + CNNAccuracy72.38Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.8Unverified
2MultiFiT, pseudoAccuracy79.1Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.33Unverified
4MultiCCA + CNNAccuracy72.5Unverified
5BiLSTM (UN)Accuracy69.5Unverified
6BiLSTM (Europarl)Accuracy66.65Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy93.32Unverified
2MultiFiT, pseudoAccuracy82.48Unverified
3MultiCCA + CNNAccuracy74.73Unverified
4BiLSTM (UN)Accuracy71.97Unverified
5Massively Multilingual Sentence EmbeddingsAccuracy71.93Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.95Unverified
2MultiFiT, pseudoAccuracy91.62Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy84.78Unverified
4MultiCCA + CNNAccuracy81.2Unverified
5BiLSTM (Europarl)Accuracy71.83Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy89.7Unverified
2MultiFiT, pseudoAccuracy67.83Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy67.78Unverified
4BiLSTM (UN)Accuracy61.42Unverified
5MultiCCA + CNNAccuracy60.8Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy76.02Unverified
2Massively Multilingual Sentence EmbeddingsAccuracy69.43Unverified
3MultiCCA + CNNAccuracy69.38Unverified
4BiLSTM (Europarl)Accuracy60.73Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy69.57Unverified
2MultiCCA + CNNAccuracy67.63Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy60.3Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy92.7Unverified
2Bi+Accuracy88.1Unverified
3biCVM+Accuracy86.2Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy84.4Unverified
2Bi+Accuracy79.2Unverified
3biCVM+Accuracy76.9Unverified
#ModelMetricClaimedVerifiedStatus
1BiLSTM (Europarl)Accuracy75.45Unverified