SOTAVerified

Cross-Lingual Document Classification

Cross-lingual document classification refers to the task of using data and models available for one language for which ample such resources are available (e.g., English) to solve classification tasks in another, commonly low-resource, language.

Papers

Showing 110 of 25 papers

TitleStatusHype
Multilingual and cross-lingual document classification: A meta-learning approachCode1
A Corpus for Multilingual Document Classification in Eight LanguagesCode1
MultiFiT: Efficient Multi-lingual Language Model Fine-tuningCode1
ZeRO: Memory Optimizations Toward Training Trillion Parameter ModelsCode1
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification0
A Multiplicative Model for Learning Distributed Text-Based Attribute Representations0
Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling0
Learning Cross-lingual Word Embeddings via Matrix Co-factorization0
A Multi-task Approach to Learning Multilingual Representations0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.05Unverified
2MultiFiT, pseudoAccuracy89.42Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.95Unverified
4BiLSTM (UN)Accuracy74.52Unverified
5BiLSTM (Europarl)Accuracy72.83Unverified
6MultiCCA + CNNAccuracy72.38Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.8Unverified
2MultiFiT, pseudoAccuracy79.1Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.33Unverified
4MultiCCA + CNNAccuracy72.5Unverified
5BiLSTM (UN)Accuracy69.5Unverified
6BiLSTM (Europarl)Accuracy66.65Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy93.32Unverified
2MultiFiT, pseudoAccuracy82.48Unverified
3MultiCCA + CNNAccuracy74.73Unverified
4BiLSTM (UN)Accuracy71.97Unverified
5Massively Multilingual Sentence EmbeddingsAccuracy71.93Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.95Unverified
2MultiFiT, pseudoAccuracy91.62Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy84.78Unverified
4MultiCCA + CNNAccuracy81.2Unverified
5BiLSTM (Europarl)Accuracy71.83Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy89.7Unverified
2MultiFiT, pseudoAccuracy67.83Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy67.78Unverified
4BiLSTM (UN)Accuracy61.42Unverified
5MultiCCA + CNNAccuracy60.8Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy76.02Unverified
2Massively Multilingual Sentence EmbeddingsAccuracy69.43Unverified
3MultiCCA + CNNAccuracy69.38Unverified
4BiLSTM (Europarl)Accuracy60.73Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy69.57Unverified
2MultiCCA + CNNAccuracy67.63Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy60.3Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy92.7Unverified
2Bi+Accuracy88.1Unverified
3biCVM+Accuracy86.2Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy84.4Unverified
2Bi+Accuracy79.2Unverified
3biCVM+Accuracy76.9Unverified
#ModelMetricClaimedVerifiedStatus
1BiLSTM (Europarl)Accuracy75.45Unverified