SOTAVerified

Cross-Lingual Document Classification

Cross-lingual document classification refers to the task of using data and models available for one language for which ample such resources are available (e.g., English) to solve classification tasks in another, commonly low-resource, language.

Papers

Showing 110 of 25 papers

TitleStatusHype
MultiFiT: Efficient Multi-lingual Language Model Fine-tuningCode1
A Corpus for Multilingual Document Classification in Eight LanguagesCode1
Multilingual and cross-lingual document classification: A meta-learning approachCode1
ZeRO: Memory Optimizations Toward Training Trillion Parameter ModelsCode1
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment ClassificationCode0
Bridging the domain gap in cross-lingual document classificationCode0
Multilingual Distributed Representations without Word AlignmentCode0
BilBOWA: Fast Bilingual Distributed Representations without Word AlignmentsCode0
Learning Crosslingual Word Embeddings without Bilingual CorporaCode0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.05Unverified
2MultiFiT, pseudoAccuracy89.42Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.95Unverified
4BiLSTM (UN)Accuracy74.52Unverified
5BiLSTM (Europarl)Accuracy72.83Unverified
6MultiCCA + CNNAccuracy72.38Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.8Unverified
2MultiFiT, pseudoAccuracy79.1Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.33Unverified
4MultiCCA + CNNAccuracy72.5Unverified
5BiLSTM (UN)Accuracy69.5Unverified
6BiLSTM (Europarl)Accuracy66.65Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy93.32Unverified
2MultiFiT, pseudoAccuracy82.48Unverified
3MultiCCA + CNNAccuracy74.73Unverified
4BiLSTM (UN)Accuracy71.97Unverified
5Massively Multilingual Sentence EmbeddingsAccuracy71.93Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.95Unverified
2MultiFiT, pseudoAccuracy91.62Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy84.78Unverified
4MultiCCA + CNNAccuracy81.2Unverified
5BiLSTM (Europarl)Accuracy71.83Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy89.7Unverified
2MultiFiT, pseudoAccuracy67.83Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy67.78Unverified
4BiLSTM (UN)Accuracy61.42Unverified
5MultiCCA + CNNAccuracy60.8Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy76.02Unverified
2Massively Multilingual Sentence EmbeddingsAccuracy69.43Unverified
3MultiCCA + CNNAccuracy69.38Unverified
4BiLSTM (Europarl)Accuracy60.73Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy69.57Unverified
2MultiCCA + CNNAccuracy67.63Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy60.3Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy92.7Unverified
2Bi+Accuracy88.1Unverified
3biCVM+Accuracy86.2Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy84.4Unverified
2Bi+Accuracy79.2Unverified
3biCVM+Accuracy76.9Unverified
#ModelMetricClaimedVerifiedStatus
1BiLSTM (Europarl)Accuracy75.45Unverified