SOTAVerified

Cross-Lingual Document Classification

Cross-lingual document classification refers to the task of using data and models available for one language for which ample such resources are available (e.g., English) to solve classification tasks in another, commonly low-resource, language.

Papers

Showing 125 of 25 papers

TitleStatusHype
A Corpus for Multilingual Document Classification in Eight LanguagesCode1
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and BeyondCode1
MultiFiT: Efficient Multi-lingual Language Model Fine-tuningCode1
Multilingual and cross-lingual document classification: A meta-learning approachCode1
ZeRO: Memory Optimizations Toward Training Trillion Parameter ModelsCode1
Multilingual Seq2seq Training with Similarity Loss for Cross-Lingual Document Classification0
NMT-based Cross-lingual Document Embeddings0
Learning Cross-lingual Word Embeddings via Matrix Co-factorization0
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification0
A Multiplicative Model for Learning Distributed Text-Based Attribute Representations0
A Multi-task Approach to Learning Multilingual Representations0
Learning Monolingual Compositional Representations via Bilingual Supervision0
Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling0
Variational learning across domains with triplet information0
Variational learning across domains with triplet information0
Wasserstein distances for evaluating cross-lingual embeddings0
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus0
Learning Cross-lingual Representations with Matrix Factorization0
BilBOWA: Fast Bilingual Distributed Representations without Word AlignmentsCode0
Learning Crosslingual Word Embeddings without Bilingual CorporaCode0
Multilingual Distributed Representations without Word AlignmentCode0
Multilingual Models for Compositional Distributed SemanticsCode0
Robust Cross-lingual Embeddings from Parallel SentencesCode0
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment ClassificationCode0
Bridging the domain gap in cross-lingual document classificationCode0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.05Unverified
2MultiFiT, pseudoAccuracy89.42Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.95Unverified
4BiLSTM (UN)Accuracy74.52Unverified
5BiLSTM (Europarl)Accuracy72.83Unverified
6MultiCCA + CNNAccuracy72.38Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.8Unverified
2MultiFiT, pseudoAccuracy79.1Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy77.33Unverified
4MultiCCA + CNNAccuracy72.5Unverified
5BiLSTM (UN)Accuracy69.5Unverified
6BiLSTM (Europarl)Accuracy66.65Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy93.32Unverified
2MultiFiT, pseudoAccuracy82.48Unverified
3MultiCCA + CNNAccuracy74.73Unverified
4BiLSTM (UN)Accuracy71.97Unverified
5Massively Multilingual Sentence EmbeddingsAccuracy71.93Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy96.95Unverified
2MultiFiT, pseudoAccuracy91.62Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy84.78Unverified
4MultiCCA + CNNAccuracy81.2Unverified
5BiLSTM (Europarl)Accuracy71.83Unverified
#ModelMetricClaimedVerifiedStatus
1XLMft UDAAccuracy89.7Unverified
2MultiFiT, pseudoAccuracy67.83Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy67.78Unverified
4BiLSTM (UN)Accuracy61.42Unverified
5MultiCCA + CNNAccuracy60.8Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy76.02Unverified
2Massively Multilingual Sentence EmbeddingsAccuracy69.43Unverified
3MultiCCA + CNNAccuracy69.38Unverified
4BiLSTM (Europarl)Accuracy60.73Unverified
#ModelMetricClaimedVerifiedStatus
1MultiFiT, pseudoAccuracy69.57Unverified
2MultiCCA + CNNAccuracy67.63Unverified
3Massively Multilingual Sentence EmbeddingsAccuracy60.3Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy92.7Unverified
2Bi+Accuracy88.1Unverified
3biCVM+Accuracy86.2Unverified
#ModelMetricClaimedVerifiedStatus
1Biinclusion (Euro500kReuters)Accuracy84.4Unverified
2Bi+Accuracy79.2Unverified
3biCVM+Accuracy76.9Unverified
#ModelMetricClaimedVerifiedStatus
1BiLSTM (Europarl)Accuracy75.45Unverified