SOTAVerified

Multilingual NLP

Papers

Showing 2650 of 96 papers

TitleStatusHype
Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili0
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models0
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data SetsCode0
Self-Augmented In-Context Learning for Unsupervised Word TranslationCode0
What is "Typological Diversity" in NLP?Code0
Patterns of Persistence and Diffusibility across the World's Languages0
Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages0
Multi-teacher Distillation for Multilingual Spelling Correction0
On Bilingual Lexicon Induction with Large Language ModelsCode1
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems0
BaitBuster-Bangla: A Comprehensive Dataset for Clickbait Detection in Bangla with Multi-Feature and Multi-Modal AnalysisCode0
Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT EmbeddingsCode0
Zero-shot Cross-lingual Transfer without Parallel Corpus0
Self-Augmentation Improves Zero-Shot Cross-Lingual TransferCode0
Monolingual or Multilingual Instruction Tuning: Which Makes a Better AlpacaCode0
Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification GraphsCode0
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented LanguagesCode1
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in IndiaCode0
A General-Purpose Multilingual Document EncoderCode0
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning0
PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document GenerationCode0
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelCode4
Improving Bilingual Lexicon Induction with Cross-Encoder RerankingCode1
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis ResponseCode0
How Do Multilingual Encoders Learn Cross-lingual Representation?0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.