SOTAVerified

Multilingual NLP

Papers

Showing 150 of 96 papers

TitleStatusHype
Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks0
SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods0
LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization0
Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities0
HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing0
Cross-Linguistic Transfer in Multilingual NLP: The Role of Language Families and Morphology0
Multilingual Prompt Engineering in Large Language Models: A Survey Across NLP Tasks0
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting0
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models0
Code-Mixed Telugu-English Hate Speech Detection0
How does a Multilingual LM Handle Multiple Languages?0
How Good is Your Wikipedia? Auditing Data Quality for Low-resource and Multilingual NLP0
Don't Touch My Diacritics0
SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval0
Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities0
Dravidian language family through Universal Dependencies lens0
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?0
Leveraging Large Language Models to Measure Gender Representation Bias in Gendered Language Corpora0
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News RecommendationCode0
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy ModelsCode0
Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks0
A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base0
Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis ModelsCode0
UQA: Corpus for Urdu Question AnsweringCode0
What Drives Performance in Multilingual Language Models?Code0
Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili0
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models0
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data SetsCode0
Self-Augmented In-Context Learning for Unsupervised Word TranslationCode0
What is "Typological Diversity" in NLP?Code0
Patterns of Persistence and Diffusibility across the World's Languages0
Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages0
Multi-teacher Distillation for Multilingual Spelling Correction0
On Bilingual Lexicon Induction with Large Language ModelsCode1
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems0
BaitBuster-Bangla: A Comprehensive Dataset for Clickbait Detection in Bangla with Multi-Feature and Multi-Modal AnalysisCode0
Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT EmbeddingsCode0
Zero-shot Cross-lingual Transfer without Parallel Corpus0
Self-Augmentation Improves Zero-Shot Cross-Lingual TransferCode0
Monolingual or Multilingual Instruction Tuning: Which Makes a Better AlpacaCode0
Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification GraphsCode0
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented LanguagesCode1
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in IndiaCode0
A General-Purpose Multilingual Document EncoderCode0
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning0
PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document GenerationCode0
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelCode4
Improving Bilingual Lexicon Induction with Cross-Encoder RerankingCode1
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis ResponseCode0
How Do Multilingual Encoders Learn Cross-lingual Representation?0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.