SOTAVerified

Multilingual NLP

Papers

Showing 150 of 96 papers

TitleStatusHype
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelCode4
On Bilingual Lexicon Induction with Large Language ModelsCode1
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented LanguagesCode1
Improving Bilingual Lexicon Induction with Cross-Encoder RerankingCode1
DetIE: Multilingual Open Information Extraction Inspired by Object DetectionCode1
Improving Word Translation via Two-Stage Contrastive LearningCode1
Improving Word Translation via Two-Stage Contrastive LearningCode1
WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NERCode1
HONEST: Measuring Hurtful Sentence Completion in Language ModelsCode1
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic LanguagesCode1
Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language ProcessingCode1
fugashi, a Tool for Tokenizing Japanese in PythonCode1
Language-agnostic BERT Sentence EmbeddingCode1
Simultaneous Translation and Paraphrase for Language EducationCode1
PMIndia -- A Collection of Parallel Corpora of Languages of IndiaCode1
Unsupervised Cross-lingual Representation Learning at ScaleCode1
Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks0
SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods0
LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization0
Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities0
Cross-Linguistic Transfer in Multilingual NLP: The Role of Language Families and Morphology0
HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing0
Multilingual Prompt Engineering in Large Language Models: A Survey Across NLP Tasks0
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting0
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models0
Code-Mixed Telugu-English Hate Speech Detection0
How does a Multilingual LM Handle Multiple Languages?0
How Good is Your Wikipedia? Auditing Data Quality for Low-resource and Multilingual NLP0
Don't Touch My Diacritics0
SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval0
Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities0
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?0
Dravidian language family through Universal Dependencies lens0
Leveraging Large Language Models to Measure Gender Representation Bias in Gendered Language Corpora0
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News RecommendationCode0
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy ModelsCode0
Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks0
A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base0
Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis ModelsCode0
UQA: Corpus for Urdu Question AnsweringCode0
What Drives Performance in Multilingual Language Models?Code0
Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili0
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models0
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data SetsCode0
Self-Augmented In-Context Learning for Unsupervised Word TranslationCode0
What is "Typological Diversity" in NLP?Code0
Patterns of Persistence and Diffusibility across the World's Languages0
Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages0
Multi-teacher Distillation for Multilingual Spelling Correction0
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.