SOTAVerified

Multilingual NLP

Papers

Showing 150 of 96 papers

TitleStatusHype
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelCode4
Improving Word Translation via Two-Stage Contrastive LearningCode1
Improving Bilingual Lexicon Induction with Cross-Encoder RerankingCode1
WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NERCode1
HONEST: Measuring Hurtful Sentence Completion in Language ModelsCode1
Unsupervised Cross-lingual Representation Learning at ScaleCode1
fugashi, a Tool for Tokenizing Japanese in PythonCode1
DetIE: Multilingual Open Information Extraction Inspired by Object DetectionCode1
Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language ProcessingCode1
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented LanguagesCode1
Simultaneous Translation and Paraphrase for Language EducationCode1
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic LanguagesCode1
PMIndia -- A Collection of Parallel Corpora of Languages of IndiaCode1
On Bilingual Lexicon Induction with Large Language ModelsCode1
Improving Word Translation via Two-Stage Contrastive LearningCode1
Language-agnostic BERT Sentence EmbeddingCode1
TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLPCode0
A General-Purpose Multilingual Document EncoderCode0
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data SetsCode0
Analysing The Impact Of Linguistic Features On Cross-Lingual TransferCode0
Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis ModelsCode0
BaitBuster-Bangla: A Comprehensive Dataset for Clickbait Detection in Bangla with Multi-Feature and Multi-Modal AnalysisCode0
Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification GraphsCode0
Cultural and Geographical Influences on Image Translatability of Words across LanguagesCode0
Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT EmbeddingsCode0
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis ResponseCode0
Improving Cross-Lingual Word Embeddings by Meeting in the MiddleCode0
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology AnalysisCode0
MMCR4NLP: Multilingual Multiway Corpora Repository for Natural Language ProcessingCode0
Monolingual or Multilingual Instruction Tuning: Which Makes a Better AlpacaCode0
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News RecommendationCode0
PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document GenerationCode0
PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in IndiaCode0
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy ModelsCode0
Self-Augmentation Improves Zero-Shot Cross-Lingual TransferCode0
Self-Augmented In-Context Learning for Unsupervised Word TranslationCode0
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual EvaluationCode0
SICK-NL: A Dataset for Dutch Natural Language InferenceCode0
SICKNL: A Dataset for Dutch Natural Language InferenceCode0
UQA: Corpus for Urdu Question AnsweringCode0
What Drives Performance in Multilingual Language Models?Code0
What is "Typological Diversity" in NLP?Code0
XeroAlign: Zero-Shot Cross-lingual Transformer AlignmentCode0
I’ve got a construction looks funny – representing and recovering non-standard constructions in UD0
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools0
LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization0
Challenges and Strategies in Cross-Cultural NLP0
Leveraging Large Language Models to Measure Gender Representation Bias in Gendered Language Corpora0
A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base0
Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.