SOTAVerified

Language Modeling

Papers

Showing 2951–2975 of 14182 papers

Title | Status | Hype
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Code | 1
A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems | Code | 1
Hierarchical Transformers Are More Efficient Language Models | Code | 1
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Code | 1
Automatic Controllable Product Copywriting for E-Commerce | Code | 1
Dealing with Typos for BERT-based Passage Retrieval and Ranking | Code | 1
AdaSplash: Adaptive Sparse Flash Attention | Code | 1
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Code | 1
Data Augmentation using Pre-trained Transformer Models | Code | 1
Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Code | 1
Scalable-Softmax Is Superior for Attention | Code | 1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Code | 1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Code | 1
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement | Code | 1
Making Language Models Better Tool Learners with Execution Feedback | Code | 1
hmBERT: Historical Multilingual Language Models for Named Entity Recognition | Code | 1
Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Code | 1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Code | 1
Scaling Large Language Model-based Multi-Agent Collaboration | Code | 1
Housekeep: Tidying Virtual Households using Commonsense Reasoning | Code | 1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Code | 1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Code | 1
LLMs Can Simulate Standardized Patients via Agent Coevolution | Code | 1
How Language Model Hallucinations Can Snowball | Code | 1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1
