SOTAVerified

Language Modeling

Papers

Showing 31263150 of 14182 papers

TitleStatusHype
CTRL: A Conditional Transformer Language Model for Controllable GenerationCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
Towards Evaluating Generalist Agents: An Automated Benchmark in Open WorldCode1
Markovian Transformers for Informative Language ModelingCode1
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test ConstructionCode1
Avoiding Inference Heuristics in Few-shot Prompt-based FinetuningCode1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply ChainsCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
Invariant Language ModelingCode1
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial OptimizationCode1
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language TechnologiesCode1
InvestLM: A Large Language Model for Investment using Financial Domain Instruction TuningCode1
Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model RecommendationCode1
IoT-LM: Large Multisensory Language Models for the Internet of ThingsCode1
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language ModelingCode1
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed LanguageCode1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation TasksCode1
Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question AnsweringCode1
Mapping Memes to Words for Multimodal Hateful Meme ClassificationCode1
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNACode1
Talking-Heads AttentionCode1
MarianCG: a code generation transformer model inspired by machine translationCode1
DALE: Generative Data Augmentation for Low-Resource Legal NLPCode1
TAPEX: Table Pre-training via Learning a Neural SQL ExecutorCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
Show:102550
← PrevPage 126 of 568Next →

No leaderboard results yet.