SOTAVerified

Language Modeling

Papers

Showing 401-425 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Lifelong Learning of Large Language Model based Agents: A Roadmap | Code | 3 |
| LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Code | 3 |
| Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Code | 3 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Code | 3 |
| Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Code | 3 |
| GLM: General Language Model Pretraining with Autoregressive Blank Infilling | Code | 3 |
| 8-bit Optimizers via Block-wise Quantization | Code | 3 |
| Large Language Model-Brained GUI Agents: A Survey | Code | 3 |
| ContextCite: Attributing Model Generation to Context | Code | 3 |
| SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference | Code | 3 |
| LaViDa: A Large Diffusion Language Model for Multimodal Understanding | Code | 3 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Code | 3 |
| Llemma: An Open Language Model For Mathematics | Code | 3 |
| Language Models are Few-Shot Learners | Code | 3 |
| Cleaner Pretraining Corpus Curation with Neural Web Scraping | Code | 3 |
| Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data | Code | 3 |
| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Code | 3 |
| Language Model Inversion | Code | 3 |
| Agent Workflow Memory | Code | 3 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Code | 3 |
| LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Code | 3 |
| A Comprehensive Survey on Long Context Language Modeling | Code | 3 |
| Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey | Code | 3 |
| Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks | Code | 3 |
| Large Language Model based Long-tail Query Rewriting in Taobao Search | Code | 3 |

No leaderboard results yet.