SOTAVerified

Language Modeling

Papers

Showing 401425 of 14182 papers

TitleStatusHype
Cleaner Pretraining Corpus Curation with Neural Web ScrapingCode3
Slamming: Training a Speech Language Model on One GPU in a DayCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
8-bit Optimizers via Block-wise QuantizationCode3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language AlignmentCode3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-ScalingCode3
LaViDa: A Large Diffusion Language Model for Multimodal UnderstandingCode3
A Systematic Evaluation of Large Language Models of CodeCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
Large Language Model based Long-tail Query Rewriting in Taobao SearchCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language ModelCode3
Language Models are Few-Shot LearnersCode3
Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective TasksCode3
Language Model InversionCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RLCode3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality DataCode3
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback LearningCode3
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive SurveyCode3
A Comprehensive Survey on Long Context Language ModelingCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue DataCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
Llemma: An Open Language Model For MathematicsCode3
Show:102550
← PrevPage 17 of 568Next →

No leaderboard results yet.