SOTAVerified

Language Modeling

Papers

Showing 251300 of 14182 papers

TitleStatusHype
A Survey on the Optimization of Large Language Model-based AgentsCode3
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model CompressionCode3
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and EditingCode3
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action AlignmentCode3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent SystemsCode3
A Phylogenetic Approach to Genomic Language ModelingCode3
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language ModelsCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
Baichuan-Audio: A Unified Framework for End-to-End Speech InteractionCode3
Prompt-to-LeaderboardCode3
Slamming: Training a Speech Language Model on One GPU in a DayCode3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccurayCode3
Ola: Pushing the Frontiers of Omni-Modal Language ModelCode3
Multi-agent Architecture Search via Agentic SupernetCode3
Partially Rewriting a Transformer in Natural LanguageCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling CapabilitiesCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
In-situ graph reasoning and knowledge expansion using Graph-PReFLexORCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
Valley2: Exploring Multimodal Models with Scalable Vision-Language DesignCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
YuLan-Mini: An Open Data-efficient Language ModelCode3
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive SurveyCode3
Embodied CoT Distillation From LLM To Off-the-shelf AgentsCode3
BatchTopK Sparse AutoencodersCode3
PaliGemma 2: A Family of Versatile VLMs for TransferCode3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based AgentsCode3
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue DataCode3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration TestingCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
On the Efficiency of NLP-Inspired Methods for Tabular Deep LearningCode3
Pushing the Limits of Large Language Model Quantization via the Linearity TheoremCode3
BayLing 2: A Multilingual Large Language Model with Efficient Language AlignmentCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
The Surprising Effectiveness of Test-Time Training for Few-Shot LearningCode3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
Rule Based Rewards for Language Model SafetyCode3
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software ImprovementCode3
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse AutoencodersCode3
Centaur: a foundation model of human cognitionCode3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingCode3
Scaling up Masked Diffusion Models on TextCode3
Scaling Diffusion Language Models via Adaptation from Autoregressive ModelsCode3
DPLM-2: A Multimodal Diffusion Protein Language ModelCode3
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic ThinkingCode3
Predicting from Strings: Language Model Embeddings for Bayesian OptimizationCode3
Baichuan-Omni Technical ReportCode3
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model InferenceCode3
Show:102550
← PrevPage 6 of 284Next →

No leaderboard results yet.