SOTAVerified

Language Modeling

Papers

Showing 151175 of 14182 papers

TitleStatusHype
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language ModelsCode4
R1-Onevision:An Open-Source Multimodal Large Language Model Capable of Deep ReasoningCode4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought TemplatesCode4
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLMCode4
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth ApproachCode4
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video UnderstandingCode4
Training Software Engineering Agents and Verifiers with SWE-GymCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
SepLLM: Accelerate Large Language Models by Compressing One Segment into One SeparatorCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual RepresentationCode4
MutaPLM: Protein Language Modeling for Mutation Explanation and EngineeringCode4
SNAC: Multi-Scale Neural Audio CodecCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
Large Language Model-Based Agents for Software Engineering: A SurveyCode4
OLMoE: Open Mixture-of-Experts Language ModelsCode4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree SearchCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
The Llama 3 Herd of ModelsCode4
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world EnvironmentsCode4
Show:102550
← PrevPage 7 of 568Next →

No leaderboard results yet.