SOTAVerified

Language Modeling

Papers

Showing 151200 of 14182 papers

TitleStatusHype
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language ModelsCode4
R1-Onevision:An Open-Source Multimodal Large Language Model Capable of Deep ReasoningCode4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought TemplatesCode4
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLMCode4
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth ApproachCode4
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video UnderstandingCode4
Training Software Engineering Agents and Verifiers with SWE-GymCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
SepLLM: Accelerate Large Language Models by Compressing One Segment into One SeparatorCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual RepresentationCode4
MutaPLM: Protein Language Modeling for Mutation Explanation and EngineeringCode4
SNAC: Multi-Scale Neural Audio CodecCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
Large Language Model-Based Agents for Software Engineering: A SurveyCode4
OLMoE: Open Mixture-of-Experts Language ModelsCode4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree SearchCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
The Llama 3 Herd of ModelsCode4
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world EnvironmentsCode4
SEED-Story: Multimodal Long Story Generation with Large Language ModelCode4
YuLan: An Open-source Large Language ModelCode4
RaTEScore: A Metric for Radiology Report GenerationCode4
Long Context Transfer from Language to VisionCode4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsCode4
Simple and Effective Masked Diffusion Language ModelsCode4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingCode4
AgentGym: Evolving Large Language Model-based Agents across Diverse EnvironmentsCode4
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language ModelsCode4
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model SeriesCode4
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression ToolkitCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
Self-Play Preference Optimization for Language Model AlignmentCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language ModelsCode4
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attentionCode4
AutoWebGLM: A Large Language Model-based Web Navigating AgentCode4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual TokensCode4
Sailor: Open Language Models for South-East AsiaCode4
A Survey on Large Language Model-Based Game AgentsCode4
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical TextCode4
RewardBench: Evaluating Reward Models for Language ModelingCode4
UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised PretrainingCode4
Tower: An Open Multilingual Large Language Model for Translation-Related TasksCode4
Show:102550
← PrevPage 4 of 284Next →

No leaderboard results yet.