SOTAVerified

Large Language Model

Papers

Showing 5175 of 6097 papers

TitleStatusHype
Generative Agents: Interactive Simulacra of Human BehaviorCode6
Efficient Memory Management for Large Language Model Serving with PagedAttentionCode6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model SocietyCode6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program SynthesisCode6
FinGPT: Open-Source Financial Large Language ModelsCode6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across LanguagesCode6
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic AlignmentCode5
LAB: Large-Scale Alignment for ChatBotsCode5
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal DecompositionCode5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language ModelsCode5
MEIA: Multimodal Embodied Perception and Interaction in Unknown EnvironmentsCode5
GRUtopia: Dream General Robots in a City at ScaleCode5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training ParadigmsCode5
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPUCode5
Datasets for Large Language Models: A Comprehensive SurveyCode5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPUCode5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement LearningCode5
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music GenerationCode5
Generating Physically Stable and Buildable LEGO Designs from TextCode5
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-TuningCode5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter ExpertsCode5
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM IntegrationCode5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient FinetuningCode5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUsCode5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language ModelCode5
Show:102550
← PrevPage 3 of 244Next →

No leaderboard results yet.