SOTAVerified

Large Language Model

Papers

Showing 51-100 of 6097 papers

Title | Status | Hype
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Code | 6
FinGPT: Open-Source Financial Large Language Models | Code | 6
Gorilla: Large Language Model Connected with Massive APIs | Code | 6
Generative Agents: Interactive Simulacra of Human Behavior | Code | 6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Code | 6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Code | 6
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing | Code | 5
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Code | 5
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Code | 5
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining | Code | 5
Generating Physically Stable and Buildable LEGO Designs from Text | Code | 5
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Code | 5
4th PVUW MeViS 3rd Place Report: Sa2VA | Code | 5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Code | 5
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Code | 5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Code | 5
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration | Code | 5
GRUtopia: Dream General Robots in a City at Scale | Code | 5
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Code | 5
RLHF Workflow: From Reward Modeling to Online RLHF | Code | 5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Code | 5
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Code | 5
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Code | 5
LAB: Large-Scale Alignment for ChatBots | Code | 5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Code | 5
Retrieval-Augmented Generation for AI-Generated Content: A Survey | Code | 5
Datasets for Large Language Models: A Comprehensive Survey | Code | 5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Code | 5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Code | 5
Executable Code Actions Elicit Better LLM Agents | Code | 5
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Code | 5
Large Language Model based Multi-Agents: A Survey of Progress and Challenges | Code | 5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding | Code | 5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Code | 5
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects | Code | 5
StarVector: Generating Scalable Vector Graphics Code from Images and Text | Code | 5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Code | 5
Weakly Supervised Detection of Hallucinations in LLM Activations | Code | 5
Ferret: Refer and Ground Anything Anywhere at Any Granularity | Code | 5
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Code | 5
The Rise and Potential of Large Language Model Based Agents: A Survey | Code | 5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Code | 5
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Code | 5
Seed-Coder: Let the Code Model Curate Data for Itself | Code | 4
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding | Code | 4
A Survey of LLM DATA | Code | 4
lmgame-Bench: How Good are LLMs at Playing Games? | Code | 4
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model | Code | 4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4
R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Code | 4
