SOTAVerified

Large Language Model

Papers

Showing 151–200 of 6097 papers

Title | Status | Hype
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Code | 4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Code | 4
Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference | Code | 4
Galactica: A Large Language Model for Science | Code | 4
Fast Transformer Decoding: One Write-Head is All You Need | Code | 4
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Code | 3
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation | Code | 3
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Code | 3
A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Code | 3
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | Code | 3
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models | Code | 3
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Code | 3
Evaluation Report on MCP Servers | Code | 3
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Code | 3
A Survey on the Optimization of Large Language Model-based Agents | Code | 3
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Code | 3
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Code | 3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Code | 3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Code | 3
Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction | Code | 3
Prompt-to-Leaderboard | Code | 3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks | Code | 3
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Code | 3
Multi-agent Architecture Search via Agentic Supernet | Code | 3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Code | 3
Partially Rewriting a Transformer in Natural Language | Code | 3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Code | 3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Code | 3
Lifelong Learning of Large Language Model based Agents: A Roadmap | Code | 3
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Code | 3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Code | 3
A Survey on Large Language Model Acceleration based on KV Cache Management | Code | 3
DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Code | 3
ATPrompt: Textual Prompt Learning with Embedded Attributes | Code | 3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Code | 3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Code | 3
Large Language Model-Brained GUI Agents: A Survey | Code | 3
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem | Code | 3
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment | Code | 3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model | Code | 3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Code | 3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
Baichuan-Omni Technical Report | Code | 3
Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond | Code | 3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Code | 3
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale | Code | 3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Code | 3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Code | 3
Odyssey: Empowering Minecraft Agents with Open-World Skills | Code | 3

No leaderboard results yet.