SOTAVerified

Large Language Model

Papers

Showing 101-125 of 6097 papers

Title | Status | Hype
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Code | 4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Code | 4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Code | 4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Code | 4
A Survey of LLM × DATA | Code | 4
A Survey on Large Language Model based Autonomous Agents | Code | 4
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration | Code | 4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Code | 4
Generative Representational Instruction Tuning | Code | 4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | Code | 4
A Survey on Large Language Model-Based Game Agents | Code | 4
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | Code | 4
Galactica: A Large Language Model for Science | Code | 4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling | Code | 4
Phoenix: Democratizing ChatGPT across Languages | Code | 4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4
Fast Transformer Decoding: One Write-Head is All You Need | Code | 4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Code | 4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | Code | 4
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Code | 4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4
lmgame-Bench: How Good are LLMs at Playing Games? | Code | 4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Code | 4
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models | Code | 4