SOTAVerified

Large Language Model

Papers

Showing 101150 of 6097 papers

TitleStatusHype
LISA: Reasoning Segmentation via Large Language ModelCode4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat DataCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language ModelCode4
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model SeriesCode4
Tower: An Open Multilingual Large Language Model for Translation-Related TasksCode4
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-stepCode4
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task AdaptationCode4
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language ModelsCode4
Large Language Model-Based Agents for Software Engineering: A SurveyCode4
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and UnderstandingCode4
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language ModelsCode4
Seed-Coder: Let the Code Model Curate Data for ItselfCode4
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
AutoWebGLM: A Large Language Model-based Web Navigating AgentCode4
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
AnyGPT: Unified Multimodal LLM with Discrete Sequence ModelingCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank AdaptationCode4
SEED-Story: Multimodal Long Story Generation with Large Language ModelCode4
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsCode4
How is ChatGPT's behavior changing over time?Code4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
R1-Onevision:An Open-Source Multimodal Large Language Model Capable of Deep ReasoningCode4
lmgame-Bench: How Good are LLMs at Playing Games?Code4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation GenerationCode4
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM ServingCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
Phoenix: Democratizing ChatGPT across LanguagesCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction DataCode4
A Survey on Large Language Model based Autonomous AgentsCode4
Generative Representational Instruction TuningCode4
A Survey on Large Language Model-Based Game AgentsCode4
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality CollaborationCode4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel ObjectsCode4
ChatHaruhi: Reviving Anime Character in Reality via Large Language ModelCode4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual TokensCode4
A Survey of LLM DATACode4
AgentGym: Evolving Large Language Model-based Agents across Diverse EnvironmentsCode4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
Galactica: A Large Language Model for ScienceCode4
Fast Transformer Decoding: One Write-Head is All You NeedCode4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain KnowledgeCode4
Cost-Effective Hyperparameter Optimization for Large Language Model Generation InferenceCode4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language ModelCode4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data EngineCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
A-MEM: Agentic Memory for LLM AgentsCode4
Show:102550
← PrevPage 3 of 122Next →

No leaderboard results yet.