SOTAVerified

Large Language Model

Papers

Showing 101150 of 6097 papers

TitleStatusHype
A-MEM: Agentic Memory for LLM AgentsCode4
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
Liquid: Language Models are Scalable Multi-modal GeneratorsCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative RefinementCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction DataCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
Large Language Model-Based Agents for Software Engineering: A SurveyCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world EnvironmentsCode4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data EngineCode4
SEED-Story: Multimodal Long Story Generation with Large Language ModelCode4
YuLan: An Open-source Large Language ModelCode4
AgentGym: Evolving Large Language Model-based Agents across Diverse EnvironmentsCode4
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language ModelsCode4
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model SeriesCode4
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression ToolkitCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM ServingCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual TokensCode4
AutoWebGLM: A Large Language Model-based Web Navigating AgentCode4
A Survey on Large Language Model-Based Game AgentsCode4
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task AdaptationCode4
Tower: An Open Multilingual Large Language Model for Translation-Related TasksCode4
LLM Inference Unveiled: Survey and Roofline Model InsightsCode4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation GenerationCode4
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-stepCode4
AnyGPT: Unified Multimodal LLM with Discrete Sequence ModelingCode4
Generative Representational Instruction TuningCode4
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language ModelCode4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language ModelCode4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel ObjectsCode4
Video-LLaVA: Learning United Visual Representation by Alignment Before ProjectionCode4
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language ModelsCode4
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality CollaborationCode4
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer ModelsCode4
Safurai 001: New Qualitative Approach for Code LLM EvaluationCode4
A Survey on Large Language Model based Autonomous AgentsCode4
ChatHaruhi: Reviving Anime Character in Reality via Large Language ModelCode4
LISA: Reasoning Segmentation via Large Language ModelCode4
How is ChatGPT's behavior changing over time?Code4
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank AdaptationCode4
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric TasksCode4
Phoenix: Democratizing ChatGPT across LanguagesCode4
Show:102550
← PrevPage 3 of 122Next →

No leaderboard results yet.