SOTAVerified

Large Language Model

Papers

Showing 201250 of 6097 papers

TitleStatusHype
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsCode3
4D Panoptic Scene Graph GenerationCode3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at ScaleCode3
Baichuan-Omni Technical ReportCode3
Evalverse: Unified and Accessible Library for Large Language Model EvaluationCode3
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed TomographyCode3
Baichuan-Audio: A Unified Framework for End-to-End Speech InteractionCode3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent SystemsCode3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in PythonCode3
Odyssey: Empowering Minecraft Agents with Open-World SkillsCode3
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive ReasoningCode3
Evaluation Report on MCP ServersCode3
OceanGPT: A Large Language Model for Ocean Science TasksCode3
OpenGraph: Towards Open Graph Foundation ModelsCode3
Partially Rewriting a Transformer in Natural LanguageCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language ModelsCode3
Chat-Edit-3D: Interactive 3D Scene Editing via Text PromptsCode3
GNN-RAG: Graph Neural Retrieval for Large Language Model ReasoningCode3
MoMA: Multimodal LLM Adapter for Fast Personalized Image GenerationCode3
Multi-agent Architecture Search via Agentic SupernetCode3
ATPrompt: Textual Prompt Learning with Embedded AttributesCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback LearningCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
CHESS: Contextual Harnessing for Efficient SQL SynthesisCode3
A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray InterpretationCode3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-AgentsCode3
Multimodal Table UnderstandingCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration TestingCode3
Detecting hallucinations in large language models using semantic entropyCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
A Smart Multimodal Healthcare Copilot with Powerful LLM ReasoningCode3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive MemoryCode3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare CopilotCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsCode3
Llemma: An Open Language Model For MathematicsCode3
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient InferenceCode3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingCode3
DARWIN 1.5: Large Language Models as Materials Science Adapted LearnersCode3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API CallsCode3
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve ClassificationCode3
GroundingGPT:Language Enhanced Multi-modal Grounding ModelCode3
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language ModelCode3
CRAG -- Comprehensive RAG BenchmarkCode3
Show:102550
← PrevPage 5 of 122Next →

No leaderboard results yet.