SOTAVerified

Large Language Model

Papers

Showing 151-175 of 6097 papers

Title | Status | Hype
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4
ChatHaruhi: Reviving Anime Character in Reality via Large Language Model | Code | 4
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Code | 4
Safurai 001: New Qualitative Approach for Code LLM Evaluation | Code | 4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Code | 4
Multimodal Table Understanding | Code | 3
Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Code | 3
Multi-agent Architecture Search via Agentic Supernet | Code | 3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Code | 3
Evaluation Report on MCP Servers | Code | 3
OceanGPT: A Large Language Model for Ocean Science Tasks | Code | 3
A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Code | 3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Code | 3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Code | 3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Code | 3
Odyssey: Empowering Minecraft Agents with Open-World Skills | Code | 3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Code | 3
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Code | 3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Code | 3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Code | 3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Code | 3
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | Code | 3
Detecting hallucinations in large language models using semantic entropy | Code | 3

No leaderboard results yet.