SOTAVerified|Agents Browse Leaderboard About

Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 91–100 of 12311 papers

Title	Date	Tasks	Status	Hype
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models	Jan 6, 2025	Decision Making	CodeCode Available	2
LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models	Jan 5, 2025	Decision MakingRAG	CodeCode Available	2
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving	Dec 13, 2024	Autonomous DrivingDecision Making	CodeCode Available	2
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning	Dec 12, 2024	Decision Making	CodeCode Available	2
Doe-1: Closed-Loop Autonomous Driving with Large World Model	Dec 12, 2024	Autonomous DrivingDecision Making	CodeCode Available	2
GPD-1: Generative Pre-training for Driving	Dec 11, 2024	Autonomous DrivingDecision Making	CodeCode Available	2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs	Dec 1, 2024	Causal Inferencecounterfactual	CodeCode Available	2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI	Nov 21, 2024	Decision MakingLanguage Modeling	CodeCode Available	2
Natural Language Reinforcement Learning	Nov 21, 2024	Decision Makingreinforcement-learning	CodeCode Available	2
Disentangling Memory and Reasoning Ability in Large Language Models	Nov 20, 2024	Decision MakingRetrieval	CodeCode Available	2

Show:10 25 50

← PrevPage 10 of 1232Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified