SOTAVerified

Decision Making

Papers

Showing 101110 of 12311 papers

TitleStatusHype
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Cumulative Reasoning with Large Language ModelsCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
ExpeL: LLM Agents Are Experiential LearnersCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
Disentangling Memory and Reasoning Ability in Large Language ModelsCode2
Agentic Knowledgeable Self-awarenessCode2
Show:102550
← PrevPage 11 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified