SOTAVerified

Decision Making

Papers

Showing 101125 of 12311 papers

TitleStatusHype
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous DrivingCode2
Doe-1: Closed-Loop Autonomous Driving with Large World ModelCode2
Do As I Can, Not As I Say: Grounding Language in Robotic AffordancesCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing ModelCode2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop SimulationCode2
Distributional Soft Actor-Critic with Three RefinementsCode2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language ModelsCode2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language ModelCode2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian SplattingCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language ModelsCode2
Digital Player: Evaluating Large Language Models based Human-like Agent in GamesCode2
Disentangling Memory and Reasoning Ability in Large Language ModelsCode2
Adversarial attacks and defenses in explainable artificial intelligence: A surveyCode2
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement LearningCode2
Dungeons and Data: A Large-Scale NetHack DatasetCode2
HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in PythonCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
Improving Causal Reasoning in Large Language Models: A SurveyCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Jack of All Trades, Master of Some, a Multi-Purpose Transformer AgentCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
Show:102550
← PrevPage 5 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified