SOTAVerified

Decision Making

Papers

Showing 151200 of 12311 papers

TitleStatusHype
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous DrivingCode2
Aligning Superhuman AI with Human Behavior: Chess as a Model SystemCode2
Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental LearningCode2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language ModelsCode2
Distributional Soft Actor-Critic with Three RefinementsCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
Do As I Can, Not As I Say: Grounding Language in Robotic AffordancesCode2
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement LearningCode2
Doe-1: Closed-Loop Autonomous Driving with Large World ModelCode2
ForecastBench: A Dynamic Benchmark of AI Forecasting CapabilitiesCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
Disentangling Memory and Reasoning Ability in Large Language ModelsCode2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language ModelsCode2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing ModelCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language ModelCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop SimulationCode2
Embodied LLM Agents Learn to Cooperate in Organized TeamsCode2
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First TimeCode2
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character DesignCode2
Agentic Knowledgeable Self-awarenessCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
Adversarial attacks and defenses in explainable artificial intelligence: A surveyCode2
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAXCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
Cross-Prediction-Powered InferenceCode2
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language ModelsCode2
LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language ModelsCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
Cumulative Reasoning with Large Language ModelsCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
LLM-PySC2: Starcraft II learning environment for Large Language ModelsCode2
LVBench: An Extreme Long Video Understanding BenchmarkCode2
Context is Key: A Benchmark for Forecasting with Essential Textual InformationCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing GamesCode2
Concept Bottleneck Language Models For protein designCode2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM AgentsCode2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement LearningCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian SplattingCode2
BEVCar: Camera-Radar Fusion for BEV Map and Object SegmentationCode2
CoD, Towards an Interpretable Medical Agent using Chain of DiagnosisCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
ADAPT: Action-aware Driving Caption TransformerCode2
Show:102550
← PrevPage 4 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified