SOTAVerified

Decision Making

Papers

Showing 176-200 of 12311 papers

Title | Status | Hype
Diffusion Actor-Critic with Entropy Regulator | Code | 2
Adversarial attacks and defenses in explainable artificial intelligence: A survey | Code | 2
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | Code | 2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Code | 2
Cross-Prediction-Powered Inference | Code | 2
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models | Code | 2
LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language Models | Code | 2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Code | 2
Cumulative Reasoning with Large Language Models | Code | 2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Code | 2
LLM-PySC2: Starcraft II learning environment for Large Language Models | Code | 2
LVBench: An Extreme Long Video Understanding Benchmark | Code | 2
Context is Key: A Benchmark for Forecasting with Essential Textual Information | Code | 2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Code | 2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Code | 2
Concept Bottleneck Language Models For protein design | Code | 2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Code | 2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies | Code | 2
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Code | 2
A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Code | 2
BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Code | 2
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Code | 2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Code | 2
ADAPT: Action-aware Driving Caption Transformer | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified