SOTAVerified

Decision Making

Papers

Showing 226–250 of 12311 papers

Title | Status | Hype
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Code | 1
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Code | 1
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting | Code | 1
K^2VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting | Code | 1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation | Code | 1
Structured Reinforcement Learning for Combinatorial Decision-Making | Code | 1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management | Code | 1
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | Code | 1
From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making | Code | 1
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks | Code | 1
SmartPilot: A Multiagent CoPilot for Adaptive and Intelligent Manufacturing | Code | 1
VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning | Code | 1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Code | 1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Code | 1
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning | Code | 1
Urban Computing in the Era of Large Language Models | Code | 1
Language Guided Concept Bottleneck Models for Interpretable Continual Learning | Code | 1
A friendly introduction to triangular transport | Code | 1
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Code | 1
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms | Code | 1
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning | Code | 1
On Generalization Across Environments In Multi-Objective Reinforcement Learning | Code | 1
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer | Code | 1
CryptoPulse: Short-Term Cryptocurrency Forecasting with Dual-Prediction and Cross-Correlated Market Indicators | Code | 1
Training a Generally Curious Agent | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified