SOTAVerified

Decision Making

Papers

Showing 201250 of 12311 papers

TitleStatusHype
DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change DetectionCode2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing ModelCode2
Neuro-Nav: A Library for Neurally-Plausible Reinforcement LearningCode2
OmniXAI: A Library for Explainable AICode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
A Review of Safe Reinforcement Learning: Methods, Theory and ApplicationsCode2
Towards Explanation for Unsupervised Graph-Level Representation LearningCode2
Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing FlowsCode2
Do As I Can, Not As I Say: Grounding Language in Robotic AffordancesCode2
OMLT: Optimization & Machine Learning ToolkitCode2
Pre-Trained Language Models for Interactive Decision-MakingCode2
NeuralProphet: Explainable Forecasting at ScaleCode2
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement LearningCode2
Polis: Scaling Deliberation by Mapping High Dimensional Opinion SpacesCode2
Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene ImagesCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
SoccerMap: A Deep Learning Architecture for Visually-Interpretable Analysis in SoccerCode2
Aligning Superhuman AI with Human Behavior: Chess as a Model SystemCode2
Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing FlowsCode2
Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading StrategiesCode2
Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersCode2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning BehaviorCode1
Diffusion-Based Electrocardiography Noise Quantification via Anomaly DetectionCode1
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement LearningCode1
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data SynthesisCode1
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and ActingCode1
K^2VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series ForecastingCode1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone NavigationCode1
Structured Reinforcement Learning for Combinatorial Decision-MakingCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision MakingCode1
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
SmartPilot: A Multiagent CoPilot for Adaptive and Intelligent ManufacturingCode1
VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction TuningCode1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model FrameworkCode1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical ReasoningCode1
Urban Computing in the Era of Large Language ModelsCode1
Language Guided Concept Bottleneck Models for Interpretable Continual LearningCode1
A friendly introduction to triangular transportCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape RoomsCode1
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM PlanningCode1
On Generalization Across Environments In Multi-Objective Reinforcement LearningCode1
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired TransformerCode1
CryptoPulse: Short-Term Cryptocurrency Forecasting with Dual-Prediction and Cross-Correlated Market IndicatorsCode1
Training a Generally Curious AgentCode1
Show:102550
← PrevPage 5 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified