SOTAVerified

Sequential Decision Making

Papers

Showing 151200 of 1210 papers

TitleStatusHype
TALES: Text Adventure Learning Environment Suite0
Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs0
Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent DemandCode0
Truncated Matrix Completion - An Empirical Study0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
A Framework of decision-relevant observability: Reinforcement Learning converges under relative ignorability0
RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models0
Deep Reinforcement Learning Algorithms for Option HedgingCode0
A Classification View on Meta Learning Bandits0
From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making0
MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories0
Counterfactual Inference under Thompson Sampling0
Towards Enabling Learning for Time-Varying finite horizon Sequential Decision-Making Problems*0
Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding0
Remember, but also, Forget: Bridging Myopic and Perfect Recall Fairness with Past-Discounting0
Towards Trustworthy GUI Agents: A SurveyCode0
Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game ApproachCode0
Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining0
Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot NavigationCode0
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets0
Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian FrameworkCode0
Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes0
Depth Matters: Multimodal RGB-D Perception for Robust Autonomous AgentsCode0
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal ControlCode0
Quantization-Free Autoregressive Action TransformerCode0
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach0
Zero-Shot Action Generalization with Limited Observations0
Locally Private Nonparametric Contextual Multi-armed BanditsCode0
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference0
GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks0
Bayesian Graph Traversal0
Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning0
Shaping Laser Pulses with Reinforcement Learning0
Semi-Parametric Batched Global Multi-Armed Bandits with Covariates0
Scalable Decision-Making in Stochastic Environments through Learned Temporal AbstractionCode0
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management StrategiesCode0
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningCode0
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning0
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Making Universal Policies UniversalCode0
Value Gradient Sampler: Sampling as Sequential Decision MakingCode0
Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel DecodingCode0
Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control0
Self-Evaluation for Job-Shop Scheduling0
Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning0
A Survey on Explainable Deep Reinforcement Learning0
Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making0
Online Clustering of Dueling Bandits0
VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation0
Show:102550
← PrevPage 4 of 25Next →

No leaderboard results yet.