SOTAVerified

Sequential Decision Making

Papers

Showing 801850 of 1210 papers

TitleStatusHype
Loss Functions for Multiset Prediction0
Low-rank finetuning for LLMs: A fairness perspective0
Lyapunov-based uncertainty-aware safe reinforcement learning0
Macro-action Multi-time scale Dynamic Programming for Energy Management in Buildings with Phase Change Materials0
Managing Risk using Rolling Forecasts in Energy-Limited and Stochastic Energy Systems0
Marginal and Joint Cross-Entropies & Predictives for Online Bayesian Inference, Active Learning, and Active Sampling0
Market Making without Regret0
Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow0
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery0
Maximizing Ensemble Diversity in Deep Reinforcement Learning0
Maximizing utility in multi-agent environments by anticipating the behavior of other learners0
Maximum Entropy Monte-Carlo Planning0
MDPFuzz: Testing Models Solving Markov Decision Processes0
MDP Geometry, Normalization and Reward Balancing Solvers0
MDPs with a State Sensing Cost0
Meta Clustering of Neural Bandits0
Meta-Learning for Multi-objective Reinforcement Learning0
Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners0
Meta-Learning Hypothesis Spaces for Sequential Decision-making0
Metalearning Linear Bandits by Prior Update0
Meta-Learning Reliable Priors in the Function Space0
Meta-Learning surrogate models for sequential decision making0
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations0
Meta-Prompt Optimization for LLM-Based Sequential Decision Making0
MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization0
MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning0
Minimax-optimal trust-aware multi-armed bandits0
Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making0
Modality-Buffet for Real-Time Object Detection0
Model Adaptation for Time Constrained Embodied Control0
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey0
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models0
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects0
Model-based Reinforcement Learning: A Survey0
Model Based Reinforcement Learning for Personalized Heparin Dosing0
Model-based trajectory stitching for improved behavioural cloning and its applications0
Model-Free Control of Thermostatically Controlled Loads Connected to a District Heating Network0
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games0
Modeling driver's evasive behavior during safety-critical lane changes:Two-dimensional time-to-collision and deep reinforcement learning0
Modeling Human Driving Behavior through Generative Adversarial Imitation Learning0
Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing0
Monte Carlo Sampling for Regret Minimization in Extensive Games0
Monte-Carlo Tree Search for Constrained POMDPs0
MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories0
Movement Penalized Bayesian Optimization with Application to Wind Energy Systems0
MPC-Inspired Neural Network Policies for Sequential Decision Making0
Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP0
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms0
Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs0
Multi-armed Bandits: Competing with Optimal Sequences0
Show:102550
← PrevPage 17 of 25Next →

No leaderboard results yet.