SOTAVerified

Sequential Decision Making

Papers

Showing 801850 of 1210 papers

TitleStatusHype
The Medkit-Learn(ing) Environment: Medical Decision Modelling through SimulationCode1
Curriculum Design for Teaching via Demonstrations: Theory and ApplicationsCode0
Meta-Learning Reliable Priors in the Function Space0
Heuristic-Guided Reinforcement Learning0
Be Considerate: Objectives, Side Effects, and Deciding How to Act0
Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning0
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning0
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning0
Robust optimal policies for team Markov games0
Bandit based centralized matching in two-sided markets for peer to peer lending0
Data-Efficient Reinforcement Learning for Malaria Control0
Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization0
Statistical Inference with M-Estimators on Adaptively Collected Data0
Universal Off-Policy EvaluationCode0
Reinforcement Learning using Guided Observability0
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning0
Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images0
Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning0
Ecole: A Library for Learning Inside MILP SolversCode0
Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints0
Online Convex Optimization with Continuous Switching Constraint0
Forward and Backward Bellman equations improve the efficiency of EM algorithm for DEC-POMDP0
Situated Language Learning via Interactive Narratives0
Encrypted Linear Contextual Bandit0
Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners0
Optimal sequential decision making with probabilistic digital twins0
Automatic Goal Generation using Dynamical Distance Learning0
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games0
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games0
Adversarial Environment Generation for Learning to Navigate the WebCode0
Batched Neural Bandits0
Hyperparameter Transfer Learning with Adaptive Complexity0
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and modelCode1
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning0
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models0
Causal Markov Decision Processes: Learning Good Interventions Efficiently0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form GamesCode0
Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Patterns, predictions, and actions: A story about machine learning0
An Analysis of Frame-skipping in Reinforcement Learning0
MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management0
Improving Human Decision-Making by Discovering Efficient Strategies for Hierarchical Planning0
Reinforcement Learning for Freight Booking Control Problems0
CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation0
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors0
High-Confidence Off-Policy (or Counterfactual) Variance Estimation0
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning0
An empirical evaluation of active inference in multi-armed banditsCode1
Show:102550
← PrevPage 17 of 25Next →

No leaderboard results yet.