SOTAVerified

Sequential Decision Making

Papers

Showing 701750 of 1210 papers

TitleStatusHype
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point ProcessesCode0
Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device PlacementCode0
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making0
Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation0
Automated Reinforcement Learning: An Overview0
Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning0
Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems0
State of the Art of User Simulation approaches for conversational information retrieval0
Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing0
Socially-Optimal Mechanism Design for Incentivized Online Learning0
Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions0
A Survey on Interpretable Reinforcement Learning0
Differentially Private Regret Minimization in Episodic Markov Decision ProcessesCode0
Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms0
Application of Deep Reinforcement Learning to Payment Fraud0
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach0
MDPFuzz: Testing Models Solving Markov Decision Processes0
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning0
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
Pessimistic Model Selection for Offline Deep Reinforcement Learning0
Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation0
Neural Column Generation for Capacitated Vehicle Routing0
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability0
Adversarial Deep Learning for Online Resource Allocation0
Deep Reinforcement Learning for Entity Alignment0
Route Optimization via Environment-Aware Deep Network and Reinforcement Learning0
AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive CrossbarsCode0
Automatic Goal Generation using Dynamical Distance Learning0
SOPE: Spectrum of Off-Policy EstimatorsCode0
Regular Decision Processes for Grid Worlds0
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement LearningCode1
Partial-Adaptive Submodular Maximization0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
Object-Aware Regularization for Addressing Causal Confusion in Imitation LearningCode1
The Value of Information When Deciding What to Learn0
Dynamic Causal Bayesian OptimizationCode1
HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits0
ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive ModelsCode0
Anti-Concentrated Confidence Bonuses for Scalable Exploration0
Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized RecommendationsCode0
SS-MAIL: Self-Supervised Multi-Agent Imitation Learning0
Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits0
Medical Dead-ends and Learning to Identify High-risk States and TreatmentsCode1
Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations0
Gambits: Theory and Evidence0
Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams0
Show:102550
← PrevPage 15 of 25Next →

No leaderboard results yet.