User Retention-oriented Recommendation with Decision Transformer | Mar 11, 2023 | Contrastive Learning, counterfactual | Code Available 1
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs | Mar 10, 2023 | Reinforcement Learning (RL) | Unverified 0
Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning | Mar 10, 2023 | Deep Reinforcement Learning, Diversity | Unverified 0
Optimal foraging strategies can be learned | Mar 10, 2023 | reinforcement-learning, Reinforcement Learning | Code Available 0
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RL, Q-Learning | Code Available 1
Evolving Populations of Diverse RL Agents with MAP-Elites | Mar 9, 2023 | Reinforcement Learning (RL) | Unverified 0
GOATS: Goal Sampling Adaptation for Scooping with Curriculum Reinforcement Learning | Mar 9, 2023 | Position, reinforcement-learning | Unverified 0
Real-time scheduling of renewable power systems through planning-based reinforcement learning | Mar 9, 2023 | reinforcement-learning, Reinforcement Learning | Unverified 0
A Framework for History-Aware Hyperparameter Optimisation in Reinforcement Learning | Mar 9, 2023 | Decision Making, reinforcement-learning | Unverified 0
Conceptual Reinforcement Learning for Language-Conditioned Tasks | Mar 9, 2023 | Deep Reinforcement Learning, reinforcement-learning | Unverified 0
Computably Continuous Reinforcement-Learning Objectives are PAC-learnable | Mar 9, 2023 | General Reinforcement Learning, reinforcement-learning | Unverified 0
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective | Mar 9, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards | Mar 9, 2023 | Decision Making, regression | Unverified 0
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks | Mar 9, 2023 | counterfactual, Counterfactual Reasoning | Unverified 0
Beware of Instantaneous Dependence in Reinforcement Learning | Mar 9, 2023 | Model-based Reinforcement Learning, reinforcement-learning | Unverified 0
Power and Interference Control for VLC-Based UDN: A Reinforcement Learning Approach | Mar 9, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Task Aware Dreamer for Task Generalization in Reinforcement Learning | Mar 9, 2023 | reinforcement-learning, Reinforcement Learning | Unverified 0
Using Memory-Based Learning to Solve Tasks with State-Action Constraints | Mar 8, 2023 | Deep Reinforcement Learning, reinforcement-learning | Unverified 0
RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement Learning | Mar 8, 2023 | counterfactual, reinforcement-learning | Code Available 0
MCTS-GEB: Monte Carlo Tree Search is a Good E-graph Builder | Mar 8, 2023 | graph construction, Reinforcement Learning (RL) | Code Available 0
Deep Occupancy-Predictive Representations for Autonomous Driving | Mar 7, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified 0
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning | Mar 7, 2023 | Navigate, reinforcement-learning | Code Available 1
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning | Mar 7, 2023 | continuous-control, Continuous Control | Code Available 1
Learning When to Treat Business Processes: Prescriptive Process Monitoring with Causal Inference and Reinforcement Learning | Mar 7, 2023 | Causal Inference, Conformal Prediction | Code Available 0
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles | Mar 7, 2023 | Image Generation, reinforcement-learning | Code Available 1
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous Control, Offline RL | Unverified 0
Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots | Mar 7, 2023 | reinforcement-learning, Reinforcement Learning | Unverified 0
Decoupling Skill Learning from Robotic Control for Generalizable Object Manipulation | Mar 7, 2023 | Imitation Learning, Reinforcement Learning (RL) | Unverified 0
adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision Making, Reinforcement Learning (RL) | Unverified 0
Graph Decision Transformer | Mar 7, 2023 | Offline RL, OpenAI Gym | Unverified 0
Evolutionary Reinforcement Learning: A Survey | Mar 7, 2023 | Board Games, Hyperparameter Optimization | Unverified 0
Learning Bipedal Walking for Humanoids with Current Feedback | Mar 7, 2023 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available 3
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples | Mar 7, 2023 | Offline RL, Off-policy evaluation | Unverified 0
Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers | Mar 6, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments | Mar 6, 2023 | Deep Reinforcement Learning, Motion Planning | Unverified 0
Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment | Mar 6, 2023 | Reinforcement Learning (RL) | Unverified 0
Perspectives on the Social Impacts of Reinforcement Learning with Human Feedback | Mar 6, 2023 | Misinformation, reinforcement-learning | Unverified 0
Safe Reinforcement Learning via Probabilistic Logic Shields | Mar 6, 2023 | reinforcement-learning, Reinforcement Learning | Code Available 0
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning | Mar 6, 2023 | continuous-control, Continuous Control | Unverified 0
Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning | Mar 5, 2023 | reinforcement-learning, Reinforcement Learning | Code Available 0
Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A Reinforcement Learning Based Approach | Mar 5, 2023 | Management, Reinforcement Learning (RL) | Unverified 0
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks | Mar 5, 2023 | continuous-control, Continuous Control | Code Available 0
Ensemble Reinforcement Learning: A Survey | Mar 5, 2023 | Ensemble Learning, Model Selection | Unverified 0
Bounding the Optimal Value Function in Compositional Reinforcement Learning | Mar 5, 2023 | reinforcement-learning, Reinforcement Learning | Code Available 0
Local Environment Poisoning Attacks on Federated Reinforcement Learning | Mar 5, 2023 | Federated Learning, OpenAI Gym | Unverified 0
CFlowNets: Continuous Control with Generative Flow Networks | Mar 4, 2023 | Active Learning, continuous-control | Code Available 0
Look-Ahead AC Optimal Power Flow: A Model-Informed Reinforcement Learning Approach | Mar 4, 2023 | Decision Making, reinforcement-learning | Unverified 0
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | Mar 4, 2023 | Atari Games, Deep Reinforcement Learning | Unverified 0
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control | Mar 4, 2023 | MuJoCo, Q-Learning | Unverified 0
Neural Airport Ground Handling | Mar 4, 2023 | Combinatorial Optimization, Reinforcement Learning (RL) | Code Available 1