Reward Shaping via Diffusion Process in Reinforcement Learning Jun 20, 2023 Navigate reinforcement-learning
— Unverified 0Adaptive Ordered Information Extraction with Deep Reinforcement Learning Jun 19, 2023 Deep Reinforcement Learning Event Extraction
Code Code Available 0On the Model-Misspecification in Reinforcement Learning Jun 19, 2023 model Open-Ended Question Answering
— Unverified 0AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents Jun 19, 2023 Deep Reinforcement Learning MuJoCo
Code Code Available 0Enhancing variational quantum state diagonalization using reinforcement learning techniques Jun 19, 2023 Quantum Machine Learning reinforcement-learning
Code Code Available 0Acceleration in Policy Optimization Jun 18, 2023 Meta-Learning Policy Gradient Methods
— Unverified 0The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions Jun 17, 2023 Atari Games Reinforcement Learning (RL)
— Unverified 0Genes in Intelligent Agents Jun 17, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Active Policy Improvement from Multiple Black-box Oracles Jun 17, 2023 Imitation Learning Reinforcement Learning (RL)
Code Code Available 0Do as I can, not as I get Jun 17, 2023 Knowledge Graphs Multi-modal Knowledge Graph
— Unverified 0Bootstrapped Representations in Reinforcement Learning Jun 16, 2023 Auxiliary Learning reinforcement-learning
— Unverified 0Temporal Difference Learning with Experience Replay Jun 16, 2023 Reinforcement Learning (RL)
— Unverified 0Semi-Offline Reinforcement Learning for Optimized Text Generation Jun 16, 2023 Offline RL reinforcement-learning
Code Code Available 0The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement Jun 16, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Real-Time Network-Level Traffic Signal Control: An Explicit Multiagent Coordination Method Jun 15, 2023 Reinforcement Learning (RL) Traffic Signal Control
— Unverified 0Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization Jun 15, 2023 Management Multi-agent Reinforcement Learning
— Unverified 0Predictive Maneuver Planning with Deep Reinforcement Learning (PMP-DRL) for comfortable and safe autonomous driving Jun 15, 2023 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling Jun 15, 2023 Reinforcement Learning (RL) Sensitivity
— Unverified 0Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Granger Causal Interaction Skill Chains Jun 15, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A reinforcement learning strategy for p-adaptation in high order solvers Jun 14, 2023 Computational Efficiency reinforcement-learning
— Unverified 0Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources Jun 14, 2023 Offline RL reinforcement-learning
— Unverified 0Off-policy Evaluation in Doubly Inhomogeneous Environments Jun 14, 2023 Offline RL Off-policy evaluation
Code Code Available 0Skill-Critic: Refining Learned Skills for Hierarchical Reinforcement Learning Jun 14, 2023 Autonomous Racing Decision Making
— Unverified 0Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning Jun 14, 2023 Meta Reinforcement Learning Navigate
— Unverified 0Multi-market Energy Optimization with Renewables via Reinforcement Learning Jun 13, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care Jun 13, 2023 Offline RL Q-Learning
— Unverified 0Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective Jun 13, 2023 Learning-To-Rank Offline RL
Code Code Available 0A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning Jun 13, 2023 D4RL Efficient Exploration
— Unverified 0DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback Jun 13, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning Jun 13, 2023 General Knowledge Management
Code Code Available 0A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning Jun 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Kernelized Reinforcement Learning with Order Optimal Regret Bounds Jun 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Combining Reinforcement Learning and Barrier Functions for Adaptive Risk Management in Portfolio Optimization Jun 12, 2023 Management Portfolio Optimization
— Unverified 0ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles Jun 12, 2023 Offline RL reinforcement-learning
— Unverified 0Diverse Projection Ensembles for Distributional Reinforcement Learning Jun 12, 2023 Distributional Reinforcement Learning Diversity
— Unverified 0Robust Reinforcement Learning through Efficient Adversarial Herding Jun 12, 2023 MuJoCo reinforcement-learning
— Unverified 0Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds Jun 12, 2023 Reinforcement Learning (RL)
— Unverified 0Online Prototype Alignment for Few-shot Policy Transfer Jun 12, 2023 Domain Adaptation Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning Jun 11, 2023 Imitation Learning Motion Planning
— Unverified 0PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning Jun 10, 2023 Decision Making Hierarchical Reinforcement Learning
— Unverified 0Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel Jun 9, 2023 Decision Making reinforcement-learning
— Unverified 0The Role of Diverse Replay for Generalisation in Reinforcement Learning Jun 9, 2023 Diversity reinforcement-learning
— Unverified 0Learning Not to Spoof Jun 9, 2023 Reinforcement Learning (RL)
— Unverified 0Approximate information state based convergence analysis of recurrent Q-learning Jun 9, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Iteratively Refined Behavior Regularization for Offline Reinforcement Learning Jun 9, 2023 D4RL Offline RL
— Unverified 0Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation Jun 9, 2023 Policy Gradient Methods reinforcement-learning
— Unverified 0Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning Jun 8, 2023 Decision Making Offline RL
— Unverified 0Timing Process Interventions with Causal Inference and Reinforcement Learning Jun 7, 2023 Causal Inference reinforcement-learning
— Unverified 0State Regularized Policy Optimization on Data with Dynamics Shift Jun 6, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0