Branch Prediction as a Reinforcement Learning Problem: Why, How and Case Studies | Jun 25, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Control of a Mixed Autonomy Signalised Urban Intersection: An Action-Delayed Reinforcement Learning Approach | Jun 24, 2021 | Reinforcement Learning (RL) | Unverified | 0
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Jun 24, 2021 | MuJoCo, OpenAI Gym | Code Available | 2
Density Constrained Reinforcement Learning | Jun 24, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Model-Based Reinforcement Learning via Latent-Space Collocation | Jun 24, 2021 | model, Model-based Reinforcement Learning | Code Available | 1
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation | Jun 24, 2021 | Meta Reinforcement Learning, Off-policy evaluation | Code Available | 1
Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots | Jun 24, 2021 | Deep Reinforcement Learning, Navigate | Unverified | 0
The Option Keyboard: Combining Skills in Reinforcement Learning | Jun 24, 2021 | Management, reinforcement-learning | Unverified | 0
Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations | Jun 23, 2021 | Event Extraction, Incremental Learning | Code Available | 1
Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning | Jun 23, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Bregman Gradient Policy Optimization | Jun 23, 2021 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Uncertainty-Aware Model-Based Reinforcement Learning with Application to Autonomous Driving | Jun 23, 2021 | Autonomous Driving, Model-based Reinforcement Learning | Unverified | 0
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement Learning, Offline RL | Unverified | 0
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation | Jun 22, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Local policy search with Bayesian optimization | Jun 22, 2021 | Bayesian Optimization, Reinforcement Learning (RL) | Code Available | 1
Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning | Jun 22, 2021 | Distributional Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0
Variance-Aware Off-Policy Evaluation with Linear Function Approximation | Jun 22, 2021 | Off-policy evaluation, Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement Learning, Multi-Armed Bandits | Code Available | 0
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations | Jun 22, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning | Jun 22, 2021 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
Lifted Model Checking for Relational MDPs | Jun 22, 2021 | model, Model-based Reinforcement Learning | Unverified | 0
Distributed Heuristic Multi-Agent Path Finding with Communication | Jun 21, 2021 | Multi-Agent Path Finding, Q-Learning | Code Available | 1
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations | Jun 21, 2021 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Emphatic Algorithms for Deep Reinforcement Learning | Jun 21, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0
Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming | Jun 21, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0
Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems | Jun 21, 2021 | Management, Q-Learning | Unverified | 0
Policy Smoothing for Provably Robust Reinforcement Learning | Jun 21, 2021 | Adversarial Robustness, image-classification | Unverified | 0
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Jun 21, 2021 | Offline RL, Reinforcement Learning (RL) | Code Available | 1
Scientific multi-agent reinforcement learning for wall-models of turbulent flows | Jun 21, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0
Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learning, reinforcement-learning | Unverified | 0
Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach | Jun 19, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Video Summarization through Reinforcement Learning with a 3D Spatio-Temporal U-Net | Jun 19, 2021 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RL, Q-Learning | Unverified | 0
A Max-Min Entropy Framework for Reinforcement Learning | Jun 19, 2021 | Disentanglement, reinforcement-learning | Code Available | 1
Adversarially Trained Neural Policies in the Fourier Domain | Jun 18, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-control, Continuous Control | Code Available | 1
Non-Robust Feature Mapping in Deep Reinforcement Learning | Jun 18, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0
Strategically-timed State-Observation Attacks on Deep Reinforcement Learning Agents | Jun 18, 2021 | Adversarial Attack, continuous-control | Unverified | 0
Sample Efficient Social Navigation Using Inverse Reinforcement Learning | Jun 18, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments | Jun 18, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Proper Value Equivalence | Jun 18, 2021 | Model-based Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0
MADE: Exploration via Maximizing Deviation from Explored Regions | Jun 18, 2021 | Efficient Exploration, Reinforcement Learning (RL) | Code Available | 1
The Curse of Passive Data Collection in Batch Reinforcement Learning | Jun 18, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result | Jun 18, 2021 | Deep Reinforcement Learning, Object Recognition | Unverified | 0
Goal-Directed Planning by Reinforcement Learning and Active Inference | Jun 18, 2021 | Bayesian Inference, Decision Making | Unverified | 0
Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | Classification, Deep Reinforcement Learning | Unverified | 0
A Reinforcement Learning Approach for an IRS-assisted NOMA Network | Jun 17, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Adapting the Function Approximation Architecture in Online Reinforcement Learning | Jun 17, 2021 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Deep Reinforcement Learning Based Optimization for IRS Based UAV-NOMA Downlink Networks | Jun 17, 2021 | Deep Reinforcement Learning, Position | Unverified | 0