Reward Shaping for Human Learning via Inverse Reinforcement Learning Feb 25, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity Feb 24, 2020 Language Modeling Language Modelling
— Unverified 0Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic Feb 24, 2020 Open-Ended Question Answering reinforcement-learning
— Unverified 0Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning Feb 24, 2020 Distributional Reinforcement Learning Q-Learning
— Unverified 0Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach Feb 24, 2020 Autonomous Driving continuous-control
Code Code Available 0Optimizing Traffic Lights with Multi-agent Deep Reinforcement Learning and V2X communication Feb 23, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence Feb 23, 2020 Management reinforcement-learning
— Unverified 0Near-optimal Regret Bounds for Stochastic Shortest Path Feb 23, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Rapidly Personalizing Mobile Health Treatment Policies with Limited Data Feb 23, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning with Linear Quadratic Regulator Regions Feb 23, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Automatic Data Augmentation via Deep Reinforcement Learning for Effective Kidney Tumor Segmentation Feb 22, 2020 Data Augmentation Deep Reinforcement Learning
— Unverified 0Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion Feb 22, 2020 Deep Reinforcement Learning Reinforcement Learning
— Unverified 0Adversarial Radar Inference. From Inverse Tracking to Inverse Reinforcement Learning of Cognitive Radar Feb 22, 2020 Reinforcement Learning (RL) Stochastic Optimization
— Unverified 0Vehicle Tracking in Wireless Sensor Networks via Deep Reinforcement Learning Feb 22, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0On the Search for Feedback in Reinforcement Learning Feb 21, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy Feb 21, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep Reinforcement Learning Approach Feb 21, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning Feb 21, 2020 Atari Games Object
— Unverified 0Adaptive Temporal Difference Learning with Linear Function Approximation Feb 20, 2020 OpenAI Gym reinforcement-learning
— Unverified 0Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search Feb 20, 2020 Action Segmentation Gesture Recognition
— Unverified 0Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning Feb 20, 2020 Autonomous Navigation Deep Reinforcement Learning
— Unverified 0Multi-Agent Reinforcement Learning as a Computational Tool for Language Evolution Research: Historical Context and Future Challenges Feb 20, 2020 BIG-bench Machine Learning Multi-agent Reinforcement Learning
— Unverified 0oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions Feb 20, 2020 continuous-control Continuous Control
— Unverified 0Debiased Off-Policy Evaluation for Recommendation Systems Feb 20, 2020 counterfactual Off-policy evaluation
— Unverified 0Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems Feb 20, 2020 Edge-computing Meta Reinforcement Learning
— Unverified 0UAV Aided Search and Rescue Operation Using Reinforcement Learning Feb 19, 2020 Q-Learning reinforcement-learning
— Unverified 0Value-driven Hindsight Modelling Feb 19, 2020 Atari Games Reinforcement Learning
— Unverified 0Optimistic Policy Optimization with Bandit Feedback Feb 19, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning Feb 19, 2020 continuous-control Continuous Control
— Unverified 0Efficient Deep Reinforcement Learning via Adaptive Policy Transfer Feb 19, 2020 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Curriculum in Gradient-Based Meta-Reinforcement Learning Feb 19, 2020 Benchmarking Meta-Learning
— Unverified 0KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge Feb 18, 2020 Common Sense Reasoning continuous-control
— Unverified 0Empirical Policy Evaluation with Supergraphs Feb 18, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Multi-Issue Bargaining With Deep Reinforcement Learning Feb 18, 2020 continuous-control Continuous Control
— Unverified 0MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding Feb 18, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement learning for the privacy preservation and manipulation of eye tracking data Feb 17, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Reward Design for Driver Repositioning Using Multi-Agent Reinforcement Learning Feb 17, 2020 Bayesian Optimization Bilevel Optimization
— Unverified 0Langevin DQN Feb 17, 2020 Computational Efficiency Open-Ended Question Answering
Code Code Available 0Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning Feb 17, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Adaptive Experience Selection for Policy Gradient Feb 17, 2020 continuous-control Continuous Control
— Unverified 0Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning Feb 16, 2020 Deep Reinforcement Learning Object
— Unverified 0Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling Feb 15, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI Feb 15, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning Feb 15, 2020 Density Estimation Imitation Learning
Code Code Available 0Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning Feb 14, 2020 Deep Reinforcement Learning Management
— Unverified 0Robust Reinforcement Learning via Adversarial training with Langevin Dynamics Feb 14, 2020 MuJoCo reinforcement-learning
Code Code Available 0Extended Markov Games to Learn Multiple Tasks in Multi-Agent Reinforcement Learning Feb 14, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Deep Reinforcement Learning-Based Beam Tracking for Low-Latency Services in Vehicular Networks Feb 13, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic Feb 13, 2020 Autonomous Driving Autonomous Vehicles
— Unverified 0