On the Convergence of Reinforcement Learning with Monte Carlo Exploring Starts Jul 21, 2020 Open-Ended Question Answering reinforcement-learning
— Unverified 0Soft Expert Reward Learning for Vision-and-Language Navigation Jul 21, 2020 Reinforcement Learning (RL) Vision and Language Navigation
— Unverified 0Multi-agent Reinforcement Learning in Bayesian Stackelberg Markov Games for Adaptive Moving Target Defense Jul 20, 2020 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks Jul 20, 2020 BIG-bench Machine Learning Edge-computing
— Unverified 0Lagrangian Duality in Reinforcement Learning Jul 20, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Active MR k-space Sampling with Reinforcement Learning Jul 20, 2020 Image Reconstruction reinforcement-learning
Code Code Available 1Interpretable Control by Reinforcement Learning Jul 20, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop Jul 20, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1A Short Note on Soft-max and Policy Gradients in Bandits Problems Jul 20, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0An Overview of Natural Language State Representation for Reinforcement Learning Jul 19, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search Jul 18, 2020 Meta-Learning Meta Reinforcement Learning
— Unverified 0Structure Mapping for Transferability of Causal Models Jul 18, 2020 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Quick Question: Interrupting Users for Microtasks with Reinforcement Learning Jul 18, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0WordCraft: An Environment for Benchmarking Commonsense Agents Jul 17, 2020 Benchmarking Knowledge Graphs
Code Code Available 1Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search Jul 17, 2020 Generative Adversarial Network GPU
Code Code Available 1Hierarchical Deep Reinforcement Learning Approach for Multi-Objective Scheduling With Varying Queue Sizes Jul 17, 2020 Deep Reinforcement Learning Position
— Unverified 0Hyperparameter Selection for Offline Reinforcement Learning Jul 17, 2020 Offline RL reinforcement-learning
— Unverified 0Discovering Reinforcement Learning Algorithms Jul 17, 2020 Atari Games Meta-Learning
Code Code Available 1Human-like Energy Management Based on Deep Reinforcement Learning and Historical Driving Experiences Jul 16, 2020 Deep Reinforcement Learning energy management
— Unverified 0Decision-making Strategy on Highway for Autonomous Vehicles using Deep Reinforcement Learning Jul 16, 2020 Autonomous Driving Autonomous Vehicles
— Unverified 0Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators Jul 16, 2020 Deep Reinforcement Learning GPU
— Unverified 0Dueling Deep Q Network for Highway Decision Making in Autonomous Vehicles: A Case Study Jul 16, 2020 Autonomous Vehicles Decision Making
— Unverified 0DRIFT: Deep Reinforcement Learning for Functional Software Testing Jul 16, 2020 Deep Reinforcement Learning Graph Neural Network
— Unverified 0Collision Avoidance Robotics Via Meta-Learning (CARML) Jul 16, 2020 Collision Avoidance Meta-Learning
Code Code Available 0CoNES: Convex Natural Evolutionary Strategies Jul 16, 2020 Benchmarking MuJoCo
— Unverified 0Meta-Gradient Reinforcement Learning with an Objective Discovered Online Jul 16, 2020 Deep Reinforcement Learning Q-Learning
— Unverified 0Provably Good Batch Reinforcement Learning Without Great Exploration Jul 16, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 1Transferred Energy Management Strategies for Hybrid Electric Vehicles Based on Driving Conditions Recognition Jul 16, 2020 Computational Efficiency energy management
— Unverified 0Weighing Counts: Sequential Crowd Counting by Reinforcement Learning Jul 16, 2020 Crowd Counting Deep Reinforcement Learning
Code Code Available 1Transfer Deep Reinforcement Learning-enabled Energy Management Strategy for Hybrid Tracked Vehicle Jul 16, 2020 Deep Reinforcement Learning energy management
— Unverified 0Reinforcement Learning-Enabled Decision-Making Strategies for a Vehicle-Cyber-Physical-System in Connected Environment Jul 16, 2020 Autonomous Vehicles Decision Making
— Unverified 0Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems Jul 15, 2020 Deep Reinforcement Learning Edge-computing
— Unverified 0Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors Jul 15, 2020 Developmental Learning Drone Controller
Code Code Available 1Inverse Reinforcement Learning from a Gradient-based Learner Jul 15, 2020 MuJoCo reinforcement-learning
— Unverified 0Computation Offloading in Beyond 5G Networks: A Distributed Learning Framework and Applications Jul 15, 2020 Edge-computing Reinforcement Learning (RL)
— Unverified 0Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions Jul 15, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning Jul 15, 2020 Deep Reinforcement Learning Q-Learning
— Unverified 0Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity Jul 15, 2020 Model-based Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning Jul 14, 2020 Network Pruning reinforcement-learning
— Unverified 0Learning to Sample with Local and Global Contexts in Experience Replay Buffer Jul 14, 2020 Reinforcement Learning (RL)
— Unverified 0Single-partition adaptive Q-learning Jul 14, 2020 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Learning Robust State Abstractions for Hidden-Parameter Block MDPs Jul 14, 2020 Generalization Bounds Meta Reinforcement Learning
Code Code Available 1Robustifying Reinforcement Learning Agents via Action Space Adversarial Training Jul 14, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Revisiting Fundamentals of Experience Replay Jul 13, 2020 Deep Reinforcement Learning DQN Replay Dataset
Code Code Available 0Reinforcement Learning of Musculoskeletal Control from Functional Simulations Jul 13, 2020 Anatomy Deep Reinforcement Learning
Code Code Available 0Implicit Distributional Reinforcement Learning Jul 13, 2020 Distributional Reinforcement Learning OpenAI Gym
Code Code Available 1AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning Jul 13, 2020 Decision Making Deep Reinforcement Learning
— Unverified 0A Provably Efficient Sample Collection Strategy for Reinforcement Learning Jul 13, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0XCS as a reinforcement learning approach to automatic test case prioritization Jul 12, 2020 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches Jul 12, 2020 Decision Making Reinforcement Learning (RL)
— Unverified 0