Local Search for Policy Iteration in Continuous Control Oct 12, 2020 continuous-control Continuous Control
— Unverified 0Human-centric Dialog Training via Offline Reinforcement Learning Oct 12, 2020 Language Modelling Offline RL
— Unverified 0AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control Oct 12, 2020 Decision Making reinforcement-learning
— Unverified 0Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? Oct 12, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Smaller World Models for Reinforcement Learning Oct 12, 2020 Atari Games reinforcement-learning
— Unverified 0Remote Electrical Tilt Optimization via Safe Reinforcement Learning Oct 12, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption Oct 12, 2020 Reinforcement Learning (RL)
— Unverified 0Nearly Minimax Optimal Reward-free Reinforcement Learning Oct 12, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Safe Reinforcement Learning with Natural Language Constraints Oct 11, 2020 Autonomous Navigation reinforcement-learning
— Unverified 0Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions Oct 11, 2020 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks Oct 11, 2020 Marketing reinforcement-learning
— Unverified 0Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Cellular Systems Oct 11, 2020 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection Oct 10, 2020 Answer Selection Reinforcement Learning (RL)
— Unverified 0Trust the Model When It Is Confident: Masked Model-based Actor-Critic Oct 10, 2020 continuous-control Continuous Control
— Unverified 0Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty Oct 10, 2020 Management Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning on Computational Resource Allocation of Cloud-based Wireless Networks Oct 10, 2020 CPU Management
— Unverified 0Parameterized Reinforcement Learning for Optical System Optimization Oct 9, 2020 Q-Learning reinforcement-learning
— Unverified 0Jointly-Learned State-Action Embedding for Efficient Reinforcement Learning Oct 9, 2020 Model-based Reinforcement Learning Recommendation Systems
— Unverified 0Characterizing Policy Divergence for Personalized Meta-Reinforcement Learning Oct 9, 2020 Diversity Meta-Learning
— Unverified 0Deep RL With Information Constrained Policies: Generalization in Continuous Control Oct 9, 2020 continuous-control Continuous Control
— Unverified 0Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments Oct 9, 2020 Incremental Learning Q-Learning
Code Code Available 0Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning Oct 9, 2020 Machine Translation reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Asset Allocation in US Equities Oct 9, 2020 Deep Reinforcement Learning Management
— Unverified 0Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning Oct 9, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Information-Driven Adaptive Sensing Based on Deep Reinforcement Learning Oct 8, 2020 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Learning Intrinsic Symbolic Rewards in Reinforcement Learning Oct 8, 2020 Deep Reinforcement Learning MuJoCo
— Unverified 0Maximum Reward Formulation In Reinforcement Learning Oct 8, 2020 Drug Discovery reinforcement-learning
Code Code Available 0Nonstationary Reinforcement Learning with Linear Function Approximation Oct 8, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Provable Fictitious Play for General Mean-Field Games Oct 8, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Regularized Inverse Reinforcement Learning Oct 7, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving Oct 7, 2020 continuous-control Continuous Control
— Unverified 0Online Safety Assurance for Deep Reinforcement Learning Oct 7, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Variational Intrinsic Control Revisited Oct 7, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control Oct 7, 2020 Computational Efficiency Q-Learning
— Unverified 0Actor-Critic Algorithm for High-dimensional Partial Differential Equations Oct 7, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective Oct 7, 2020 Active Learning Multi-Armed Bandits
— Unverified 0Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited Oct 7, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Learning Diverse Options via InfoMax Termination Critic Oct 6, 2020 Continuous Control Diversity
Code Code Available 0Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping Oct 6, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Safety Aware Reinforcement Learning (SARL) Oct 6, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning Oct 6, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Sentiment Analysis for Reinforcement Learning Oct 5, 2020 Dialogue Generation reinforcement-learning
— Unverified 0Meta-Learning of Structured Task Distributions in Humans and Machines Oct 5, 2020 Meta-Learning Meta Reinforcement Learning
Code Code Available 0Policy Learning Using Weak Supervision Oct 5, 2020 Reinforcement Learning (RL)
Code Code Available 0The act of remembering: a study in partially observable reinforcement learning Oct 5, 2020 Partially Observable Reinforcement Learning reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Electric Vehicle Routing Problem with Time Windows Oct 5, 2020 Deep Reinforcement Learning Graph Embedding
— Unverified 0Deep Reinforcement Learning for Collaborative Edge Computing in Vehicular Networks Oct 5, 2020 Deep Reinforcement Learning Edge-computing
— Unverified 0Learning to Generalize for Sequential Decision Making Oct 5, 2020 Decision Making Imitation Learning
Code Code Available 0A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning Oct 5, 2020 Decision Making Deep Reinforcement Learning
— Unverified 0Goal-directed Generation of Discrete Structures with Conditional Generative Models Oct 5, 2020 Heuristic Search Program Synthesis
— Unverified 0