Semi-Supervised Off Policy Reinforcement Learning | Dec 9, 2020 | Imputation, Q-Learning | Unverified
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems | Dec 8, 2020 | CPU, Deep Reinforcement Learning | Unverified
Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory | Dec 8, 2020 | Deep Reinforcement Learning, OpenAI Gym | Code Available
Emergence of Different Modes of Tool Use in a Reaching and Dragging Task | Dec 8, 2020 | Deep Reinforcement Learning, Friction | Unverified
Efficient Reservoir Management through Deep Reinforcement Learning | Dec 7, 2020 | Deep Reinforcement Learning, Management | Unverified
Battery Model Calibration with Deep Reinforcement Learning | Dec 7, 2020 | BIG-bench Machine Learning, Deep Reinforcement Learning | Unverified
Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation | Dec 7, 2020 | Domain Adaptation, Q-Learning | Unverified
Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | Dec 7, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning | Dec 6, 2020 | Board Games, Deep Reinforcement Learning | Unverified
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation | Dec 5, 2020 | Data Augmentation, reinforcement-learning | Unverified
Multi-agent navigation based on deep reinforcement learning and traditional pathfinding algorithm | Dec 5, 2020 | Collision Avoidance, Deep Reinforcement Learning | Unverified
Neural Dynamic Policies for End-to-End Sensorimotor Learning | Dec 4, 2020 | Imitation Learning, reinforcement-learning | Unverified
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation | Dec 4, 2020 | Model-based Reinforcement Learning, Recommendation Systems | Unverified
Model-Agnostic Learning to Meta-Learn | Dec 4, 2020 | image-classification, Image Classification | Unverified
Demonstration-efficient Inverse Reinforcement Learning in Procedurally Generated Environments | Dec 4, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design | Dec 3, 2020 | Reinforcement Learning (RL), Transfer Learning | Code Available
Dynamic RAN Slicing for Service-Oriented Vehicular Networks via Constrained Learning | Dec 3, 2020 | Reinforcement Learning (RL) | Unverified
DeepCrawl: Deep Reinforcement Learning for Turn-based Strategy Games | Dec 3, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning | Dec 3, 2020 | Deep Reinforcement Learning, Protein Design | Unverified
Partially Connected Automated Vehicle Cooperative Control Strategy with a Deep Reinforcement Learning Approach | Dec 3, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified
A Safe Reinforcement Learning Architecture for Antenna Tilt Optimisation | Dec 2, 2020 | Management, reinforcement-learning | Unverified
Pareto Deterministic Policy Gradients and Its Application in 5G Massive MIMO Networks | Dec 2, 2020 | Reinforcement Learning (RL) | Unverified
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points | Dec 2, 2020 | Policy Gradient Methods, Reinforcement Learning (RL) | Unverified
Coinbot: Intelligent Robotic Coin Bag Manipulation Using Deep Reinforcement Learning And Machine Teaching | Dec 2, 2020 | Deep Reinforcement Learning, Motion Planning | Unverified
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER | Dec 2, 2020 | Reinforcement Learning (RL), valid | Unverified
Driving-Policy Adaptive Safeguard for Autonomous Vehicles Using Reinforcement Learning | Dec 2, 2020 | Autonomous Vehicles, Collision Avoidance | Unverified
Are Gradient-based Saliency Maps Useful in Deep Reinforcement Learning? | Dec 2, 2020 | Decision Making, Deep Reinforcement Learning | Unverified
BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis | Dec 2, 2020 | Disease Prediction, Reinforcement Learning (RL) | Unverified
Combining Cognitive Modeling and Reinforcement Learning for Clarification in Dialogue | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
Is Long Horizon RL More Difficult Than Short Horizon RL? | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs | Dec 1, 2020 | Diversity, Hierarchical Reinforcement Learning | Unverified
EcoLight: Intersection Control in Developing Regions Under Extreme Budget and Network Constraints | Dec 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Improving Neural Machine Translation for Sanskrit-English | Dec 1, 2020 | Machine Translation, reinforcement-learning | Unverified
Improving the Naturalness and Diversity of Referring Expression Generation models using Minimum Risk Training | Dec 1, 2020 | Diversity, Referring Expression | Unverified
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
Assessing and Accelerating Coverage in Deep Reinforcement Learning | Dec 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Instance-based Generalization in Reinforcement Learning | Dec 1, 2020 | Deep Reinforcement Learning, Generalization Bounds | Unverified
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Answer-driven Deep Question Generation based on Reinforcement Learning | Dec 1, 2020 | Decoder, Question Generation | Unverified
A Local Temporal Difference Code for Distributional Reinforcement Learning | Dec 1, 2020 | Distributional Reinforcement Learning, Imputation | Unverified
A Learning-Exploring Method to Generate Diverse Paraphrases with Multi-Objective Deep Reinforcement Learning | Dec 1, 2020 | Deep Reinforcement Learning, Diversity | Unverified
A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Dec 1, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified
Robust Multi-Agent Reinforcement Learning with Model Uncertainty | Dec 1, 2020 | model, Multi-agent Reinforcement Learning | Unverified
Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method | Dec 1, 2020 | continuous-control, Continuous Control | Unverified
Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making | Dec 1, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RL, reinforcement-learning | Code Available
Text Simplification with Reinforcement Learning Using Supervised Rewards on Grammaticality, Meaning Preservation, and Simplicity | Dec 1, 2020 | reinforcement-learning, Reinforcement Learning | Unverified
On Efficiency in Hierarchical Reinforcement Learning | Dec 1, 2020 | Computational Efficiency, Decision Making | Unverified