Safe Reinforcement Learning for Legged Locomotion | Mar 5, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Target Network and Truncation Overcome The Deadly Triad in Q-Learning | Mar 5, 2022 | Q-Learning, reinforcement-learning | Unverified
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions | Mar 4, 2022 | Causal Inference, Decision Making | Unverified
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning | Mar 4, 2022 | Object, reinforcement-learning | Unverified
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning | Mar 4, 2022 | Deep Reinforcement Learning, Domain Adaptation | Unverified
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction | Mar 3, 2022 | Autonomous Driving, reinforcement-learning | Unverified
Bilateral Deep Reinforcement Learning Approach for Better-than-human Car Following Model | Mar 3, 2022 | Autonomous Driving, Autonomous Vehicles | Unverified
Deep Q-network using reservoir computing with multi-layered readout | Mar 3, 2022 | Reinforcement Learning (RL), Time Series | Unverified
Reasoning about Counterfactuals to Improve Human Inverse Reinforcement Learning | Mar 3, 2022 | counterfactual, Counterfactual Reasoning | Code Available
Optimized cost function for demand response coordination of multiple EV charging stations using reinforcement learning | Mar 3, 2022 | Reinforcement Learning (RL) | Unverified
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RL, reinforcement-learning | Code Available
The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches | Mar 3, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Quantum Reinforcement Learning via Policy Iteration | Mar 3, 2022 | Decision Making, reinforcement-learning | Unverified
Pareto Frontier Approximation Network (PA-Net) to Solve Bi-objective TSP | Mar 2, 2022 | Reinforcement Learning (RL), Scheduling | Unverified
Reliable validation of Reinforcement Learning Benchmarks | Mar 2, 2022 | Benchmarking, Data Compression | Unverified
Evolving Curricula with Regret-Based Environment Design | Mar 2, 2022 | Reinforcement Learning (RL) | Unverified
Combining Reinforcement Learning and Optimal Transport for the Traveling Salesman Problem | Mar 2, 2022 | Combinatorial Optimization, Deep Learning | Code Available
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RL, reinforcement-learning | Code Available
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning | Mar 2, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Andes_gym: A Versatile Environment for Deep Reinforcement Learning in Power Systems | Mar 2, 2022 | Deep Reinforcement Learning, OpenAI Gym | Code Available
Learning in Sparse Rewards settings through Quality-Diversity algorithms | Mar 2, 2022 | Diversity, Reinforcement Learning (RL) | Unverified
Integrating Contrastive Learning with Dynamic Models for Reinforcement Learning from Images | Mar 2, 2022 | Contrastive Learning, Data Augmentation | Code Available
DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction | Mar 1, 2022 | Contrastive Learning, Model-based Reinforcement Learning | Unverified
Hierarchical Reinforcement Learning with AI Planning Models | Mar 1, 2022 | Decision Making, Hierarchical Reinforcement Learning | Code Available
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes | Mar 1, 2022 | Decision Making, Distributional Reinforcement Learning | Unverified
Explaining a Deep Reinforcement Learning Docking Agent Using Linear Model Trees with User Adapted Visualization | Mar 1, 2022 | Deep Reinforcement Learning, Explainable artificial intelligence | Unverified
Approximating a deep reinforcement learning docking agent using linear model trees | Mar 1, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
A Theory of Abstraction in Reinforcement Learning | Mar 1, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
On the Generalization of Representations in Reinforcement Learning | Mar 1, 2022 | Atari Games, reinforcement-learning | Code Available
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation | Feb 28, 2022 | continuous-control, Continuous Control | Unverified
Probing the Robustness of Trained Metrics for Conversational Dialogue Systems | Feb 28, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Code Available
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RL, Q-Learning | Unverified
Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning | Feb 28, 2022 | Position, reinforcement-learning | Unverified
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning | Feb 28, 2022 | Dialogue Management, Management | Unverified
Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming | Feb 27, 2022 | Portfolio Optimization, reinforcement-learning | Unverified
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization | Feb 26, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons | Feb 26, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Code Available
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation | Feb 26, 2022 | Edge-computing, Q-Learning | Unverified
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions | Feb 26, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
Domain Knowledge-Based Automated Analog Circuit Design with Deep Reinforcement Learning | Feb 26, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach | Feb 25, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Decision Making in Non-Stationary Environments with Policy-Augmented Monte Carlo Tree Search | Feb 25, 2022 | Decision Making, Decision Making Under Uncertainty | Unverified
Exploring with Sticky Mittens: Reinforcement Learning with Expert Interventions via Option Templates | Feb 25, 2022 | reinforcement-learning, Reinforcement Learning | Code Available
Context-Hierarchy Inverse Reinforcement Learning | Feb 25, 2022 | Autonomous Driving, reinforcement-learning | Unverified
Consolidated Adaptive T-soft Update for Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Reachability analysis in stochastic directed graphs by reinforcement learning | Feb 25, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Quantum Deep Reinforcement Learning for Robot Navigation Tasks | Feb 24, 2022 | BIG-bench Machine Learning, Deep Reinforcement Learning | Code Available
Learning Transferable Reward for Query Object Localization with Policy Adaptation | Feb 24, 2022 | Metric Learning, Object Localization | Code Available
Evolving-to-Learn Reinforcement Learning Tasks with Spiking Neural Networks | Feb 24, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing | Feb 24, 2022 | Edge-computing, Multi-Objective Reinforcement Learning | Unverified