Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error Dec 26, 2022 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Novel Reinforcement Learning Algorithm for Suppressing Synchronization in Closed Loop Deep Brain Stimulators Dec 25, 2022 Reinforcement Learning (RL)
— Unverified 0Understanding the Complexity Gains of Single-Task RL with a Curriculum Dec 24, 2022 Reinforcement Learning (RL)
— Unverified 0Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning Dec 24, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0SHIRO: Soft Hierarchical Reinforcement Learning Dec 24, 2022 Decision Making Efficient Exploration
— Unverified 0Automated Gadget Discovery in Science Dec 24, 2022 Clustering Reinforcement Learning (RL)
Code Code Available 0Deep Reinforcement Learning for Heat Pump Control Dec 24, 2022 Deep Reinforcement Learning Model Predictive Control
— Unverified 0Investigation of reinforcement learning for shape optimization of profile extrusion dies Dec 23, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0NARS vs. Reinforcement learning: ONA vs. Q-Learning Dec 23, 2022 Q-Learning reinforcement-learning
Code Code Available 0Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Dec 23, 2022 Decision Making Off-policy evaluation
— Unverified 0Reinforcement Learning Based Approaches to Adaptive Context Caching in Distributed Context Management Systems Dec 22, 2022 Management reinforcement-learning
— Unverified 0A Learned Simulation Environment to Model Student Engagement and Retention in Automated Online Courses Dec 22, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning Dec 22, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Decoding surface codes with deep reinforcement learning and probabilistic policy reuse Dec 22, 2022 Deep Reinforcement Learning Q-Learning
— Unverified 0Hyperparameters in Contextual RL are Highly Situational Dec 21, 2022 Hyperparameter Optimization reinforcement-learning
Code Code Available 0Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios Dec 21, 2022 Autonomous Driving Imitation Learning
— Unverified 0Lifelong Reinforcement Learning with Modulating Masks Dec 21, 2022 Lifelong learning reinforcement-learning
Code Code Available 0A Memetic Algorithm with Reinforcement Learning for Sociotechnical Production Scheduling Dec 21, 2022 Deep Reinforcement Learning Job Shop Scheduling
— Unverified 0Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement Learning Dec 21, 2022 Deep Reinforcement Learning Q-Learning
Code Code Available 0Robust Path Selection in Software-defined WANs using Deep Reinforcement Learning Dec 21, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Neighboring state-based RL Exploration Dec 21, 2022 Decision Making reinforcement-learning
— Unverified 0Variational Quantum Soft Actor-Critic for Robotic Arm Control Dec 20, 2022 continuous-control Continuous Control
— Unverified 0AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning Dec 20, 2022 Meta-Learning Multi-agent Reinforcement Learning
— Unverified 0Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation Dec 20, 2022 Decision Making Multi-agent Reinforcement Learning
— Unverified 0I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons Dec 20, 2022 Reinforcement Learning (RL) Text Generation
— Unverified 0Adapting the Exploration Rate for Value-of-Information-Based Reinforcement Learning Dec 20, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Inverse Reinforcement Learning for Text Summarization Dec 19, 2022 Abstractive Text Summarization reinforcement-learning
— Unverified 0Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance Dec 19, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Near-optimal Policy Identification in Active Reinforcement Learning Dec 19, 2022 Bayesian Optimization reinforcement-learning
— Unverified 0Taming Lagrangian Chaos with Multi-Objective Reinforcement Learning Dec 19, 2022 Multi-Objective Reinforcement Learning Q-Learning
— Unverified 0Quantum policy gradient algorithms Dec 19, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Risk-Sensitive Reinforcement Learning with Exponential Criteria Dec 18, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Neural Coreference Resolution based on Reinforcement Learning Dec 18, 2022 All Clustering
— Unverified 0Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off Dec 17, 2022 continuous-control Continuous Control
— Unverified 0Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning Dec 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning Dec 17, 2022 OpenAI Gym reinforcement-learning
— Unverified 0Cognitive Level-k Meta-Learning for Safe and Pedestrian-Aware Autonomous Driving Dec 17, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0Latent Variable Representation for Reinforcement Learning Dec 17, 2022 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Comparison of Model-Free and Model-Based Learning-Informed Planning for PointGoal Navigation Dec 17, 2022 Deep Reinforcement Learning model
Code Code Available 0Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling Dec 16, 2022 MuJoCo Q-Learning
— Unverified 0Safe Evaluation For Offline Learning: Are We Ready To Deploy? Dec 16, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0An Energy-aware and Fault-tolerant Deep Reinforcement Learning based approach for Multi-agent Patrolling Problems Dec 16, 2022 Autonomous Vehicles Deep Reinforcement Learning
— Unverified 0Reinforcement Learning for Agile Active Target Sensing with a UAV Dec 16, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Reinforcement Learning in Credit Scoring and Underwriting Dec 15, 2022 Decision Making Efficient Exploration
— Unverified 0Residual Policy Learning for Powertrain Control Dec 15, 2022 Reinforcement Learning (RL)
— Unverified 0Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management Dec 15, 2022 Management Multi-agent Reinforcement Learning
— Unverified 0Towards Hardware-Specific Automatic Compression of Neural Networks Dec 15, 2022 Quantization reinforcement-learning
— Unverified 0Active Inference and Reinforcement Learning: A unified inference on continuous state and action spaces under partial observability Dec 15, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet Dec 15, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 0Emergent Behaviors in Multi-Agent Target Acquisition Dec 15, 2022 Reinforcement Learning (RL)
— Unverified 0