Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning Jul 25, 2022 Natural Language Understanding reinforcement-learning
— Unverified 0Cooperative Actor-Critic via TD Error Aggregation Jul 25, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks Jul 25, 2022 Chemical Process Decision Making
— Unverified 0REPNP: Plug-and-Play with Deep Reinforcement Learning Prior for Robust Image Restoration Jul 25, 2022 Deblurring Deep Reinforcement Learning
— Unverified 0Online Reinforcement Learning for Periodic MDP Jul 25, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations Jul 25, 2022 Decision Making Meta-Learning
— Unverified 0Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning Jul 25, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Adaptive Decision Making at the Intersection for Autonomous Vehicles Based on Skill Discovery Jul 24, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System Jul 24, 2022 Reinforcement Learning (RL)
— Unverified 0Halftoning with Multi-Agent Deep Reinforcement Learning Jul 23, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Epersist: A Self Balancing Robot Using PID Controller And Deep Reinforcement Learning Jul 23, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution Jul 22, 2022 Algorithmic Trading continuous-control
— Unverified 0Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning Jul 21, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning Jul 21, 2022 Autonomous Driving D4RL
— Unverified 0Detecting Small Query Graphs in A Large Graph via Neural Subgraph Search Jul 21, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Strategising template-guided needle placement for MR-targeted prostate biopsy Jul 21, 2022 Anatomy Decision Making
— Unverified 0Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement Learning Jul 21, 2022 Deep Reinforcement Learning Management
— Unverified 0Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise Jul 21, 2022 Management Reinforcement Learning (RL)
Code Code Available 0Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning Jul 21, 2022 Autonomous Driving reinforcement-learning
— Unverified 0On the Implementation of a Reinforcement Learning-based Capacity Sharing Algorithm in O-RAN Jul 21, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Successor Representation Active Inference Jul 20, 2022 Reinforcement Learning (RL)
Code Code Available 0Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks Jul 20, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Riemannian Stochastic Gradient Method for Nested Composition Optimization Jul 19, 2022 Meta-Learning reinforcement-learning
— Unverified 0Magpie: Automatically Tuning Static Parameters for Distributed File Systems using Deep Reinforcement Learning Jul 19, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning Jul 19, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios Jul 19, 2022 Federated Learning Q-Learning
— Unverified 0Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments Jul 19, 2022 MuJoCo reinforcement-learning
— Unverified 0Few-Shot Teamwork Jul 19, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks Jul 19, 2022 Efficient Exploration Meta Reinforcement Learning
— Unverified 0Actor-Critic based Improper Reinforcement Learning Jul 19, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Boolean Decision Rules for Reinforcement Learning Policy Summarisation Jul 18, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0An Information-Theoretic Analysis of Bayesian Reinforcement Learning Jul 18, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation Jul 18, 2022 Imitation Learning Reinforcement Learning (RL)
— Unverified 0MLGOPerf: An ML Guided Inliner to Optimize Performance Jul 18, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0MAD for Robust Reinforcement Learning in Machine Translation Jul 18, 2022 Machine Translation reinforcement-learning
— Unverified 0A framework for online, stabilizing reinforcement learning Jul 18, 2022 Management reinforcement-learning
— Unverified 0Minimum Description Length Control Jul 17, 2022 Bayesian Inference continuous-control
— Unverified 0Robust Action Governor for Uncertain Piecewise Affine Systems with Non-convex Constraints and Safe Reinforcement Learning Jul 17, 2022 RAG Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning For Survival, A Clinically Motivated Method For Critically Ill Patients Jul 17, 2022 Clinical Knowledge reinforcement-learning
— Unverified 0Context sequence theory: a common explanation for multiple types of learning Jul 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0DIMBA: Discretely Masked Black-Box Attack in Single Object Tracking Jul 17, 2022 Adversarial Attack Miscellaneous
— Unverified 0Associative Memory Based Experience Replay for Deep Reinforcement Learning Jul 16, 2022 CPU Deep Reinforcement Learning
— Unverified 0BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion Jul 16, 2022 Offline RL reinforcement-learning
— Unverified 0Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning Jul 15, 2022 Data Augmentation Deep Reinforcement Learning
Code Code Available 0Deep Hedging: Continuous Reinforcement Learning for Hedging of General Portfolios across Multiple Risk Aversions Jul 15, 2022 Reinforcement Learning (RL)
— Unverified 0Optimizing Data Collection in Deep Reinforcement Learning Jul 15, 2022 CPU Deep Reinforcement Learning
— Unverified 0Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space Jul 15, 2022 counterfactual Reinforcement Learning (RL)
— Unverified 0Skill-based Model-based Reinforcement Learning Jul 15, 2022 model Model-based Reinforcement Learning
— Unverified 0The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning Jul 15, 2022 Distributional Reinforcement Learning quantile regression
— Unverified 0Multi-Agent Deep Reinforcement Learning-Driven Mitigation of Adverse Effects of Cyber-Attacks on Electric Vehicle Charging Station Jul 14, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0