HARPO: Learning to Subvert Online Behavioral Advertising Nov 9, 2021 Reinforcement Learning (RL)
— Unverified 0Safe Policy Optimization with Local Generalized Linear Function Approximations Nov 9, 2021 Reinforcement Learning (RL) Safe Exploration
Code Code Available 0Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning Nov 9, 2021 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer Nov 8, 2021 Few-Shot Learning Meta Reinforcement Learning
Code Code Available 1Dueling RL: Reinforcement Learning with Trajectory Preferences Nov 8, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods Nov 8, 2021 Autonomous Vehicles Q-Learning
— Unverified 0Reinforcement Learning for Mixed Autonomy Intersections Nov 8, 2021 Multi-Task Learning reinforcement-learning
Code Code Available 1A Dataset Perspective on Offline Reinforcement Learning Nov 8, 2021 Offline RL reinforcement-learning
Code Code Available 1Interactive Inverse Reinforcement Learning for Cooperative Games Nov 8, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Batch Reinforcement Learning from Crowds Nov 8, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance Nov 7, 2021 Deep Reinforcement Learning Friction
— Unverified 0FinRL-Podracer: High Performance and Scalable Deep Reinforcement Learning for Quantitative Finance Nov 7, 2021 Deep Reinforcement Learning GPU
— Unverified 0Explainable Deep Reinforcement Learning for Portfolio Management: An Empirical Approach Nov 7, 2021 Deep Reinforcement Learning Management
— Unverified 0Automatic Goal Generation using Dynamical Distance Learning Nov 7, 2021 Decision Making Reinforcement Learning (RL)
— Unverified 0Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments Nov 7, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Optimization of the Model Predictive Control Meta-Parameters Through Reinforcement Learning Nov 7, 2021 Model Predictive Control reinforcement-learning
— Unverified 0AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks Nov 6, 2021 Management Reinforcement Learning (RL)
— Unverified 0d3rlpy: An Offline Deep Reinforcement Learning Library Nov 6, 2021 D4RL Deep Reinforcement Learning
Code Code Available 0Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning Nov 6, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Deep Reinforcement Learning Approach for Composing Moving IoT Services Nov 6, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Development of collective behavior in newborn artificial agents Nov 6, 2021 Deep Reinforcement Learning Object Recognition
— Unverified 0Robust Deep Reinforcement Learning for Quadcopter Control Nov 6, 2021 Deep Reinforcement Learning MuJoCo
Code Code Available 1Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space Nov 5, 2021 Reinforcement Learning (RL)
— Unverified 0Supervised Advantage Actor-Critic for Recommender Systems Nov 5, 2021 Q-Learning Recommendation Systems
— Unverified 0Improving RNA Secondary Structure Design using Deep Reinforcement Learning Nov 5, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer Nov 5, 2021 Computed Tomography (CT) Diagnostic
Code Code Available 1Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning Nov 5, 2021 Meta Reinforcement Learning reinforcement-learning
— Unverified 0An Algorithmic Theory of Metacognition in Minds and Machines Nov 5, 2021 Bayesian Optimization Reinforcement Learning (RL)
— Unverified 0Control of a fly-mimicking flyer in complex flow using deep reinforcement learning Nov 4, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Infinite Time Horizon Safety of Bayesian Neural Networks Nov 4, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Successor Feature Neural Episodic Control Nov 4, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Nov 4, 2021 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Model-Free Risk-Sensitive Reinforcement Learning Nov 4, 2021 Decision Making model
— Unverified 0RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning Nov 4, 2021 Decision Making Imitation Learning
Code Code Available 1Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel Nov 4, 2021 Language Acquisition Multi-agent Reinforcement Learning
— Unverified 0Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning Nov 4, 2021 Deep Reinforcement Learning Explainable artificial intelligence
— Unverified 0Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning Nov 4, 2021 Multi-Task Learning Object
— Unverified 0Imagine Networks Nov 4, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles Nov 4, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0B-Pref: Benchmarking Preference-Based Reinforcement Learning Nov 4, 2021 Benchmarking reinforcement-learning
Code Code Available 1Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies Nov 3, 2021 All Benchmarking
— Unverified 0Autonomous Attack Mitigation for Industrial Control Systems Nov 3, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0AlphaD3M: Machine Learning Pipeline Synthesis Nov 3, 2021 AutoML BIG-bench Machine Learning
— Unverified 0Online Service Provisioning in NFV-enabled Networks Using Deep Reinforcement Learning Nov 3, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Tuning the Weights: The Impact of Initial Matrix Configurations on Successor Features Learning Efficacy Nov 3, 2021 Reinforcement Learning (RL) Representation Learning
— Unverified 0What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks Nov 3, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Model-Based Episodic Memory Induces Dynamic Hybrid Controls Nov 3, 2021 model reinforcement-learning
— Unverified 0Smooth Imitation Learning via Smooth Costs and Smooth Policies Nov 3, 2021 continuous-control Continuous Control
— Unverified 0Image-Guided Navigation of a Robotic Ultrasound Probe for Autonomous Spinal Sonography Using a Shadow-aware Dual-Agent Framework Nov 3, 2021 Anatomy Decision Making
— Unverified 0Curriculum Offline Imitation Learning Nov 3, 2021 continuous-control Continuous Control
Code Code Available 1