Few-Shot Preference Learning for Human-in-the-Loop RL Dec 6, 2022 Meta-Learning Multi-Task Learning
— Unverified 0Adaptive Risk-Aware Bidding with Budget Constraint in Display Advertising Dec 6, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks Dec 6, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Active Classification of Moving Targets with Learned Control Policies Dec 6, 2022 Classification Reinforcement Learning (RL)
— Unverified 0A Learned Simulation Environment to Model Plant Growth in Indoor Farming Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning Dec 6, 2022 Deep Reinforcement Learning Imitation Learning
— Unverified 0L2SR: Learning to Sample and Reconstruct for Accelerated MRI via Reinforcement Learning Dec 5, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control Dec 5, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance Dec 5, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat Dec 5, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Machine with Short-Term, Episodic, and Semantic Memory Systems Dec 5, 2022 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Bi-Level Optimization Augmented with Conditional Variational Autoencoder for Autonomous Driving in Dense Traffic Dec 5, 2022 Autonomous Driving GPU
— Unverified 0Differentiated Federated Reinforcement Learning Based Traffic Offloading on Space-Air-Ground Integrated Networks Dec 5, 2022 Fairness Federated Learning
— Unverified 0Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation Dec 5, 2022 Benchmarking Binary Classification
— Unverified 0Accelerating Interactive Human-like Manipulation Learning with GPU-based Simulation and High-quality Demonstrations Dec 5, 2022 GPU Imitation Learning
— Unverified 0PowRL: A Reinforcement Learning Framework for Robust Management of Power Networks Dec 5, 2022 Decision Making Management
— Unverified 0TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets Dec 5, 2022 D4RL MuJoCo
Code Code Available 0Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance Dec 4, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Online Shielding for Reinforcement Learning Dec 4, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Automata Learning meets Shielding Dec 4, 2022 Q-Learning Reinforcement Learning (RL)
Code Code Available 0DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning Dec 3, 2022 Autonomous Driving Multi-agent Reinforcement Learning
— Unverified 0Constrained Reinforcement Learning via Dissipative Saddle Flow Dynamics Dec 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward Dec 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning Dec 2, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0On the Energy and Communication Efficiency Tradeoffs in Federated and Multi-Task Learning Dec 2, 2022 Federated Learning Meta-Learning
— Unverified 0STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning Dec 2, 2022 continuous-control Continuous Control
Code Code Available 0Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning Dec 2, 2022 Navigate reinforcement-learning
— Unverified 0Fuse and Adapt: Investigating the Use of Pre-Trained Self-Supervising Learning Models in Limited Data NLU problems Dec 2, 2022 Domain Adaptation Emotion Recognition
— Unverified 0CT-DQN: Control-Tutored Deep Reinforcement Learning Dec 2, 2022 Car Racing Deep Reinforcement Learning
— Unverified 0Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery Dec 2, 2022 D4RL reinforcement-learning
— Unverified 0Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox Dec 1, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have Dec 1, 2022 Decision Making reinforcement-learning
— Unverified 0Launchpad: Learning to Schedule Using Offline and Online RL Methods Dec 1, 2022 Deep Reinforcement Learning Offline RL
— Unverified 0Kick-motion Training with DQN in AI Soccer Environment Dec 1, 2022 Reinforcement Learning (RL)
— Unverified 0Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar Dec 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Safe Reinforcement Learning with Probabilistic Control Barrier Functions for Ramp Merging Dec 1, 2022 Autonomous Driving Imitation Learning
— Unverified 0Modeling Mobile Health Users as Reinforcement Learning Agents Dec 1, 2022 Decision Making reinforcement-learning
— Unverified 0Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System Dec 1, 2022 Collision Avoidance Deep Reinforcement Learning
— Unverified 0Policy Optimization over General State and Action Spaces Nov 30, 2022 Reinforcement Learning (RL)
— Unverified 0Targets in Reinforcement Learning to solve Stackelberg Security Games Nov 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Multi-Truck Vehicle Routing Problems Nov 30, 2022 Combinatorial Optimization Decoder
— Unverified 0Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions Nov 30, 2022 Computational Efficiency Efficient Exploration
— Unverified 0Random Copolymer inverse design system orienting on Accurate discovering of Antimicrobial peptide-mimetic copolymers Nov 30, 2022 Activity Prediction Knowledge Distillation
— Unverified 0KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning Nov 30, 2022 Language Modeling Language Modelling
Code Code Available 0Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning Nov 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Welfare and Fairness in Multi-objective Reinforcement Learning Nov 30, 2022 Fairness Multi-Objective Reinforcement Learning
Code Code Available 0General policy mapping: online continual reinforcement learning inspired on the insect brain Nov 30, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Computationally Efficient Reinforcement Learning: Targeted Exploration leveraging Simple Rules Nov 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning Nov 30, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning Nov 30, 2022 Model Discovery Q-Learning
— Unverified 0