DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV Aug 4, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts Aug 4, 2022 Generative Adversarial Network Model-based Reinforcement Learning
— Unverified 0Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment Aug 4, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows Aug 4, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Transferable Multi-Agent Reinforcement Learning with Dynamic Participating Agents Aug 4, 2022 Few-Shot Learning Multi-agent Reinforcement Learning
— Unverified 0Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess Aug 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Joint V2I Network Selection and Autonomous Driving Policies Aug 3, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning Aug 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework Aug 3, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0A Lightweight Transmission Parameter Selection Scheme Using Reinforcement Learning for LoRaWAN Aug 3, 2022 Fairness reinforcement-learning
— Unverified 0Joint Sensing and Communications for Deep Reinforcement Learning-based Beam Management in 6G Aug 3, 2022 Clustering Deep Reinforcement Learning
— Unverified 0Chemotaxis of sea urchin sperm cells through deep reinforcement learning Aug 2, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task Scheduling Aug 2, 2022 Q-Learning reinforcement-learning
— Unverified 0Smart caching in a Data Lake for High Energy Physics analysis Aug 2, 2022 Management reinforcement-learning
— Unverified 0Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach Aug 1, 2022 continuous-control Continuous Control
Code Code Available 0VacciNet: Towards a Smart Framework for Learning the Distribution Chain Optimization of Vaccines for a Pandemic Aug 1, 2022 Reinforcement Learning (RL)
— Unverified 0Retrieval of surgical phase transitions using reinforcement learning Aug 1, 2022 Multi-class Classification reinforcement-learning
— Unverified 0Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot Aug 1, 2022 Deep Reinforcement Learning Friction
— Unverified 0Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning Aug 1, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning Aug 1, 2022 Asset Management Deep Reinforcement Learning
— Unverified 0Learning to generate Reliable Broadcast Algorithms Jul 31, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination Jul 31, 2022 Imitation Learning reinforcement-learning
— Unverified 0Using Chatbots to Teach Languages Jul 31, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Solving the vehicle routing problem with deep reinforcement learning Jul 30, 2022 Combinatorial Optimization Deep Reinforcement Learning
— Unverified 0Reinforcement learning with experience replay and adaptation of action dispersion Jul 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes Jul 30, 2022 Decision Making reinforcement-learning
— Unverified 0Deep Reinforcement Learning for System-on-Chip: Myths and Realities Jul 29, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization Jul 29, 2022 Deep Reinforcement Learning MuJoCo
Code Code Available 0Combining Evolutionary Search with Behaviour Cloning for Procedurally Generated Content Jul 29, 2022 Reinforcement Learning (RL) valid
— Unverified 0Meta Reinforcement Learning with Successor Feature Based Context Jul 29, 2022 continuous-control Continuous Control
— Unverified 0Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions Jul 29, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis Jul 29, 2022 Meta-Learning Meta Reinforcement Learning
Code Code Available 0Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits Jul 28, 2022 Model-based Reinforcement Learning Multi-Armed Bandits
— Unverified 0Playing a 2D Game Indefinitely using NEAT and Reinforcement Learning Jul 28, 2022 Q-Learning reinforcement-learning
— Unverified 0RangL: A Reinforcement Learning Competition Platform Jul 28, 2022 OpenAI Gym reinforcement-learning
— Unverified 0Latent Properties of Lifelong Learning Systems Jul 28, 2022 Lifelong learning reinforcement-learning
— Unverified 0Graph Inverse Reinforcement Learning from Diverse Videos Jul 28, 2022 Diversity reinforcement-learning
— Unverified 0Dynamic Shielding for Reinforcement Learning in Black-Box Environments Jul 27, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control Jul 27, 2022 continuous-control Continuous Control
— Unverified 0A Contact-Safe Reinforcement Learning Framework for Contact-Rich Robot Manipulation Jul 27, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0POSET-RL: Phase ordering for Optimizing Size and Execution Time using Reinforcement Learning Jul 27, 2022 CPU reinforcement-learning
— Unverified 0Structural Similarity for Improved Transfer in Reinforcement Learning Jul 27, 2022 Q-Learning reinforcement-learning
— Unverified 0Multi-Objective Provisioning of Network Slices using Deep Reinforcement Learning Jul 27, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms Jul 27, 2022 continuous-control Continuous Control
Code Code Available 0Unsupervised Training for Neural TSP Solver Jul 27, 2022 Graph Neural Network reinforcement-learning
— Unverified 0Semi-analytical Industrial Cooling System Model for Reinforcement Learning Jul 26, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Reinforcement Learning at Multiple Frequencies Jul 26, 2022 Offline RL reinforcement-learning
— Unverified 0Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature Jul 26, 2022 Autonomous Vehicles reinforcement-learning
— Unverified 0Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning Jul 26, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy Jul 25, 2022 continuous-control Continuous Control
Code Code Available 0