SOTAVerified

Deep Reinforcement Learning

Papers

Showing 551600 of 5822 papers

TitleStatusHype
Personalised Meta-path Generation for Heterogeneous GNNsCode1
Learning Guidance Rewards with Trajectory-space SmoothingCode1
Bridging Imagination and Reality for Model-Based Deep Reinforcement LearningCode1
Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement LearningCode1
Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement LearningCode1
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle RoutingCode1
Improving Generalization in Reinforcement Learning with Mixture RegularizationCode1
Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforcement Learning ApproachCode1
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement LearningCode1
Iterative Amortized Policy OptimizationCode1
Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous ControlCode1
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous ControlCode1
UAV Path Planning using Global and Local Map Information with Deep Reinforcement LearningCode1
Deep Reinforcement Learning for Real-Time Optimization of Pumps in Water Distribution SystemsCode1
A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q NetworkCode1
Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V CommunicationCode1
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement LearningCode1
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological ModelsCode1
Prioritized Level ReplayCode1
A Traffic Light Dynamic Control Algorithm with Deep Reinforcement Learning Based on GNN PredictionCode1
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy DecompositionCode1
Deep Reinforcement Learning for Process SynthesisCode1
SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement LearningCode1
Meta-AAD: Active Anomaly Detection with Deep Reinforcement LearningCode1
Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly DataCode1
Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile NetworksCode1
Deep Active Inference for Partially Observable MDPsCode1
Learning Dexterous Grasping with Object-Centric Visual AffordancesCode1
Sample-Efficient Automated Deep Reinforcement LearningCode1
AllenAct: A Framework for Embodied AI ResearchCode1
Learning Off-Policy with Online PlanningCode1
Social-Aware Incentive Mechanism for VehicularCrowdsensing by Deep Reinforcement LearningCode1
Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learningCode1
TriFinger: An Open-Source Robot for Learning DexterityCode1
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Robust Deep Reinforcement Learning through Adversarial LossCode1
Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement LearningCode1
Combining Deep Reinforcement Learning and Search for Imperfect-Information GamesCode1
Monte-Carlo Tree Search as Regularized Policy OptimizationCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
Integrating Deep Reinforcement Learning Networks with Health System SimulationsCode1
Joint Trajectory and Passive Beamforming Design for Intelligent Reflecting Surface-Aided UAV Communications: A Deep Reinforcement Learning ApproachCode1
Weighing Counts: Sequential Crowd Counting by Reinforcement LearningCode1
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience ReplayCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Long-Term Planning with Deep Reinforcement Learning on Autonomous DronesCode1
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement LearningCode1
Guided Exploration with Proximal Policy Optimization using a Single DemonstrationCode1
LFQ: Online Learning of Per-flow Queuing Policies using Deep Reinforcement LearningCode1
Verifiably Safe Exploration for End-to-End Reinforcement LearningCode1
Show:102550
← PrevPage 12 of 117Next →

No leaderboard results yet.