SOTAVerified

Deep Reinforcement Learning

Papers

Showing 18011850 of 5822 papers

TitleStatusHype
Enabling A Network AI Gym for Autonomous Cyber Agents0
Solving Dynamic Traveling Salesman Problems With Deep Reinforcement LearningCode2
Multi-Microgrid Collaborative Optimization Scheduling Using an Improved Multi-Agent Soft Actor-Critic Algorithm0
Physical Deep Reinforcement Learning Towards Safety Guarantee0
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations0
Quantum Deep Hedging0
On the Use of Reinforcement Learning for Attacking and Defending Load Frequency Control0
Adaptive Background Music for a Fighting Game: A Multi-Instrument Volume Modulation Approach0
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning0
Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning0
Optimal Smoothing Distribution Exploration for Backdoor Neutralization in Deep Learning-based Traffic Systems0
RLOR: A Flexible Framework of Deep Reinforcement Learning for Operation ResearchCode1
Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems0
HAPS-UAV-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach0
Deep Reinforcement Learning for Localizability-Enhanced Navigation in Dynamic Human Environments0
Distributed Two-tier DRL Framework for Cell-Free Network: Association, Beamforming and Power AllocationCode1
P^3O: Transferring Visual Representations for Reinforcement Learning via Prompting0
Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided GuaranteesCode0
Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning0
Bridging Transient and Steady-State Performance in Voltage Control: A Reinforcement Learning Approach with Safe Gradient Flow0
Multi-modal reward for visual relationships-based image captioning0
Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning0
Mobile Edge Adversarial Detection for Digital Twinning to the Metaverse with Deep Reinforcement Learning0
Conversational Tree Search: A New Hybrid Dialog TaskCode0
Measurement Optimization under Uncertainty using Deep Reinforcement Learning0
Energy Management of Multi-mode Plug-in Hybrid Electric Vehicle using Multi-agent Deep Reinforcement Learning0
Efficient Learning of High Level Plans from Play0
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics0
Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots0
Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning0
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning0
Learning to Transfer In-Hand Manipulations Using a Greedy Shape Curriculum0
Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite CommunicationsCode0
Loss of Plasticity in Continual Deep Reinforcement Learning0
Synthetic Experience ReplayCode1
AutoDenoise: Automatic Data Instance Denoising for Recommendations0
Towards Practical Multi-Robot Hybrid Tasks Allocation for Autonomous CleaningCode1
Deep Reinforcement Learning Based Power Allocation for Minimizing AoI and Energy Consumption in MIMO-NOMA IoT Systems0
Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning0
Solving routing problems for multiple cooperative Unmanned Aerial Vehicles using Transformer networks, vol. 122, pp. 106085, 2023Code0
Conceptual Reinforcement Learning for Language-Conditioned Tasks0
Quantum Power Electronics: From Theory to Implementation0
Using Memory-Based Learning to Solve Tasks with State-Action Constraints0
Virtual Reality in Metaverse over Wireless Networks with User-centered Deep Reinforcement Learning0
Learning Bipedal Walking for Humanoids with Current FeedbackCode3
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious PlayCode1
MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy0
Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments0
Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical lawsCode3
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control TasksCode0
Show:102550
← PrevPage 37 of 117Next →

No leaderboard results yet.