SOTAVerified

Deep Reinforcement Learning

Papers

Showing 47014750 of 5822 papers

TitleStatusHype
Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation0
Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression and Challenge0
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement LearningCode0
Efficient Intrinsically Motivated Robotic Grasping with Learning-Adaptive Imagination in Latent Space0
Hierarchical Deep Double Q-Routing0
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Defensive Escort Teams via Multi-Agent Deep Reinforcement Learning0
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals0
Policies Modulating Trajectory GeneratorsCode0
DeepMNavigate: Deep Reinforced Multi-Robot Navigation Unifying Local & Global Collision Avoidance0
Deep Q-Network for Angry BirdsCode0
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning0
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement LearningCode0
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots0
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification0
End-to-End Motion Planning of Quadrotors Using Deep Reinforcement Learning0
Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving0
Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning0
Relational Graph Learning for Crowd NavigationCode0
How to Evaluate Machine Learning Approaches for Combinatorial Optimization: Application to the Travelling Salesman ProblemCode0
Deep Reinforcement Learning Based Power control for Wireless Multicast Systems0
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning0
Counterfactual States for Atari Agents via Generative Deep Learning0
Harnessing Structures for Value-Based Planning and Reinforcement LearningCode0
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous ControlCode0
QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error0
Evo-NAS: Evolutionary-Neural Hybrid Agent for Architecture Search0
Assessing Generalization in TD methods for Deep Reinforcement Learning0
Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients0
Zero-Shot Policy Transfer with Disentangled Attention0
Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning0
HIPPOCAMPAL NEURONAL REPRESENTATIONS IN CONTINUAL LEARNING0
Learning Semantically Meaningful Representations Through Embodiment0
Improving Exploration of Deep Reinforcement Learning using Planning for Policy Search0
Learning Latent Representations for Inverse Dynamics using Generalized Experiences0
Long-term planning, short-term adjustments0
D3PG: Deep Differentiable Deterministic Policy Gradients0
Learning Key Steps to Attack Deep Reinforcement Learning Agents0
Do recent advancements in model-based deep reinforcement learning really improve data efficiency?0
C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement LearningCode0
Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone MicrocontrollerCode0
Deep Auto-Deferring Policy for Combinatorial Optimization0
Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning0
How many weights are enough : can tensor factorization learn efficient policies ?0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
ROS-HPL: Robotic Object Search with Hierarchical Policy Learning and Intrinsic-Extrinsic Modeling0
MoET: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees0
Controlling an Autonomous Vehicle with Deep Reinforcement Learning0
Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement LearningCode0
Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches0
Show:102550
← PrevPage 95 of 117Next →

No leaderboard results yet.