SOTAVerified

Deep Reinforcement Learning

Papers

Showing 5101–5125 of 5822 papers

Title | Status | Hype
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Code | 0
Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Code | 0
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation | Code | 0
Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning | Code | 0
Importance Prioritized Policy Distillation | Code | 0
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Code | 0
Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization | Code | 0
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation | Code | 0
TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents | Code | 0
TrolleyMod v1.0: An Open-Source Simulation and Data-Collection Platform for Ethical Decision Making in Autonomous Vehicles | Code | 0
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games | Code | 0
Student-Initiated Action Advising via Advice Novelty | Code | 0
Correcting Momentum in Temporal Difference Learning | Code | 0
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems | Code | 0
Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms | Code | 0
Truly Proximal Policy Optimization | Code | 0
Improving Automatic Source Code Summarization via Deep Reinforcement Learning | Code | 0
Bayesian Optimization with Robust Bayesian Neural Networks | Code | 0
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning | Code | 0
Fast Matrix Multiplication Without Tears: A Constraint Programming Approach | Code | 0
Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication | Code | 0
Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures | Code | 0
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Code | 0
Urban Driving with Multi-Objective Deep Reinforcement Learning | Code | 0
Deep reinforcement learning from human preferences | Code | 0
