SOTAVerified

Deep Reinforcement Learning

Papers

Showing 5101-5150 of 5822 papers

Title | Status | Hype
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Code | 0
Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Code | 0
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation | Code | 0
Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning | Code | 0
Importance Prioritized Policy Distillation | Code | 0
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Code | 0
Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization | Code | 0
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation | Code | 0
TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents | Code | 0
TrolleyMod v1.0: An Open-Source Simulation and Data-Collection Platform for Ethical Decision Making in Autonomous Vehicles | Code | 0
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games | Code | 0
Student-Initiated Action Advising via Advice Novelty | Code | 0
Correcting Momentum in Temporal Difference Learning | Code | 0
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems | Code | 0
Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms | Code | 0
Truly Proximal Policy Optimization | Code | 0
Improving Automatic Source Code Summarization via Deep Reinforcement Learning | Code | 0
Bayesian Optimization with Robust Bayesian Neural Networks | Code | 0
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning | Code | 0
Fast Matrix Multiplication Without Tears: A Constraint Programming Approach | Code | 0
Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication | Code | 0
Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures | Code | 0
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Code | 0
Urban Driving with Multi-Objective Deep Reinforcement Learning | Code | 0
Deep reinforcement learning from human preferences | Code | 0
Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning | Code | 0
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Code | 0
Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies | Code | 0
Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Code | 0
Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach | Code | 0
Fast deep reinforcement learning using online adjustments from the past | Code | 0
Trust Region-Guided Proximal Policy Optimization | Code | 0
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Code | 0
Trust-Region Twisted Policy Improvement | Code | 0
Quantum Deep Reinforcement Learning for Robot Navigation Tasks | Code | 0
Towards Better Interpretability in Deep Q-Networks | Code | 0
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning | Code | 0
Conversational Tree Search: A New Hybrid Dialog Task | Code | 0
Improving Policy Optimization with Generalist-Specialist Learning | Code | 0
Bayesian Optimization for Iterative Learning | Code | 0
Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network | Code | 0
Budgeted Reinforcement Learning in Continuous State Space | Code | 0
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers | Code | 0
An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning | Code | 0
Deep Reinforcement Learning from Hierarchical Preference Design | Code | 0
Query-based Targeted Action-Space Adversarial Policies on Deep Reinforcement Learning Agents | Code | 0
Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning | Code | 0
Queueing Network Controls via Deep Reinforcement Learning | Code | 0
Super Reinforcement Bros: Playing Super Mario Bros with Reinforcement Learning | Code | 0
Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0
