SOTAVerified

Deep Reinforcement Learning

Papers

Showing 401450 of 5822 papers

TitleStatusHype
Recurrent Off-policy Baselines for Memory-based Continuous ControlCode1
Uniformly Conservative Exploration in Reinforcement LearningCode1
An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agentsCode1
MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement LearningCode1
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent DemonstrationsCode1
Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation TasksCode1
Replay-Guided Adversarial Environment DesignCode1
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing ProblemCode1
Continuous-Time Fitted Value Iteration for Robust PoliciesCode1
Large Batch Experience ReplayCode1
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley ValuesCode1
Unified Data Collection for Visual-Inertial Calibration via Deep Reinforcement LearningCode1
HyperDQN: A Randomized Exploration Method for Deep Reinforcement LearningCode1
Emergent behavior and neural dynamics in artificial agents tracking turbulent plumesCode1
Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based NavigationCode1
ENERO: Efficient Real-Time WAN Routing Optimization with Deep Reinforcement LearningCode1
Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree SearchCode1
Learning to Navigate Intersections with Unsupervised Driver Trait InferenceCode1
Focus on Impact: Indoor Exploration with Intrinsic MotivationCode1
Learning Selective Communication for Multi-Agent Path FindingCode1
DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioningCode1
Optimizing Quantum Variational Circuits with Deep Reinforcement LearningCode1
Hierarchical Object-to-Zone Graph for Object NavigationCode1
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPUCode1
Learning to Synthesize Programs as Interpretable and Generalizable PoliciesCode1
Deep Reinforcement Learning at the Edge of the Statistical PrecipiceCode1
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line SystemsCode1
Responsive Regulation of Dynamic UAV Communication Networks Based on Deep Reinforcement LearningCode1
Diversity-based Trajectory and Goal Selection with Hindsight Experience ReplayCode1
Safe Deep Reinforcement Learning for Multi-Agent Systems with Continuous Action SpacesCode1
The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement LearningCode1
Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward AlgorithmCode1
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated EnvironmentsCode1
Co-designing Intelligent Control of Building HVACs and MicrogridsCode1
A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty QuantificationCode1
ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image EnhancementCode1
Distributed Online Service Coordination Using Deep Reinforcement LearningCode1
Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement LearningCode1
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and ExploitationCode1
Predicting Human Scanpaths in Visual Question AnsweringCode1
Towards Safe Reinforcement Learning via Constraining Conditional Value at RiskCode1
Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image TranslationCode1
Deep Reinforcement Learning for Conservation DecisionsCode1
Deep Reinforcement Learning based Group Recommender SystemCode1
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor RepresentationCode1
Pretraining Representations for Data-Efficient Reinforcement LearningCode1
Pretrained Encoders are All You NeedCode1
Learning Markov State Abstractions for Deep Reinforcement LearningCode1
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement LearningCode1
Dynamic Sparse Training for Deep Reinforcement LearningCode1
Show:102550
← PrevPage 9 of 117Next →

No leaderboard results yet.