SOTAVerified

Deep Reinforcement Learning

Papers

Showing 401425 of 5822 papers

TitleStatusHype
Uniformly Conservative Exploration in Reinforcement LearningCode1
Recurrent Off-policy Baselines for Memory-based Continuous ControlCode1
An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agentsCode1
MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement LearningCode1
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent DemonstrationsCode1
Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation TasksCode1
Replay-Guided Adversarial Environment DesignCode1
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing ProblemCode1
Continuous-Time Fitted Value Iteration for Robust PoliciesCode1
Large Batch Experience ReplayCode1
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley ValuesCode1
Unified Data Collection for Visual-Inertial Calibration via Deep Reinforcement LearningCode1
HyperDQN: A Randomized Exploration Method for Deep Reinforcement LearningCode1
Emergent behavior and neural dynamics in artificial agents tracking turbulent plumesCode1
Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based NavigationCode1
ENERO: Efficient Real-Time WAN Routing Optimization with Deep Reinforcement LearningCode1
Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree SearchCode1
Focus on Impact: Indoor Exploration with Intrinsic MotivationCode1
Learning to Navigate Intersections with Unsupervised Driver Trait InferenceCode1
Learning Selective Communication for Multi-Agent Path FindingCode1
DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioningCode1
Optimizing Quantum Variational Circuits with Deep Reinforcement LearningCode1
Hierarchical Object-to-Zone Graph for Object NavigationCode1
Learning to Synthesize Programs as Interpretable and Generalizable PoliciesCode1
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPUCode1
Show:102550
← PrevPage 17 of 233Next →

No leaderboard results yet.