SOTAVerified

Deep Reinforcement Learning

Papers

Showing 51100 of 5822 papers

TitleStatusHype
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter AircraftsCode2
Accelerated Policy Learning with Parallel Differentiable SimulationCode2
Flow: A Modular Learning Framework for Mixed Autonomy TrafficCode2
MaskPlace: Fast Chip Placement via Reinforced Visual Representation LearningCode2
Safety-Driven Deep Reinforcement Learning Framework for Cobots: A Sim2Real ApproachCode2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement LearningCode2
Efficient World Models with Context-Aware TokenizationCode2
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative FinanceCode2
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement LearningCode2
Learning to Solve Job Shop Scheduling under UncertaintyCode2
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and ExperimentationCode2
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk ManagementCode2
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics ModelsCode2
Decoupling Representation Learning from Reinforcement LearningCode2
DayDreamer: World Models for Physical Robot LearningCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge ComputingCode2
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement LearningCode2
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSICode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
Conformal Symplectic Optimization for Stable Reinforcement LearningCode2
Assessment of Reinforcement Learning for Macro PlacementCode2
Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation NetworkCode2
Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot NavigationCode2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement LearningCode2
Flightmare: A Flexible Quadrotor SimulatorCode2
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement LearningCode2
Accelerated Methods for Deep Reinforcement LearningCode2
Bridging State and History Representations: Understanding Self-Predictive RLCode1
AADG: Automatic Augmentation for Domain Generalization on Retinal Image SegmentationCode1
MPC-Inspired Reinforcement Learning for Verifiable Model-Free ControlCode1
Bridging Imagination and Reality for Model-Based Deep Reinforcement LearningCode1
Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D UltrasoundCode1
Bridging RL Theory and Practice with the Effective HorizonCode1
Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement LearningCode1
A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement LearningCode1
Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning ApproachCode1
BOHB: Robust and Efficient Hyperparameter Optimization at ScaleCode1
A2C is a special case of PPOCode1
Affordance Learning from Play for Sample-Efficient Policy LearningCode1
A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty QuantificationCode1
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the PastCode1
Building a 3-Player Mahjong AI using Deep Reinforcement LearningCode1
Adversarial Policy Gradient for Deep Learning Image AugmentationCode1
Adversarial Policies: Attacking Deep Reinforcement LearningCode1
Benchmarking Reinforcement Learning Techniques for Autonomous NavigationCode1
Adversarially Guided Actor-CriticCode1
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor EnvironmentsCode1
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PCCode1
BeBold: Exploration Beyond the Boundary of Explored RegionsCode1
Show:102550
← PrevPage 2 of 117Next →

No leaderboard results yet.