SOTAVerified

Deep Reinforcement Learning

Papers

Showing 51–100 of 5822 papers

Title | Status | Hype
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation | Code | 2
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Code | 2
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Code | 2
Transformers are Sample-Efficient World Models | Code | 2
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning | Code | 2
Deep Reinforcement Learning for Multi-Agent Interaction | Code | 2
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning | Code | 2
DayDreamer: World Models for Physical Robot Learning | Code | 2
Accelerated Policy Learning with Parallel Differentiable Simulation | Code | 2
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning | Code | 2
Reinforcement Learning Textbook | Code | 2
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance | Code | 2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Code | 2
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Code | 2
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning | Code | 2
Learning Practically Feasible Policies for Online 3D Bin Packing | Code | 2
Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Code | 2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Code | 2
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Code | 2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Code | 2
Decoupling Representation Learning from Reinforcement Learning | Code | 2
Flightmare: A Flexible Quadrotor Simulator | Code | 2
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch | Code | 2
Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles | Code | 2
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | Code | 2
Accelerated Methods for Deep Reinforcement Learning | Code | 2
Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Code | 2
Benchmarking Deep Reinforcement Learning for Continuous Control | Code | 2
Deep Reinforcement Learning with Gradient Eligibility Traces | Code | 1
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Code | 1
The Cell Must Go On: Agar.io for Continual Reinforcement Learning | Code | 1
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | Code | 1
Reasoning on a Budget: Miniaturizing DeepSeek R1 with SFT-GRPO Alignment for Instruction-Tuned LLMs | Code | 1
Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | Code | 1
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | Code | 1
Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Code | 1
Learning Decision Trees as Amortized Structure Inference | Code | 1
Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning | Code | 1
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Code | 1
Playing Pokémon Red via Deep Reinforcement Learning | Code | 1
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Code | 1
Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error | Code | 1
Reevaluating Policy Gradient Methods for Imperfect-Information Games | Code | 1
A Comprehensive Survey on Self-Interpretable Neural Networks | Code | 1
Divergence-Augmented Policy Optimization | Code | 1
Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks | Code | 1
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Code | 1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Code | 1
Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater | Code | 1
GRAM: Generalization in Deep RL with a Robust Adaptation Module | Code | 1
Page 2 of 117

No leaderboard results yet.