SOTAVerified

Deep Reinforcement Learning

Papers

Showing 150 of 5822 papers

TitleStatusHype
Symmetry Considerations for Learning Task Symmetric Robot PoliciesCode7
The Dormant Neuron Phenomenon in Deep Reinforcement LearningCode6
Dynamic Datasets and Market Environments for Financial Reinforcement LearningCode6
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement LearningCode6
That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip DesignCode5
DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to RealityCode4
Discovering faster matrix multiplication algorithms with reinforcement learningCode4
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization BenchmarkCode4
Learning Bipedal Walking for Humanoids with Current FeedbackCode3
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning AlgorithmsCode3
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement LearningCode3
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment LocomotionCode3
Dopamine: A Research Framework for Deep Reinforcement LearningCode3
Learning Bipedal Walking On Planned Footsteps For Humanoid RobotsCode3
Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical lawsCode3
Deep Reinforcement LearningCode3
Streaming Deep Reinforcement Learning Finally WorksCode3
Tianshou: a Highly Modularized Deep Reinforcement Learning LibraryCode3
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative FinanceCode3
Distributed Prioritized Experience ReplayCode3
Practical Deep Reinforcement Learning Approach for Stock TradingCode3
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
Class Symbolic Regression: Gotta Fit 'Em AllCode3
Rainbow: Combining Improvements in Deep Reinforcement LearningCode3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning LibraryCode3
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement LearningCode2
Learning Efficient Online 3D Bin Packing on Packing Configuration TreesCode2
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at ScaleCode2
Learning Practically Feasible Policies for Online 3D Bin PackingCode2
Learning to Solve Job Shop Scheduling under UncertaintyCode2
Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X CommunicationsCode2
Flow: A Modular Learning Framework for Mixed Autonomy TrafficCode2
Habitat 2.0: Training Home Assistants to Rearrange their HabitatCode2
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative FinanceCode2
Efficient World Models with Context-Aware TokenizationCode2
Flightmare: A Flexible Quadrotor SimulatorCode2
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter AircraftsCode2
MaskPlace: Fast Chip Placement via Reinforced Visual Representation LearningCode2
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and ExperimentationCode2
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk ManagementCode2
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics ModelsCode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSICode2
Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot NavigationCode2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement LearningCode2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement LearningCode2
Accelerated Policy Learning with Parallel Differentiable SimulationCode2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge ComputingCode2
Show:102550
← PrevPage 1 of 117Next →

No leaderboard results yet.