SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 476500 of 15113 papers

TitleStatusHype
Deep Deterministic Portfolio OptimizationCode1
Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning ApproachCode1
Deep Implicit Coordination Graphs for Multi-agent Reinforcement LearningCode1
Curious Hierarchical Actor-Critic Reinforcement LearningCode1
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement LearningCode1
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character SkillsCode1
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
Cryptocurrency Portfolio Management with Deep Reinforcement LearningCode1
Automatic Data Augmentation for Generalization in Deep Reinforcement LearningCode1
Automatic Data Augmentation for Generalization in Reinforcement LearningCode1
Deep Multi-agent Reinforcement Learning for Highway On-Ramp Merging in Mixed TrafficCode1
Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement LearningCode1
Automatic Truss Design with Reinforcement LearningCode1
Automating DBSCAN via Deep Reinforcement LearningCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial MarketsCode1
Deep Reinforcement Learning based Evasion Generative Adversarial Network for Botnet DetectionCode1
Autonomous Reinforcement Learning: Formalism and BenchmarkingCode1
Autonomous Racing using a Hybrid Imitation-Reinforcement Learning ArchitectureCode1
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement LearningCode1
Avalon: A Benchmark for RL Generalization Using Procedurally Generated WorldsCode1
Deep Reinforcement Learning for Active Human Pose EstimationCode1
CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement LearningCode1
Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement LearningCode1
Show:102550
← PrevPage 20 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified