SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1415114200 of 15113 papers

TitleStatusHype
Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedbackCode0
Reward Shaping for Human Learning via Inverse Reinforcement LearningCode0
AI Safety GridworldsCode0
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation ProblemCode0
An Optical Control Environment for Benchmarking Reinforcement Learning AlgorithmsCode0
Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and LimitationsCode0
Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural RewardsCode0
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient ExplorationCode0
An Open-source Sim2Real Approach for Sensor-independent Robot Navigation in a GridCode0
Diversity-based Deep Reinforcement Learning Towards Multidimensional Difficulty for Fighting Game AICode0
LatentPoison - Adversarial Attacks On The Latent SpaceCode0
Diversity-Driven Extensible Hierarchical Reinforcement LearningCode0
Deep Reinforcement Learning for Imbalanced ClassificationCode0
A Nonparametric Off-Policy Policy GradientCode0
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning ApproachCode0
Human-Inspired Framework to Accelerate Reinforcement LearningCode0
Auto.gov: Learning-based Governance for Decentralized Finance (DeFi)Code0
Deep Reinforcement Learning with Modulated Hebbian plus Q Network ArchitectureCode0
Divide-and-Conquer Reinforcement LearningCode0
Leveraging exploration in off-policy algorithms via normalizing flowsCode0
Hierarchical Reinforcement Learning with AI Planning ModelsCode0
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement LearningCode0
DL2: A Deep Learning-driven Scheduler for Deep Learning ClustersCode0
ACRE: Actor-Critic with Reward-Preserving ExplorationCode0
DM^2: Decentralized Multi-Agent Reinforcement Learning for Distribution MatchingCode0
Adaptive Combination of a Genetic Algorithm and Novelty Search for Deep NeuroevolutionCode0
Deep Reinforcement Learning for General Video Game AICode0
Deep reinforcement learning for feedback control in a collective flashing ratchetCode0
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision ProcessesCode0
Human level control through deep reinforcement learningCode0
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement LearningCode0
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement LearningCode0
AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive CrossbarsCode0
Human-Level Control without Server-Grade HardwareCode0
A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement LearningCode0
A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement LearningCode0
Deep Reinforcement Learning for Event-Triggered ControlCode0
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement LearningCode0
Do deep reinforcement learning agents model intentions?Code0
Collision Avoidance Robotics Via Meta-Learning (CARML)Code0
Deep Reinforcement Learning for Efficient Measurement of Quantum DevicesCode0
Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement LearningCode0
A Hybrid Framework for Reinsurance Optimization: Integrating Generative Models and Reinforcement LearningCode0
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?Code0
Collaborative Evolutionary Reinforcement LearningCode0
Does the Adam Optimizer Exacerbate Catastrophic Forgetting?Code0
Deep Reinforcement Learning for Dialogue GenerationCode0
Deep Reinforcement Learning for De-Novo Drug DesignCode0
Collaborative Deep Reinforcement LearningCode0
Adaptive Auxiliary Task Weighting for Reinforcement LearningCode0
Show:102550
← PrevPage 284 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified