SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 91769200 of 15113 papers

TitleStatusHype
CityLearn: Standardizing Research in Multi-Agent Reinforcement Learning for Demand Response and Urban Energy ManagementCode1
Hierarchical principles of embodied reinforcement learning: A review0
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting AgentCode1
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning0
Reinforcement Learning for Test Case Prioritization0
Improving the Efficient Neural Architecture Search via Rewarding ModificationsCode0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning0
High-Throughput Synchronous Deep RLCode1
Interactive Question Clarification in Dialogue via Reinforcement Learning0
Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement LearningCode1
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey0
MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning0
ViNG: Learning Open-World Navigation with Visual Goals0
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FELCode0
Towards Optimal District Heating Temperature Control in China with Deep Reinforcement Learning0
CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving0
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data0
A comparative evaluation of machine learning methods for robot navigation through human crowds0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Learning Accurate Long-term Dynamics for Model-based Reinforcement LearningCode2
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation0
Grounding Artificial Intelligence in the Origins of Human Behavior0
Gegelati: Lightweight Artificial Intelligence through Generic and Evolvable Tangled Program Graphs0
Show:102550
← PrevPage 368 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified