SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 76017625 of 15113 papers

TitleStatusHype
Generating Interpretable Fuzzy Controllers using Particle Swarm Optimization and Genetic Programming0
Generating Paraphrases with Lean Vocabulary0
Improving Factual Consistency Between a Response and Persona Facts0
Generating Rescheduling Knowledge using Reinforcement Learning in a Cognitive Architecture0
Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles0
Generating stable molecules using imitation and reinforcement learning0
Generating Student Feedback from Time-Series Data Using Reinforcement Learning0
Generating Text with Deep Reinforcement Learning0
Generation of Policy-Level Explanations for Reinforcement Learning0
Generative Adversarial Exploration for Reinforcement Learning0
Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning0
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate0
Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate0
Generative Adversarial Imitation Learning for End-to-End Autonomous Driving on Urban Environments0
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference0
Generative Adversarial Self-Imitation Learning0
Generative Adversarial Simulator0
Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs0
Generative Exploration and Exploitation0
Generative Inverse Deep Reinforcement Learning for Online Recommendation0
Generative Job Recommendations with Large Language Model0
Generative Memory for Lifelong Reinforcement Learning0
Generative methods for sampling transition paths in molecular dynamics0
Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks0
Generative Slate Recommendation with Reinforcement Learning0
Show:102550
← PrevPage 305 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified