SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1022610250 of 15113 papers

TitleStatusHype
Neural Topic Model with Reinforcement Learning0
Neural-to-Tree Policy Distillation with Policy Improvement Criterion0
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy0
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment0
NeurIPS 2022 Competition: Driving SMARTS0
NeuRL: Closed-form Inverse Reinforcement Learning for Neural Decoding0
Neuroevolution-Based Inverse Reinforcement Learning0
Neuromechanics-based Deep Reinforcement Learning of Neurostimulation Control in FES cycling0
Neuromuscular Reinforcement Learning to Actuate Human Limbs through FES0
Neuron Activation Analysis for Multi-Joint Robot Reinforcement Learning0
Neuron as an Agent0
Neuroprospecting with DeepRL agents0
Neuro-Symbolic Hierarchical Rule Induction0
Neuro-symbolic Meta Reinforcement Learning for Trading0
Neurosymbolic Reinforcement Learning and Planning: A Survey0
Neuro-Symbolic Reinforcement Learning with First-Order Logic0
Neuro-Symbolic World Models for Adapting to Open World Novelty0
NeuSaver: Neural Adaptive Power Consumption Optimization for Mobile Video Streaming0
Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions0
New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning0
New Challenges in Reinforcement Learning: A Survey of Security and Privacy0
New Reinforcement Learning Using a Chaotic Neural Network for Emergence of "Thinking" - "Exploration" Grows into "Thinking" through Learning -0
News-based trading strategies0
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks0
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs0
Show:102550
← PrevPage 410 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified