Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2826–2850 of 15113 papers

Title	Date	Tasks	Status	Score
Improving the Efficient Neural Architecture Search via Rewarding Modifications	Dec 17, 2020	Neural Architecture Searchreinforcement-learning	CodeCode Available	5
Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation	Aug 29, 2023	Decision MakingNavigate	CodeCode Available	5
A Generalised and Adaptable Reinforcement Learning Stopping Method	May 3, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	5
A General Framework for Structured Learning of Mechanical Systems	Feb 22, 2019	Model-based Reinforcement LearningReinforcement Learning	CodeCode Available	5
RH-Net: Improving Neural Relation Extraction via Reinforcement Learning and Hierarchical Relational Searching	Oct 27, 2020	Denoisingreinforcement-learning	CodeCode Available	5
Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network	Apr 7, 2021	Adversarial AttackDeep Reinforcement Learning	CodeCode Available	5
Improving the Performance of Backward Chained Behavior Trees that use Reinforcement Learning	Dec 27, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	5
Improving Portfolio Optimization Results with Bandit Networks	Oct 5, 2024	Portfolio OptimizationRecommendation Systems	CodeCode Available	5
Improving Policy Optimization with Generalist-Specialist Learning	Jun 26, 2022	Deep Reinforcement LearningImitation Learning	CodeCode Available	5
Improving Post-Processing of Audio Event Detectors Using Reinforcement Learning	Aug 19, 2022	Classificationreinforcement-learning	CodeCode Available	5
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning	Sep 10, 2018	BIG-bench Machine LearningCombinatorial Optimization	CodeCode Available	5
Improving Policy Learning via Language Dynamics Distillation	Sep 30, 2022	NetHackReinforcement Learning (RL)	CodeCode Available	5
Improving reinforcement learning algorithms: towards optimal learning rate policies	Nov 6, 2019	reinforcement-learningReinforcement Learning	CodeCode Available	5
Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning	Mar 25, 2016	reinforcement-learningReinforcement Learning	CodeCode Available	5
Improving Image Captioning with Conditional Generative Adversarial Nets	May 18, 2018	DecoderImage Captioning	CodeCode Available	5
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning	May 25, 2025	Reinforcement Learning (RL)	CodeCode Available	5
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale	Oct 13, 2024	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	5
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior	Sep 13, 2018	Image Captioningreinforcement-learning	CodeCode Available	5
Improving thermal state preparation of Sachdev-Ye-Kitaev model with reinforcement learning on quantum hardware	Jan 20, 2025	Reinforcement Learning (RL)	CodeCode Available	5
A General, Evolution-Inspired Reward Function for Social Robotics	Feb 1, 2022	Cultural Vocal Bursts Intensity PredictionImitation Learning	CodeCode Available	5
Improving Experience Replay through Modeling of Similar Transitions' Sets	Nov 12, 2021	Atari Gamesreinforcement-learning	CodeCode Available	5
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning	May 22, 2017	Information RetrievalQuestion Answering	CodeCode Available	5
Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning	Dec 16, 2023	Autonomous DrivingAutonomous Racing	CodeCode Available	5
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents	Dec 18, 2017	Deep Reinforcement LearningPolicy Gradient Methods	CodeCode Available	5
Ask Before You Act: Generalising to Novel Environments by Asking Questions	Sep 10, 2022	Reinforcement Learning (RL)	CodeCode Available	5

Show:10 25 50

← PrevPage 114 of 605Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified