Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback: each action yields a reward or a penalty. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
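As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning, one common RL algorithm among many. The five-state chain environment, the function names (`step`, `train`, `greedy_policy`), and all hyperparameter values are assumptions made up for this example; none of them come from the papers listed on this page.

```python
import random

# Hypothetical toy environment: states 0..4 on a line.
# Reaching state 4 ends the episode and pays a reward of 1.
N_STATES = 5
ACTIONS = (-1, +1)  # action index 0 = move left, 1 = move right


def step(state, action_index):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table of action values Q[state][action] by trial and error."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.randrange(2)
            else:
                # Exploit: pick a best-known action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning update: move the estimate toward
            # reward + discounted value of the best next action.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the preferred action index for each non-terminal state."""
    return [max((0, 1), key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

The reward here is the feedback signal, the Q-table is the agent's learned estimate of long-term reward, and the greedy policy read off that table is the decision-making strategy. In this toy setup the learned policy should prefer moving right in every state, since that is the shortest path to the only reward.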

Papers

Showing 2026–2050 of 15113 papers

Title	Date	Tasks	Status	Hype	Score
Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning	May 19, 2021	Density Estimation, Reinforcement Learning (RL)	Code Available	1	5
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning	Jul 2, 2021	Benchmarking, Causal Discovery	Code Available	1	5
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach	Nov 14, 2021	Algorithmic Trading, General Reinforcement Learning	Code Available	1	5
Improving and Benchmarking Offline Reinforcement Learning Algorithms	Jun 1, 2023	Attribute, Benchmarking	Code Available	1	5
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings	Mar 4, 2021	Atari Games, Computational Efficiency	Code Available	1	5
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning	Oct 10, 2022	Data Augmentation, reinforcement-learning	Code Available	1	5
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes	Jun 19, 2020	Continual Learning, Decision Making	Code Available	1	5
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning	Jul 8, 2022	Diversity, Multi-agent Reinforcement Learning	Code Available	1	5
Interactive Machine Learning of Musical Gesture	Nov 26, 2020	BIG-bench Machine Learning, Reinforcement Learning (RL)	Code Available	1	5
Adaptive Transformers in RL	Apr 8, 2020	Partially Observable Reinforcement Learning, reinforcement-learning	Code Available	1	5
Behavior From the Void: Unsupervised Active Pre-Training	Mar 8, 2021	Atari Games, Reinforcement Learning (RL)	Code Available	1	5
Analytical Lyapunov Function Discovery: An RL-based Generative Approach	Feb 4, 2025	Reinforcement Learning (RL), valid	Code Available	1	5
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture	May 28, 2021	Meta Reinforcement Learning, MuJoCo	Code Available	1	5
Tell me why! Explanations support learning relational and causal structure	Dec 7, 2021	Odd One Out, Reinforcement Learning (RL)	Code Available	1	5
TEMPERA: Test-Time Prompting via Reinforcement Learning	Nov 21, 2022	Few-Shot Learning, Natural Language Inference	Code Available	1	5
Behavior Proximal Policy Optimization	Feb 22, 2023	D4RL, Offline RL	Code Available	1	5
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint	Jan 11, 2024	Question Answering, Reinforcement Learning (RL)	Code Available	1	5
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents	Jan 11, 2024	Deep Reinforcement Learning, reinforcement-learning	Code Available	1	5
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization	Jul 11, 2020	Bayesian Optimization, Data Augmentation	Code Available	1	5
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines	Oct 8, 2020	Common Sense Reasoning, Commonsense Reasoning for RL	Code Available	1	5
Text Generation by Learning from Demonstrations	Sep 16, 2020	Machine Translation, Question Generation	Code Available	1	5
BIMRL: Brain Inspired Meta Reinforcement Learning	Oct 29, 2022	Meta Reinforcement Learning, reinforcement-learning	Code Available	1	5
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning	Feb 8, 2022	continuous-control, Continuous Control	Code Available	1	5
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning	Jun 3, 2021	Deep Reinforcement Learning, Model-based Reinforcement Learning	Code Available	1	5
Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning	Feb 15, 2021	Deep Reinforcement Learning, Multi-agent Reinforcement Learning	Code Available	1	5


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified