Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
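
As a rough illustration of the agent-environment loop described above, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `CorridorEnv` toy environment, its reward values, and the two hand-written policies do not come from any paper or library listed on this page. It only shows how an agent's actions produce rewards that are summed into a cumulative return, and how two policies can be compared by that return. No learning algorithm is included.

```python
class CorridorEnv:
    """Hypothetical toy environment: a corridor of cells with a goal at the right end."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        # Every episode starts in the leftmost cell.
        self.state = 0
        return self.state

    def step(self, action):
        # action: 0 moves left, 1 moves right; the walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = max(0, min(self.length - 1, self.state + move))
        done = self.state == self.length - 1
        # Assumed reward scheme: small penalty per step, +1 on reaching the goal.
        reward = 1.0 if done else -0.1
        return self.state, reward, done


def run_episode(env, policy, max_steps=50):
    """Run one episode and return the cumulative (undiscounted) reward."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)                  # the agent picks an action
        state, reward, done = env.step(action)  # the environment responds
        total_reward += reward
        if done:
            break
    return total_reward


def always_right(state):
    return 1


def always_left(state):
    return 0


if __name__ == "__main__":
    env = CorridorEnv(length=5)
    print("always_right return:", run_episode(env, always_right))
    print("always_left return:", run_episode(env, always_left))
```

In this toy setting the always-right policy reaches the goal and collects a higher return than the always-left policy, which never leaves the first cell. An RL algorithm would have to discover such a policy from the reward feedback alone rather than being handed it.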

Papers

Title	Date	Tasks	Status	Hype
StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models	Oct 10, 2024	Question Answering, Reinforcement Learning (RL)	Code Available	1
Retrieval-Augmented Decision Transformer: External Memory for In-context RL	Oct 9, 2024	In-Context Learning, Reinforcement Learning (RL)	Code Available	1
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning	Oct 8, 2024	GSM8K, Multi-agent Reinforcement Learning	Code Available	1
GreenLight-Gym: Reinforcement learning benchmark environment for control of greenhouse production systems	Oct 6, 2024	Numerical Integration, Reinforcement Learning (RL)	Code Available	1
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization	Oct 4, 2024	Deep Reinforcement Learning, Quantization	Code Available	1
Predictive Coding for Decision Transformer	Oct 4, 2024	Decision Making, Reinforcement Learning (RL)	Code Available	1
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI	Oct 3, 2024	Few-Shot Imitation Learning, Imitation Learning	Code Available	1
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining	Oct 1, 2024	Atari Games, model	Code Available	1
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models	Sep 27, 2024	Reinforcement Learning (RL), World Knowledge	Code Available	1
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning	Sep 27, 2024	AutoML, Benchmarking	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified