Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 481–490 of 15113 papers

Title	Date	Tasks	Status	Hype
Decision Transformer: Reinforcement Learning via Sequence Modeling	Jun 2, 2021	Atari GamesD4RL	CodeCode Available	1
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning	Mar 28, 2022	Distributional Reinforcement LearningDrone navigation	CodeCode Available	1
A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving	Nov 5, 2019	Automated Theorem ProvingDeep Reinforcement Learning	CodeCode Available	1
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms	Nov 30, 2023	BenchmarkingOpenAI Gym	CodeCode Available	1
Decoupling Value and Policy for Generalization in Reinforcement Learning	Feb 20, 2021	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Deep Active Inference for Partially Observable MDPs	Sep 8, 2020	Deep Reinforcement LearningQ-Learning	CodeCode Available	1
DeepFreight: Integrating Deep Reinforcement Learning and Mixed Integer Programming for Multi-transfer Truck Freight Delivery	Mar 5, 2021	Deep Reinforcement LearningMulti-agent Reinforcement Learning	CodeCode Available	1
Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning	Jun 19, 2020	Graph Neural NetworkMulti-agent Reinforcement Learning	CodeCode Available	1
A deep inverse reinforcement learning approach to route choice modeling with context-dependent rewards	Jun 18, 2022	Computational EfficiencyDemand Forecasting	CodeCode Available	1
Contrastive Active Inference	Oct 19, 2021	reinforcement-learningReinforcement Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 49 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified