Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from the feedback it receives, in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
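
As a rough illustration of the interaction loop and the cumulative reward described above, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the toy `CoinFlipEnv` environment, its simplified `reset`/`step` interface, the fixed policy, and the discount factor `gamma` are hypothetical and do not come from any paper or library listed on this page.

```python
import random


def discounted_return(rewards, gamma=0.99):
    """Cumulative reward, with each later reward discounted by gamma per step."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


class CoinFlipEnv:
    """Hypothetical toy environment: guess a biased coin, +1 if right, -1 if wrong."""

    def __init__(self, p_heads=0.8, horizon=10, seed=0):
        self.p_heads = p_heads
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.t = 0

    def reset(self):
        self.t = 0
        return 0  # single dummy state

    def step(self, action):
        outcome = 1 if self.rng.random() < self.p_heads else 0
        reward = 1.0 if action == outcome else -1.0
        self.t += 1
        done = self.t >= self.horizon
        return 0, reward, done


def run_episode(env, policy):
    """Run one episode: the agent acts, the environment returns a reward."""
    state = env.reset()
    rewards = []
    done = False
    while not done:
        action = policy(state)
        state, reward, done = env.step(action)
        rewards.append(reward)
    return rewards


# A fixed policy that always guesses heads (action 1); no learning happens here.
rewards = run_episode(CoinFlipEnv(), policy=lambda state: 1)
print(discounted_return(rewards, gamma=0.9))
```

A learning algorithm would adjust the policy between episodes to increase this return; the sketch only shows the quantity being maximized and the loop that produces it.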

Papers

Showing 826–850 of 15113 papers

Title	Date	Tasks	Status	Hype
RayNet: A Simulation Platform for Developing Reinforcement Learning-Driven Network Protocols	Feb 9, 2023	reinforcement-learning, Reinforcement Learning (RL)	Code Available	1
Predictable MDP Abstraction for Unsupervised Model-Based RL	Feb 8, 2023	model, Model-based Reinforcement Learning	Code Available	1
Multi-Task Recommendations with Reinforcement Learning	Feb 7, 2023	Multi-Task Learning, Recommendation Systems	Code Available	1
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence	Feb 7, 2023	Continuous Control, MuJoCo	Code Available	1
Two-Stage Constrained Actor-Critic for Short Video Recommendation	Feb 3, 2023	Recommendation Systems, reinforcement-learning	Code Available	1
Learning to Optimize for Reinforcement Learning	Feb 3, 2023	Inductive Bias, Meta-Learning	Code Available	1
Mind the Gap: Offline Policy Optimization for Imperfect Rewards	Feb 3, 2023	Reinforcement Learning (RL)	Code Available	1
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning	Feb 2, 2023	reinforcement-learning, Reinforcement Learning	Code Available	1
Internally Rewarded Reinforcement Learning	Feb 1, 2023	reinforcement-learning, Reinforcement Learning	Code Available	1
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees	Jan 31, 2023	continuous-control, Continuous Control	Code Available	1
Optimizing DDPM Sampling with Shortcut Fine-Tuning	Jan 31, 2023	Denoising, Reinforcement Learning (RL)	Code Available	1
Retrosynthetic Planning with Dual Value Networks	Jan 31, 2023	Drug Discovery, Multi-step retrosynthesis	Code Available	1
Execution-based Code Generation using Deep Reinforcement Learning	Jan 31, 2023	Code Completion, Code Generation	Code Available	1
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments	Jan 31, 2023	Reinforcement Learning (RL), Retrieval	Code Available	1
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining	Jan 30, 2023	Offline RL, reinforcement-learning	Code Available	1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling	Jan 28, 2023	Decision Making, Minecraft	Code Available	1
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation	Jan 27, 2023	reinforcement-learning, Reinforcement Learning (RL)	Code Available	1
Deep Laplacian-based Options for Temporally-Extended Exploration	Jan 26, 2023	Reinforcement Learning (RL)	Code Available	1
Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints	Jan 26, 2023	Distributional Reinforcement Learning, reinforcement-learning	Code Available	1
Distributed Control of Partial Differential Equations Using Convolutional Reinforcement Learning	Jan 25, 2023	reinforcement-learning, Reinforcement Learning	Code Available	1
Select and Trade: Towards Unified Pair Trading with Hierarchical Reinforcement Learning	Jan 25, 2023	Hierarchical Reinforcement Learning, PAIR TRADING	Code Available	1
PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav	Jan 18, 2023	Imitation Learning, Navigate	Code Available	1
A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles	Jan 17, 2023	Autonomous Vehicles, Reinforcement Learning (RL)	Code Available	1
Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation	Jan 14, 2023	Deep Reinforcement Learning, Industrial Robots	Code Available	1
schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments	Jan 10, 2023	Deep Reinforcement Learning, Job Shop Scheduling	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified