Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10926–10950 of 15113 papers

Title	Date	Tasks	Status
FLAME: Factuality-Aware Alignment for Large Language Models	May 2, 2024	HallucinationInstruction Following	—Unverified
FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation	Mar 28, 2025	Reinforcement Learning (RL)	—Unverified
FlashRL: A Reinforcement Learning Platform for Flash Games	Jan 26, 2018	CPUDiversity	—Unverified
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning	Sep 3, 2018	Lifelong learningreinforcement-learning	—Unverified
Flatland-RL : Multi-Agent Reinforcement Learning on Trains	Dec 10, 2020	Imitation LearningMulti-agent Reinforcement Learning	—Unverified
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation	Mar 17, 2025	Imitation LearningObject	—Unverified
Flexible and Efficient Long-Range Planning Through Curious Exploration	Apr 22, 2020	Deep Reinforcement LearningImitation Learning	—Unverified
Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback	Jan 27, 2025	Offline RLReinforcement Learning (RL)	—Unverified
Flexible Multiple-Objective Reinforcement Learning for Chip Placement	Apr 13, 2022	Diversityreinforcement-learning	—Unverified
FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm for Joint Passengers & Goods Transportation	Jul 27, 2020	Deep Reinforcement LearningReinforcement Learning (RL)	—Unverified
Flipping-based Policy for Chance-Constrained Markov Decision Processes	Oct 9, 2024	Reinforcement Learning (RL)Safe Reinforcement Learning	—Unverified
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning	Jun 26, 2025	Action GenerationDecision Making	—Unverified
Flow Navigation by Smart Microswimmers via Reinforcement Learning	Jan 30, 2017	Navigatereinforcement-learning	—Unverified
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning	Dec 1, 2019	Deep Reinforcement Learningreinforcement-learning	—Unverified
Flow Shape Design for Microfluidic Devices Using Deep Reinforcement Learning	Nov 29, 2018	Deep Reinforcement Learningreinforcement-learning	—Unverified
Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks	Jul 25, 2022	Chemical ProcessDecision Making	—Unverified
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery	Dec 2, 2022	D4RLreinforcement-learning	—Unverified
Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals	Sep 25, 2018	Q-Learningreinforcement-learning	—Unverified
Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models	Jul 16, 2025	Game DesignReinforcement Learning (RL)	—Unverified
FNAS: Uncertainty-Aware Fast Neural Architecture Search	May 25, 2021	FairnessGPU	—Unverified
Focus On What Matters: Separated Models For Visual-Based RL Generalization	Sep 29, 2024	Image ReconstructionReinforcement Learning (RL)	—Unverified
FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model	Dec 3, 2018	Deep Reinforcement LearningProtein Folding	—Unverified
Following Instructions by Imagining and Reaching Visual Goals	Jan 25, 2020	Instruction FollowingReinforcement Learning	—Unverified
FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning	May 16, 2018	Deep Reinforcement LearningNavigate	—Unverified
Follow the Soldiers with Optimized Single-Shot Multibox Detection and Reinforcement Learning	Aug 2, 2023	object-detectionObject Detection	—Unverified

Show:10 25 50

← PrevPage 438 of 605Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified