SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 73517375 of 15113 papers

TitleStatusHype
Federated Stochastic Approximation under Markov Noise and Heterogeneity: Applications in Reinforcement Learning0
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges0
Federated Transfer Reinforcement Learning for Autonomous Driving0
FedHQL: Federated Heterogeneous Q-Learning0
FedORA: Resource Allocation for Federated Learning in ORAN using Radio Intelligent Controllers0
Feedback Attribution for Counterfactual Bandit Learning in Multi-Domain Spoken Language Understanding0
Feedback-Based Tree Search for Reinforcement Learning0
Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning0
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning0
Feeling of Presence Maximization: mmWave-Enabled Virtual Reality Meets Deep Reinforcement Learning0
FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real0
Feudal Dialogue Management with Jointly Learned Feature Extractors0
Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning0
Feudal Multi-Agent Reinforcement Learning with Adaptive Network Partition for Traffic Signal Control0
Feudal Reinforcement Learning by Reading Manuals0
Feudal Reinforcement Learning for Dialogue Management in Large Domains0
Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning0
FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models0
Few-Shot Goal Inference for Visuomotor Learning and Planning0
Few-Shot Intent Inference via Meta-Inverse Reinforcement Learning0
Few-shot model-based adaptation in noisy conditions0
Few-Shot Multi-Hop Relation Reasoning over Knowledge Bases0
Few-Shot Preference Learning for Human-in-the-Loop RL0
Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning0
Few-Shot Teamwork0
Show:102550
← PrevPage 295 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified