SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 98519900 of 15113 papers

TitleStatusHype
Robust Reinforcement Learning-based Autonomous Driving Agent for Simulation and Real World0
SUMBT+LaRL: Effective Multi-domain End-to-end Neural Task-oriented Dialog System0
A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearnCode0
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management0
Deep Reinforcement Learning for On-line Dialogue State Tracking0
Is Q-Learning Provably Efficient? An Extended Analysis0
Contextual Bandits for adapting to changing User preferences over time0
DISPATCH: Design Space Exploration of Cyber-Physical Systems0
Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion0
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization0
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning0
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning0
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue SystemsCode0
Mobile Cellular-Connected UAVs: Reinforcement Learning for Sky Limits0
Deep Reinforcement Learning Methods for Structure-Guided Processing Path OptimizationCode0
RL STaR Platform: Reinforcement Learning for Simulation based Training of RobotsCode1
Reinforcement Learning Approaches in Social Robotics0
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control0
Multiplayer Support for the Arcade Learning Environment0
Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms0
Construction of Polar Codes with Reinforcement Learning0
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control0
Private Reinforcement Learning with PAC and Regret Guarantees0
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos0
Efficient Reinforcement Learning Development with RLzoo0
GRAC: Self-Guided and Self-Regularized Actor-CriticCode0
HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal MemoryCode0
A Contraction Approach to Model-based Reinforcement Learning0
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulationsCode1
GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning0
Finding Effective Security Strategies through Reinforcement Learning and Self-PlayCode1
Knowledge-Assisted Deep Reinforcement Learning in 5G Scheduler Design: From Theoretical Framework to Implementation0
SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement LearningCode1
Reward Maximisation through Discrete Active Inference0
Reconstructing Actions To Explain Deep Reinforcement Learning0
Time your hedge with Deep Reinforcement Learning0
Theory of Mind with Guilt Aversion Facilitates Cooperative Reinforcement Learning0
Transfer Learning in Deep Reinforcement Learning: A Survey0
Meta-AAD: Active Anomaly Detection with Deep Reinforcement LearningCode1
Text Generation by Learning from DemonstrationsCode1
DRL-FAS: A Novel Framework Based on Deep Reinforcement Learning for Face Anti-Spoofing0
Reinforcement Learning for Strategic Recommendations0
Soft policy optimization using dual-track advantage estimator0
Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly DataCode1
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents0
Decoding Polar Codes with Reinforcement Learning0
Decoupling Representation Learning from Reinforcement LearningCode2
Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile NetworksCode1
Efficient Transformers: A Survey0
Reinforcement Learning for Dynamic Resource Optimization in 5G Radio Access Network Slicing0
Show:102550
← PrevPage 198 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified