SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 61766200 of 15113 papers

TitleStatusHype
TransDreamer: Reinforcement Learning with Transformer World Models0
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning0
Can Interpretable Reinforcement Learning Manage Prosperity Your Way?0
Distributed Multi-Agent Reinforcement Learning with One-hop Neighbors and Compute Straggler MitigationCode1
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations0
VRL3: A Data-Driven Framework for Visual Deep Reinforcement LearningCode2
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization0
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban DrivingCode1
A Survey on Deep Reinforcement Learning-based Approaches for Adaptation and Generalization0
Improving Intrinsic Exploration with Language Abstractions0
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs0
A Survey of Explainable Reinforcement Learning0
Retrieval-Augmented Reinforcement Learning0
Should I send this notification? Optimizing push notifications decision making by modeling the future0
Robust Reinforcement Learning via Genetic Curriculum0
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight ControlCode1
Open-Ended Reinforcement Learning with Neural Reward FunctionsCode1
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo0
Branching Reinforcement Learning0
Domain Adaptive Fake News Detection via Reinforcement Learning0
An Intrusion Response System utilizing Deep Q-Networks and System PartitionsCode0
Deep Reinforcement Learning Based Multi-Access Edge Computing Schedule for Internet of Vehicle0
Energy-Efficient Parking Analytics System using Deep Reinforcement LearningCode0
Safe Reinforcement Learning by Imagining the Near FutureCode1
Show:102550
← PrevPage 248 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified