SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback, in the form of rewards or penalties, on its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
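
As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning. Everything in it is an assumption made for illustration and not something this page specifies: the five-state corridor environment, the names `step`, `train`, and `greedy_policy`, the learning rate, and the discount factor `gamma`. The agent acts uniformly at random while learning, which Q-learning tolerates because it is an off-policy method.

```python
import random

# Hypothetical toy environment: a corridor of 5 states, start at 0, goal at 4.
N_STATES = 5
GOAL = N_STATES - 1
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Apply an action; return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == GOAL
    reward = 1.0 if done else 0.0  # reward only on reaching the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Learn a Q-table from random interaction with the corridor."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = rng.choice(ACTIONS)  # exploratory behaviour policy
            next_state, reward, done = step(state, action)
            # Q-learning target: immediate reward plus discounted best future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in every non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(GOAL)]


q_table = train()
print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy should choose "right" (action 1) in every non-terminal state, since that is the shortest route to the only reward. Real benchmarks use far larger state spaces and function approximation in place of a table.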

Papers

Showing 3251-3300 of 15113 papers

Title | Status | Hype
Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs | | 0
Data-Efficient Reinforcement Learning in Continuous-State POMDPs | | 0
Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games | | 0
Creativity of AI: Hierarchical Planning Model Learning for Facilitating Deep Reinforcement Learning | | 0
Creativity in Robot Manipulation with Deep Reinforcement Learning | | 0
Data-efficient visuomotor policy training using reinforcement learning and generative models | | 0
Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep Reinforcement Learning Approach | | 0
Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning | | 0
Accelerating Training in Pommerman with Imitation and Reinforcement Learning | | 0
Data Poisoning Attacks in Contextual Bandits | | 0
Data-pooling Reinforcement Learning for Personalized Healthcare Intervention | | 0
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning | | 0
Deep reinforcement learning for optical systems: A case study of mode-locked lasers | | 0
Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks | | 0
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning | | 0
A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior | | 0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | | 0
Data Valuation for Offline Reinforcement Learning | | 0
A Tensor Network Approach to Finite Markov Decision Processes | | 0
Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning | | 0
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning | | 0
DCE: Offline Reinforcement Learning With Double Conservative Estimates | | 0
DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments | | 0
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes | | 0
Group-Agent Reinforcement Learning | | 0
DDPG based on multi-scale strokes for financial time series trading strategy | | 0
Modified DDPG car-following model with a real-world human driving experience with CARLA simulator | | 0
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning | | 0
A Fast Convergence Theory for Offline Decision Making | | 0
Dealing with Limited Backhaul Capacity in Millimeter Wave Systems: A Deep Reinforcement Learning Approach | | 0
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning | | 0
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum | | 0
Dealing with Sparse Rewards Using Graph Neural Networks | | 0
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning | | 0
DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation | | 0
INTAGS: Interactive Agent-Guided Simulation | | 0
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning | | 0
Death and Suicide in Universal Artificial Intelligence | | 0
A SUMO Framework for Deep Reinforcement Learning Experiments Solving Electric Vehicle Charging Dispatching Problem | | 0
De-Biased Modelling of Search Click Behavior with Reinforcement Learning | | 0
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems | | 0
Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations | | 0
Decentralized Automotive Radar Spectrum Allocation to Avoid Mutual Interference Using Reinforcement Learning | | 0
Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning | | 0
A Succinct Summary of Reinforcement Learning | | 0
Deep Reinforcement Learning for NLP | | 0
A Gentle Lecture Note on Filtrations in Reinforcement Learning | | 0
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning | | 0
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure | | 0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems | | 0
Page 66 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified