SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1140111450 of 15113 papers

TitleStatusHype
Measuring the Reliability of Reinforcement Learning AlgorithmsCode0
Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT0
Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning0
Learning to Code: Coded Caching via Deep Reinforcement Learning0
Learning Latent State Spaces for Planning through Reward Prediction0
Efficient Object Detection in Large Images using Deep Reinforcement LearningCode0
Intelligent Coordination among Multiple Traffic Intersections Using Multi-Agent Reinforcement Learning0
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances0
ChainerRL: A Deep Reinforcement Learning Library0
Learning Sparse Representations Incrementally in Deep Reinforcement Learning0
Optimism in Reinforcement Learning with Generalized Linear Function Approximation0
Unsupervised Curricula for Visual Meta-Reinforcement Learning0
Transformer Based Reinforcement Learning For Games0
Effects of a Social Force Model reward in Robot Navigation based on Deep Reinforcement Learning0
Increasing performance of electric vehicles in ride-hailing services using deep reinforcement learningCode0
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill DiscoveryCode0
From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions0
No-Regret Exploration in Goal-Oriented Reinforcement Learning0
Deep Reinforcement Learning for Routing a Heterogeneous Fleet of Vehicles0
A pedestrian path-planning model in accordance with obstacle's danger with reinforcement learning0
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of f-Regression Counterfactual Regret Minimization0
How Does an Approximate Model Help in Reinforcement Learning?0
Observational Overfitting in Reinforcement Learning0
VALAN: Vision and Language Agent NavigationCode1
Making Smart Homes Smarter: Optimizing Energy Consumption with Human in the Loop0
Reinforcement Learning with Convolutional Reservoir Computing0
Scalable Reinforcement Learning for Multi-Agent Networked Systems0
Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to ActionsCode0
Training Agents using Upside-Down Reinforcement LearningCode0
Reinforcement Learning with Non-Markovian Rewards0
Dynamic Pricing on E-commerce Platform with Deep Reinforcement Learning: A Field Experiment0
Inter-Level Cooperation in Hierarchical Reinforcement LearningCode0
Hindsight Credit AssignmentCode0
Blind Inpainting of Large-scale Masks of Thin Structures with Adversarial and Reinforcement LearningCode0
Iterative Policy-Space Expansion in Reinforcement Learning0
Simplified Action Decoder for Deep Multi-Agent Reinforcement LearningCode1
Deep Model Compression Via Two-Stage Deep Reinforcement Learning0
Reinforcement learning for bandwidth estimation and congestion control in real-time communications0
AlgaeDICE: Policy Gradient from Arbitrary Experience0
Optimal Policies Tend to Seek PowerCode0
Mo' States Mo' Problems: Emergency Stop Mechanisms from ObservationCode0
Self-Learned Formula Synthesis in Set Theory0
SafeLife 1.0: Exploring Side Effects in Complex EnvironmentsCode0
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning0
Dream to Control: Learning Behaviors by Latent ImaginationCode1
Leveraging Procedural Generation to Benchmark Reinforcement LearningCode2
Human-Robot Collaboration via Deep Reinforcement Learning of Real-World Interactions0
Just Ask:An Interactive Learning Framework for Vision and Language Navigation0
Policy Optimization Reinforcement Learning with Entropy Regularization0
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning0
Show:102550
← PrevPage 229 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified