SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1160111650 of 15113 papers

TitleStatusHype
Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT0
Learning Sparse Representations Incrementally in Deep Reinforcement Learning0
ChainerRL: A Deep Reinforcement Learning Library0
Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning0
Intelligent Coordination among Multiple Traffic Intersections Using Multi-Agent Reinforcement Learning0
Learning Latent State Spaces for Planning through Reward Prediction0
Efficient Object Detection in Large Images using Deep Reinforcement LearningCode0
Unsupervised Curricula for Visual Meta-Reinforcement Learning0
Transformer Based Reinforcement Learning For Games0
Optimism in Reinforcement Learning with Generalized Linear Function Approximation0
Effects of a Social Force Model reward in Robot Navigation based on Deep Reinforcement Learning0
Increasing performance of electric vehicles in ride-hailing services using deep reinforcement learningCode0
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill DiscoveryCode0
From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions0
No-Regret Exploration in Goal-Oriented Reinforcement Learning0
Making Smart Homes Smarter: Optimizing Energy Consumption with Human in the Loop0
Observational Overfitting in Reinforcement Learning0
A pedestrian path-planning model in accordance with obstacle's danger with reinforcement learning0
How Does an Approximate Model Help in Reinforcement Learning?0
Deep Reinforcement Learning for Routing a Heterogeneous Fleet of Vehicles0
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of f-Regression Counterfactual Regret Minimization0
Iterative Policy-Space Expansion in Reinforcement Learning0
Reinforcement Learning with Non-Markovian Rewards0
Hindsight Credit AssignmentCode0
Inter-Level Cooperation in Hierarchical Reinforcement LearningCode0
Blind Inpainting of Large-scale Masks of Thin Structures with Adversarial and Reinforcement LearningCode0
Dynamic Pricing on E-commerce Platform with Deep Reinforcement Learning: A Field Experiment0
Reinforcement Learning with Convolutional Reservoir Computing0
Training Agents using Upside-Down Reinforcement LearningCode0
Scalable Reinforcement Learning for Multi-Agent Networked Systems0
Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to ActionsCode0
Reinforcement learning for bandwidth estimation and congestion control in real-time communications0
Deep Model Compression Via Two-Stage Deep Reinforcement Learning0
AlgaeDICE: Policy Gradient from Arbitrary Experience0
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning0
Mo' States Mo' Problems: Emergency Stop Mechanisms from ObservationCode0
Optimal Policies Tend to Seek PowerCode0
Self-Learned Formula Synthesis in Set Theory0
SafeLife 1.0: Exploring Side Effects in Complex EnvironmentsCode0
Policy Optimization Reinforcement Learning with Entropy Regularization0
Human-Robot Collaboration via Deep Reinforcement Learning of Real-World Interactions0
Just Ask:An Interactive Learning Framework for Vision and Language Navigation0
A Model-Based Reinforcement Learning with Adversarial Training for Online RecommendationCode0
Learning Generalizable Device Placement Algorithms for Distributed Machine LearningCode0
Adaptive Auxiliary Task Weighting for Reinforcement LearningCode0
Learning Local Search Heuristics for Boolean SatisfiabilityCode0
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement LearningCode0
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning0
Adversary A3C for Robust Reinforcement Learning0
Learning Reward Machines for Partially Observable Reinforcement LearningCode0
Show:102550
← PrevPage 233 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified