SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 83018350 of 15113 papers

TitleStatusHype
Curious Exploration and Return-based Memory Restoration for Deep Reinforcement LearningCode0
BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning0
Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling0
Mean Field MARL Based Bandwidth Negotiation Method for Massive Devices Spectrum Sharing0
Mitigating Political Bias in Language Models Through Reinforced Calibration0
Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning0
Discrete-Time Mean Field Control with Environment States0
Emotional Contagion-Aware Deep Reinforcement Learning for Antagonistic Crowd Simulation0
Adapting to Reward Progressivity via Spectral Reinforcement LearningCode0
Hypernetwork Dismantling via Deep Reinforcement Learning0
Constructions in combinatorics via neural networksCode1
Medium Access using Distributed Reinforcement Learning for IoTs with Low-Complexity Wireless Transceivers0
Pre-training of Deep RL Agents for Improved Learning under Domain Randomization0
What is Going on Inside Recurrent Meta Reinforcement Learning Agents?0
Using Meta Reinforcement Learning to Bridge the Gap between Simulation and Experiment in Energy Demand Response0
Adversarial Inverse Reinforcement Learning for Mean Field Games0
Reward (Mis)design for Autonomous Driving0
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning0
A Reinforcement Learning Environment for Polyhedral Optimizations0
End-to-End Intersection Handling using Multi-Agent Deep Reinforcement Learning0
Implementing Reinforcement Learning Algorithms in Retail Supply Chains with OpenAI Gym Toolkit0
Controlling earthquake-like instabilities using artificial intelligence0
A Scalable and Reproducible System-on-Chip Simulation for Reinforcement LearningCode1
Adaptive Adversarial Training for Meta Reinforcement Learning0
Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients0
Computational Performance of Deep Reinforcement Learning to find Nash EquilibriaCode1
End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learningCode0
ANT: Learning Accurate Network Throughput for Better Adaptive Video Streaming0
Constraint-Guided Reinforcement Learning: Augmenting the Agent-Environment-InteractionCode1
A Deep Reinforcement Learning Approach for the Meal Delivery Problem0
Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand SystemsCode1
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies0
Safe Chance Constrained Reinforcement Learning for Batch Process ControlCode0
Reinforcement Learning using Guided Observability0
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention0
A learning gap between neuroscience and reinforcement learningCode0
Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data0
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
Policy Fusion for Adaptive and Customizable Reinforcement Learning Agents0
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks0
Tackling Variabilities in Autonomous Driving0
Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems0
CVLight: Decentralized Learning for Adaptive Traffic Signal Control with Connected Vehicles0
DRL: Deep Reinforcement Learning for Intelligent Robot Control -- Concept, Literature, and Future0
Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning0
Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning0
Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning0
MBRL-Lib: A Modular Library for Model-based Reinforcement LearningCode2
Outcome-Driven Reinforcement Learning via Variational Inference0
Model-predictive control and reinforcement learning in multi-energy system case studies0
Show:102550
← PrevPage 167 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified