SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 82518300 of 15113 papers

TitleStatusHype
A Reinforcement Learning Environment for Multi-Service UAV-enabled Wireless SystemsCode1
Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive Environments0
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning0
Return-based Scaling: Yet Another Normalisation Trick for Deep RL0
Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty0
Reinforcement Learning from Reformulations in Conversational Question Answering over Knowledge GraphsCode1
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation PerspectiveCode1
Efficient Self-Supervised Data Collection for Offline Robot Learning0
Adaptive Policy Transfer in Reinforcement Learning0
Dynamic Multichannel Access via Multi-agent Reinforcement Learning: Throughput and Fairness Guarantees0
A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker EnvironmentCode0
Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning0
Reinforcement learning of rare diffusive dynamics0
Parameter-free Gradient Temporal Difference Learning0
PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene Rearrangement Planning0
Reinforcement Learning with Expert Trajectory For Quantitative Trading0
Improving Cost Learning for JPEG Steganography by Exploiting JPEG Domain Knowledge0
Differentiable Neural Architecture Search for Extremely Lightweight Image Super-ResolutionCode1
A parallel-network continuous quantitative trading model with GARCH and PPO0
RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning0
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies0
Evening the Score: Targeting SARS-CoV-2 Protease Inhibition in Graph Generative Models for Therapeutic CandidatesCode1
Deep reinforcement learning-designed radiofrequency waveform in MRICode1
Using reinforcement learning to design an AI assistantfor a satisfying co-op experience0
Utilizing Skipped Frames in Action Repeats via Pseudo-Actions0
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise RolloutsCode1
Reward prediction for representation learning and reward shaping0
Deep Graph Convolutional Reinforcement Learning for Financial Portfolio Management -- DeepPocket0
A Reinforcement Learning-based Economic Model Predictive Control Framework for Autonomous Operation of Chemical Reactors0
Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization ProblemsCode1
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning0
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance0
Solving Sokoban with forward-backward reinforcement learning0
Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network0
UVIP: Model-Free Approach to Evaluate Reinforcement Learning AlgorithmsCode0
Learning Algorithms for Regenerative Stopping Problems with Applications to Shipping Consolidation in Logistics0
Reinforcement Learning for Scalable Logic Optimization with Graph Neural Networks0
On the Linear convergence of Natural Policy Gradient Algorithm0
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning0
Data-Efficient Reinforcement Learning for Malaria Control0
Deep Reinforcement Learning for Adaptive Exploration of Unknown EnvironmentsCode1
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference0
Learning swimming escape patterns for larval fish under energy constraints0
Hierarchical Reinforcement Learning for Air-to-Air Combat0
Robotic Surgery With Lean Reinforcement LearningCode0
RL-IoT: Reinforcement Learning to Interact with IoT DevicesCode1
Reinforcement Learning for Ridesharing: An Extended Survey0
Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning0
CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Show:102550
← PrevPage 166 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified