SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 49515000 of 15113 papers

TitleStatusHype
Intrinsic fluctuations of reinforcement learning promote cooperationCode0
A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization0
Deep reinforcement learning for quantum multiparameter estimation0
Transmit Power Control for Indoor Small Cells: A Method Based on Federated Reinforcement Learning0
Rethinking Conversational Recommendations: Is Decision Tree All You Need?Code1
A stabilizing reinforcement learning approach for sampled systems with partially unknown models0
Deep Anomaly Detection and Search via Reinforcement Learning0
Cell-Free Latent Go-ExploreCode1
Style-Agnostic Reinforcement LearningCode1
Model-Based Reinforcement Learning with SINDy0
A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space0
An Analysis of Model-Based Reinforcement Learning From Abstracted Observations0
Beyond Supervised Continual Learning: a Review0
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control0
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement LearningCode1
Evolutionary Deep Reinforcement Learning for Dynamic Slice Management in O-RAN0
Digital Twin Assisted Risk-Aware Sleep Mode Management Using Deep Q-Networks0
Reinforcement Learning for Hardware Security: Opportunities, Developments, and Challenges0
Understanding the Limits of Poisoning Attacks in Episodic Reinforcement Learning0
Categorical semantics of compositional reinforcement learning0
Goal-Conditioned Q-Learning as Knowledge DistillationCode0
Normality-Guided Distributional Reinforcement Learning for Continuous Control0
Unsupervised Representation Learning in Deep Reinforcement Learning: A ReviewCode0
SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning0
RL-DistPrivacy: Privacy-Aware Distributed Deep Inference for low latency IoT systems0
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning0
DETERRENT: Detecting Trojans using Reinforcement Learning0
ATTRITION: Attacking Static Hardware Trojan Detection Techniques Using Reinforcement Learning0
An approach to implement Reinforcement Learning for Heterogeneous Vehicular Networks0
Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems0
Reinforcement Learning based Multi-connectivity Resource Allocation in Factory Automation Systems0
Socially Fair Reinforcement Learning0
Visual processing in context of reinforcement learning0
Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models0
Play with Emotion: Affect-Driven Reinforcement Learning0
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement LearningCode1
Importance Prioritized Policy DistillationCode0
Light-weight probing of unsupervised representations for Reinforcement LearningCode1
A Comparison of Reinforcement Learning Frameworks for Software Testing TasksCode0
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review0
Learning Task Automata for Reinforcement Learning using Hidden Markov Models0
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems0
UAS Navigation in the Real World Using Visual Observation0
Variance Reduction based Experience Replay for Policy OptimizationCode0
Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path0
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation0
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration0
A model-based approach to meta-Reinforcement Learning: Transformers and tree search0
Show:102550
← PrevPage 100 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified