SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 63266350 of 15113 papers

TitleStatusHype
Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models0
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning0
DETERRENT: Detecting Trojans using Reinforcement Learning0
An approach to implement Reinforcement Learning for Heterogeneous Vehicular Networks0
Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems0
ATTRITION: Attacking Static Hardware Trojan Detection Techniques Using Reinforcement Learning0
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review0
A Comparison of Reinforcement Learning Frameworks for Software Testing TasksCode0
Importance Prioritized Policy DistillationCode0
Learning Task Automata for Reinforcement Learning using Hidden Markov Models0
UAS Navigation in the Real World Using Visual Observation0
Turning Mathematics Problems into Games: Reinforcement Learning and Gröbner bases together solve Integer Feasibility Problems0
Variance Reduction based Experience Replay for Policy OptimizationCode0
Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path0
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning0
A model-based approach to meta-Reinforcement Learning: Transformers and tree search0
Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation0
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration0
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning0
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm0
Evolutionary Quantum Architecture Search for Parametrized Quantum Circuits0
GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers0
What deep reinforcement learning tells us about human motor learning and vice-versa0
Solving Royal Game of Ur Using Reinforcement LearningCode0
Quantum Multi-Agent Meta Reinforcement Learning0
Show:102550
← PrevPage 254 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified