SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 70517100 of 15113 papers

TitleStatusHype
Error-related Potential driven Reinforcement Learning for adaptive Brain-Computer Interfaces0
Escape Room: A Configurable Testbed for Hierarchical Reinforcement Learning0
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization0
ES Is More Than Just a Traditional Finite-Difference Approximator0
Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches0
Estimating player completion rate in mobile puzzle games using reinforcement learning0
Estimating scale-invariant future in continuous time0
ETHER: Aligning Emergent Communication for Hindsight Experience Replay0
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model0
Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach0
Evading Web Application Firewalls with Reinforcement Learning0
Evaluating Disentanglement in Generative Models Without Knowledge of Latent Factors0
Evaluating Generalisation in General Video Game Playing0
Evaluating Human-like Explanations for Robot Actions in Reinforcement Learning Scenarios0
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks0
Evaluating Persuasion Strategies and Deep Reinforcement Learning methods for Negotiation Dialogue agents0
Evaluating Pretrained models for Deployable Lifelong Learning0
Evaluating Reinforcement Learning Algorithms in Observational Health Settings0
Evaluating Reinforcement Learning Safety and Trustworthiness in Cyber-Physical Systems0
Evaluating Robustness of Cooperative MARL0
Attacking c-MARL More Effectively: A Data Driven Approach0
Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping0
Evaluating State Representations for Reinforcement Learning of Turn-Taking Policies in Tutorial Dialogue0
Evaluating the Impact of Multiple DER Aggregators on Wholesale Energy Markets: A Hybrid Mean Field Approach0
Evaluating the Perceived Safety of Urban City via Maximum Entropy Deep Inverse Reinforcement Learning0
Evaluating the Safety of Deep Reinforcement Learning Models using Semi-Formal Verification0
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels0
Evaluation of Active Feature Acquisition Methods for Static Feature Settings0
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi0
Evaluation of Look-ahead Economic Dispatch Using Reinforcement Learning0
Evaluation of Online Dialogue Policy Learning Techniques0
Evaluation-Time Policy Switching for Offline Reinforcement Learning0
Event Discovery for History Representation in Reinforcement Learning0
Event-Driven Models0
Event Extraction with Generative Adversarial Imitation Learning0
Event Identification as a Decision Process with Non-linear Representation of Text0
Event Tables for Efficient Experience Replay0
Evolution and The Knightian Blindspot of Machine Learning0
Evolutionarily-Curated Curriculum Learning for Deep Reinforcement Learning Agents0
Evolutionary algorithms for constructing an ensemble of decision trees0
Evolutionary Deep Reinforcement Learning Using Elite Buffer: A Novel Approach Towards DRL Combined with EA in Continuous Control Tasks0
Evolutionary Deep Reinforcement Learning for Dynamic Slice Management in O-RAN0
Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement Learning0
Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing0
Evolutionary Policy Optimization0
Evolutionary Policy Optimization0
Evolutionary Quantum Architecture Search for Parametrized Quantum Circuits0
Evolutionary Reinforcement Learning: A Survey0
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination0
Evolutionary Reinforcement Learning for Interpretable Decision-Making in Supply Chain Management0
Show:102550
← PrevPage 142 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified