SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1205112100 of 15113 papers

TitleStatusHype
A Reinforcement Learning-Based Framework for Solving Physical Design Routing Problem in the Absence of Large Test Sets0
How to Build User Simulators to Train RL-based Dialog SystemsCode0
Better Rewards Yield Better Summaries: Learning to Summarise Without ReferencesCode0
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorchCode2
Generalization in Transfer Learning0
Evolutionary reinforcement learning of dynamical large deviations0
Classification Betters Regression in Query-based Multi-document Summarisation Techniques for Question Answering: Macquarie University at BioASQ7b0
Logic and the 2-Simplicial Transformer0
Reinforcement Learning-based Automatic Diagnosis of Acute Appendicitis in Abdominal CT0
To Combine or Not To Combine? A Rainbow Deep Reinforcement Learning Agent for Dialog Policies0
Scalable Reinforcement-Learning-Based Neural Architecture Search for Cancer Deep Learning Research0
Generating Classical Chinese Poems from Vernacular ChineseCode0
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization0
Reinforcement learning with world model0
PaccMann^RL: Designing anticancer drugs from transcriptomic data via reinforcement learning0
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning0
An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase GenerationCode0
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented DialogCode0
Deep Actor-Critic Reinforcement Learning for Anomaly Detection0
Reinforcement Learning: Prediction, Control and Value Function Approximation0
Solving Math Word Problems with Double-Decoder Transformer0
Ensemble-Based Deep Reinforcement Learning for Chatbots0
Deep Reinforcement Learning for Chatbots Using Clustered Actions and Human-Likeness Rewards0
Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) for learning multi-goal, continuous action and state space controllersCode0
A Deep Reinforcement Learning Approach to Multi-component Job Scheduling in Edge Computing0
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning0
Dynamics-aware EmbeddingsCode0
Universal Policies to Learn Them AllCode0
A Comparison of Action Spaces for Learning Manipulation Tasks0
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision ProcessesCode0
Improving the dynamics of quantum sensors with reinforcement learning0
Reinforcement Learning in Healthcare: A Survey0
Practical Risk Measures in Reinforcement Learning0
Opponent Aware Reinforcement LearningCode0
On Convergence Rate of Adaptive Multiscale Value Function Approximation For Reinforcement Learning0
Analyzing Cyber-Physical Systems from the Perspective of Artificial Intelligence0
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy AdaptationCode0
Automated quantum programming via reinforcement learning for combinatorial optimizationCode0
Deep Reinforcement Learning for Foreign Exchange Trading0
Dialog State Tracking with Reinforced Data Augmentation0
A Deep Actor-Critic Reinforcement Learning Framework for Dynamic Multichannel Access0
Learning to Sit: Synthesizing Human-Chair Interactions via Hierarchical Control0
Reinforcement Learning is not a Causal problem0
ARAML: A Stable Adversarial Training Framework for Text GenerationCode0
A Domain-Knowledge-Aided Deep Reinforcement Learning Approach for Flight Control Design0
An Autonomous Performance Testing Framework using Self-Adaptive Fuzzy Reinforcement LearningCode0
A survey on intrinsic motivation in reinforcement learning0
Transfer in Deep Reinforcement Learning using Knowledge Graphs0
Mitigating Multi-Stage Cascading Failure by Reinforcement Learning0
Show:102550
← PrevPage 242 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified