SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1220112250 of 15113 papers

TitleStatusHype
Learning Action-Transferable Policy with Action EmbeddingCode0
Spatiotemporally Constrained Action Space Attacks on Deep Reinforcement Learning AgentsCode0
Rewarding Coreference Resolvers for Being Consistent with World KnowledgeCode0
Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning0
Quasi-Newton Optimization Methods For Deep Learning Applications0
No Press Diplomacy: Modeling Multi-Agent GameplayCode0
Learning sparse representations in reinforcement learning0
Learning Dynamic Context Augmentation for Global Entity LinkingCode0
Inductive-bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters0
Answers Unite! Unsupervised Metrics for Reinforced Summarization ModelsCode0
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning0
LeDeepChef: Deep Reinforcement Learning Agent for Families of Text-Based Games0
Augmented Memory Networks for Streaming-Based Active One-Shot Learning0
A Reinforcement Learning-Based Framework for Solving Physical Design Routing Problem in the Absence of Large Test Sets0
Generalization in Transfer Learning0
How to Build User Simulators to Train RL-based Dialog SystemsCode0
Better Rewards Yield Better Summaries: Learning to Summarise Without ReferencesCode0
Evolutionary reinforcement learning of dynamical large deviations0
Classification Betters Regression in Query-based Multi-document Summarisation Techniques for Question Answering: Macquarie University at BioASQ7b0
Logic and the 2-Simplicial Transformer0
Reinforcement Learning-based Automatic Diagnosis of Acute Appendicitis in Abdominal CT0
To Combine or Not To Combine? A Rainbow Deep Reinforcement Learning Agent for Dialog Policies0
Scalable Reinforcement-Learning-Based Neural Architecture Search for Cancer Deep Learning Research0
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization0
Generating Classical Chinese Poems from Vernacular ChineseCode0
Reinforcement learning with world model0
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning0
PaccMann^RL: Designing anticancer drugs from transcriptomic data via reinforcement learning0
Reinforcement Learning: Prediction, Control and Value Function Approximation0
Solving Math Word Problems with Double-Decoder Transformer0
An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase GenerationCode0
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented DialogCode0
Deep Actor-Critic Reinforcement Learning for Anomaly Detection0
Ensemble-Based Deep Reinforcement Learning for Chatbots0
Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) for learning multi-goal, continuous action and state space controllersCode0
Deep Reinforcement Learning for Chatbots Using Clustered Actions and Human-Likeness Rewards0
A Deep Reinforcement Learning Approach to Multi-component Job Scheduling in Edge Computing0
Dynamics-aware EmbeddingsCode0
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning0
Universal Policies to Learn Them AllCode0
A Comparison of Action Spaces for Learning Manipulation Tasks0
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision ProcessesCode0
Improving the dynamics of quantum sensors with reinforcement learning0
Reinforcement Learning in Healthcare: A Survey0
Opponent Aware Reinforcement LearningCode0
Practical Risk Measures in Reinforcement Learning0
On Convergence Rate of Adaptive Multiscale Value Function Approximation For Reinforcement Learning0
Dialog State Tracking with Reinforced Data Augmentation0
Deep Reinforcement Learning for Foreign Exchange Trading0
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy AdaptationCode0
Show:102550
← PrevPage 245 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified