SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1270112750 of 15113 papers

TitleStatusHype
Similarities between policy gradient methods (PGM) in Reinforcement learning (RL) and supervised learning (SL)0
Knowledge Flow: Improve Upon Your Teachers0
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey0
ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement LearningCode0
Safer Deep RL with Shallow MCTS: A Case Study in Pommerman0
RL-Based User Association and Resource Allocation for Multi-UAV enabled MEC0
"Jam Me If You Can'': Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications0
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning0
Teaching GANs to Sketch in Vector Format0
Reinforced Imitation in Heterogeneous Action Space0
Reinforcement Learning with Attention that Works: A Self-Supervised Approach0
Multi-Preference Actor Critic0
HOList: An Environment for Machine Learning of Higher-Order Theorem ProvingCode0
Self-Adapting Goals Allow Transfer of Predictive Models to New Tasks0
PaintBot: A Reinforcement Learning Approach for Natural Media Painting0
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement LearningCode0
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a SupercomputerCode0
Random Projection in Neural Episodic ControlCode0
Meta-learning Convolutional Neural Architectures for Multi-target Concrete Defect Classification with the COncrete DEfect BRidge IMage DatasetCode0
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents0
Centerline Depth World Reinforcement Learning-based Left Atrial Appendage Orifice Localization0
Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies0
Guided Meta-Policy Search0
Distributed Power Control for Large Energy Harvesting Networks: A Multi-Agent Deep Reinforcement Learning Approach0
Dynamically Optimal Treatment AllocationCode0
Cooperative Multi-Agent Reinforcement Learning Framework for Scalping Trading0
Power Control for Wireless VBR Video Streaming: From Optimization to Reinforcement Learning0
Risk Averse Robust Adversarial Reinforcement Learning0
Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints0
Autonomous Highway Driving using Deep Reinforcement Learning0
Learning Good Representation via Continuous Attention0
Improved Reinforcement Learning with Curriculum0
Towards Brain-inspired System: Deep Recurrent Reinforcement Learning for Simulated Self-driving Agent0
Robust Data Detection for MIMO Systems with One-Bit ADCs: A Reinforcement Learning Approach0
Wasserstein Dependency Measure for Representation Learning0
Regularizing Trajectory Optimization with Denoising Autoencoders0
Meta-Learning surrogate models for sequential decision making0
How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?Code0
Understanding the Relation Between Maximum-Entropy Inverse Reinforcement Learning and Behaviour Cloning0
Constructing Parsimonious Analytic Models for Dynamic Systems via Symbolic RegressionCode0
Generalized Off-Policy Actor-Critic0
Autoregressive Policies for Continuous Control Deep Reinforcement LearningCode0
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI gamesCode0
Energy Storage Management via Deep Q-Networks0
Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving0
Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus0
Interactions between Representation Learning and Supervision0
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies0
On the use of Deep Autoencoders for Efficient Embedded Reinforcement Learning0
Sub-Task Discovery with Limited Supervision: A Constrained Clustering Approach0
Show:102550
← PrevPage 255 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified