SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1210112150 of 15113 papers

TitleStatusHype
Multi-task Learning and Catastrophic Forgetting in Continual Reinforcement LearningCode0
Deep Reinforcement Learning with Modulated Hebbian plus Q Network ArchitectureCode0
Leveraging Human Guidance for Deep Reinforcement Learning Tasks0
Bayesian Optimization for Iterative LearningCode0
A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning0
How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?0
Redirection Controller Using Reinforcement Learning0
On the Convergence of Approximate and Regularized Policy Iteration Schemes0
Meta-Inverse Reinforcement Learning with Probabilistic Context VariablesCode0
Robot Sound Interpretation: Combining Sight and Sound in Learning-Based Control0
Instance-dependent _-bounds for policy evaluation in tabular reinforcement learning0
MACS: Deep Reinforcement Learning based SDN Controller Synchronization Policy Design0
Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games0
Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator0
Segregation Dynamics with Reinforcement Learning and Agent Based Modeling0
Sample Efficient Policy Gradient Methods with Recursive Variance ReductionCode0
Automated Lane Change Decision Making using Deep Reinforcement Learning in Dynamic and Uncertain Highway Environment0
Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach0
A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning0
DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning0
A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming0
Controllable Length Control Neural Encoder-Decoder via Reinforcement Learning0
A Review of Tracking, Prediction and Decision Making Methods for Autonomous Driving0
Hierarchical Reinforcement Learning for Open-Domain Dialog0
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning0
Adversarial Feature Training for Generalizable Robotic Visuomotor Control0
Generating Black-Box Adversarial Examples for Text Classifiers Using a Deep Reinforced Model0
Stock market microstructure inference via multi-agent reinforcement learning0
MDP Playground: An Analysis and Debug Testbed for Reinforcement LearningCode0
Off-road Autonomous Vehicles Traversability Analysis and Trajectory Planning Based on Deep Inverse Reinforcement Learning0
Meta Reinforcement Learning for Sim-to-real Domain Adaptation0
Leveraging human Domain Knowledge to model an empirical Reward function for a Reinforcement Learning problem0
Data Centers Job Scheduling with Deep Reinforcement Learning0
Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement LearningCode0
Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces0
Driving in Dense Traffic with Model-Free Reinforcement LearningCode0
Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space0
State Representation Learning from Demonstration0
Wield: Systematic Reinforcement Learning With Progressive Randomization0
Model Based Planning with Energy Based Models0
Node Injection Attacks on Graphs via Reinforcement Learning0
Flight Controller Synthesis Via Deep Reinforcement LearningCode0
Learning to Recover Sparse Signals0
Active Learning for Risk-Sensitive Inverse Reinforcement Learning0
HJB Optimal Feedback Control with Deep Differential Value Functions and Action Constraints0
DL2: A Deep Learning-driven Scheduler for Deep Learning ClustersCode0
AITuning: Machine Learning-based Tuning Tool for Run-Time Communication Libraries0
Petri Net Machines for Human-Agent Interaction0
Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies0
Say What I Want: Towards the Dark Side of Neural Dialogue Models0
Show:102550
← PrevPage 243 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified