SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 995110000 of 15113 papers

TitleStatusHype
Reinforcement Learning-based Black-Box Evasion Attacks to Link Prediction in Dynamic Graphs0
Flightmare: A Flexible Quadrotor SimulatorCode2
Data-driven Outer-Loop Control Using Deep Reinforcement Learning for Trajectory Tracking0
Beyond variance reduction: Understanding the true impact of baselines on policy optimization0
Control of a Nature-inspired Scorpion using Reinforcement Learning0
Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL0
Ranking Policy DecisionsCode0
Deep Reinforcement Learning for Contact-Rich Skills Using Compliant Movement Primitives0
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems0
How does the structure embedded in learning policy affect learning quadruped locomotion?0
Reinforcement Learning with Feedback-modulated TD-STDP0
Real-world Video Adaptation with Reinforcement Learning0
Sample Efficiency in Sparse Reinforcement Learning: Or Your Money Back0
Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles0
On the model-based stochastic value gradient for continuous reinforcement learningCode1
Query Focused Multi-document Summarisation of Biomedical Texts: Macquarie Universiy and the Australian National University at BioASQ8bCode0
Market-making with reinforcement-learning (SAC)Code1
Controlling Level of Unconsciousness by Titrating Propofol with Deep Reinforcement Learning0
AutoFS: Automated Feature Selection via Diversity-aware Interactive Reinforcement LearningCode0
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI0
The Advantage Regret-Matching Actor-Critic0
Query Focused Multi-document Summarisation of Biomedical TextsCode0
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning0
Synthetic Sample Selection via Reinforcement Learning0
Identifying Critical States by the Action-Based Variance of Expected Return0
Constrained Markov Decision Processes via Backward Value Functions0
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon0
Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation0
t-Soft Update of Target Network for Deep Reinforcement Learning0
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing0
Ensuring Monotonic Policy Improvement in Entropy-regularized Value-based Reinforcement Learning0
Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep Reinforcement Learning Approach0
Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning0
Improved Memories Learning0
Learning Off-Policy with Online PlanningCode1
DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets0
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning0
Mobile Networks for Computer Go0
Social-Aware Incentive Mechanism for VehicularCrowdsensing by Deep Reinforcement LearningCode1
Model-Free Episodic Control with State Aggregation0
Biomechanic Posture Stabilisation via Iterative Training of Multi-policy Deep Reinforcement Learning Agents0
Adversarial Imitation Learning via Random Search0
Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication0
A Composable Specification Language for Reinforcement Learning TasksCode1
Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems0
NANCY: Neural Adaptive Network Coding methodologY for video distribution over wireless networks0
Static Neural Compiler Optimization via Deep Reinforcement Learning0
Model-free optimal control of discrete-time systems with additive and multiplicative noises0
Reinforcement Learning based dynamic weighing of Ensemble Models for Time Series Forecasting0
Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learningCode1
Show:102550
← PrevPage 200 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified