SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from the feedback it receives, in the form of rewards or penalties, for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
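To make the agent-environment loop described above concrete, here is a minimal sketch using tabular Q-learning, which is only one of many RL algorithms and is chosen here purely for illustration. The five-state corridor environment, the `step`, `train`, and `greedy_policy` helpers, and all hyperparameter values are assumptions made for this example and do not come from any paper listed on this page.

```python
import random

N_STATES = 5      # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    # Reward 1 only when the goal is reached, 0 otherwise.
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, otherwise act greedily
            # (ties between equally valued actions are broken at random).
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted best
            # estimated value of the next state (zero at the terminal state).
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action per non-terminal state according to the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned policy moves right in every non-terminal state, since that is the shortest path to the only reward. The discount factor `gamma` is what turns the single terminal reward into a long-term objective: states farther from the goal receive geometrically smaller value estimates.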

Papers

Showing 4901–4950 of 15113 papers

Title | Status | Hype
A Review of Tracking, Prediction and Decision Making Methods for Autonomous Driving |  | 0
A Review of Uncertainty for Deep Reinforcement Learning |  | 0
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges |  | 0
Car-Following Models: A Multidisciplinary Review |  | 0
Argumentative Reward Learning: Reasoning About Human Preferences |  | 0
Argus: Smartphone-enabled Human Cooperation via Multi-Agent Reinforcement Learning for Disaster Situational Awareness |  | 0
ARIA: Training Language Agents with Intention-Driven Reward Aggregation |  | 0
A Policy Optimization Method Towards Optimal-time Stability |  | 0
A Roadmap Towards Improving Multi-Agent Reinforcement Learning With Causal Discovery And Inference |  | 0
A Robotic Model of Hippocampal Reverse Replay for Reinforcement Learning |  | 0
A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems |  | 0
A Robust Fuel Optimization Strategy For Hybrid Electric Vehicles: A Deep Reinforcement Learning Based Continuous Time Design Approach |  | 0
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch |  | 0
Artificial Intelligence Approaches To UCAV Autonomy |  | 0
Artificial Intelligence as Structural Estimation: Economic Interpretations of Deep Blue, Bonanza, and AlphaGo |  | 0
Artificial Intelligence-based Decision Support Systems for Precision and Digital Health |  | 0
Artificial Intelligence in Vehicular Wireless Networks: A Case Study Using ns-3 |  | 0
A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning |  | 0
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants |  | 0
A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control |  | 0
Safe Model-Based Reinforcement Learning for Systems with Parametric Uncertainties |  | 0
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering |  | 0
A Scalable Deep Reinforcement Learning Model for Online Scheduling Coflows of Multi-Stage Jobs for High Performance Computing |  | 0
A Scalable Finite Difference Method for Deep Reinforcement Learning |  | 0
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions |  | 0
A Scalable Reinforcement Learning Approach for Attack Allocation in Swarm to Swarm Engagement Problems |  | 0
A Scalable Reinforcement Learning-based System Using On-Chain Data for Cryptocurrency Portfolio Management |  | 0
A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis |  | 0
A Secure Learning Control Strategy via Dynamic Camouflaging for Unknown Dynamical Systems under Attacks |  | 0
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction |  | 0
A storage expansion planning framework using reinforcement learning and simulation-based optimization |  | 0
ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning |  | 0
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play |  | 0
A Complete Characterization of Linear Estimators for Offline Policy Evaluation |  | 0
A Short Note on Soft-max and Policy Gradients in Bandits Problems |  | 0
A Short Note on the Relationship of Information Gain and Eluder Dimension |  | 0
A Short Survey On Memory Based Reinforcement Learning |  | 0
A Short Survey on Probabilistic Reinforcement Learning |  | 0
A short variational proof of equivalence between policy gradients and soft Q learning |  | 0
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony |  | 0
A Signaling Game Approach to Databases Querying and Interaction |  | 0
A Simple Imitation Learning Method via Contrastive Regularization |  | 0
A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning |  | 0
A Simple Reward-free Approach to Constrained Reinforcement Learning |  | 0
A Simple Sparse Denoising Layer for Robust Deep Learning |  | 0
A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning |  | 0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning |  | 0
Novelty Detection in Reinforcement Learning with World Models |  | 0
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences |  | 0
Ask1: Development and Reinforcement Learning-Based Control of a Custom Quadruped Robot |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 |  | Unverified
2 | PPO | Mean Normalized Performance | 0.58 |  | Unverified