SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 74517500 of 15113 papers

TitleStatusHype
Coordinated Random Access for Industrial IoT With Correlated Traffic By Reinforcement-Learning0
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach0
Carl-Lead: Lidar-based End-to-End Autonomous Driving with Contrastive Deep Reinforcement Learning0
Soft Actor-Critic With Integer Actions0
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning0
Reinforcement Learning on Encrypted Data0
Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing0
Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning0
Estimation of Warfarin Dosage with Reinforcement LearningCode0
Short Quantum Circuits in Reinforcement Learning Policies for the Vehicle Routing Problem0
What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Balancing detectability and performance of attacks on the control channel of Markov Decision ProcessesCode0
Back to Basics: Deep Reinforcement Learning in Traffic Signal ControlCode0
DCUR: Data Curriculum for Teaching via Samples with Reinforcement LearningCode0
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback0
Automatically Exposing Problems with Neural Dialog ModelsCode0
WaveCorr: Correlation-savvy Deep Reinforcement Learning for Portfolio ManagementCode0
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation0
Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal LocomotionCode1
Towards optimized actions in critical situations of soccer games with deep reinforcement learningCode0
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain0
Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents0
Few-shot Quality-Diversity OptimizationCode0
DSDF: An approach to handle stochastic agents in collaborative multi-agent reinforcement learning0
Gradient Imitation Reinforcement Learning for Low Resource Relation ExtractionCode1
Learning to Navigate Intersections with Unsupervised Driver Trait InferenceCode1
Dependability Analysis of Deep Reinforcement Learning based Robotics and Autonomous Systems through Probabilistic Model CheckingCode0
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach0
A Practical Adversarial Attack on Contingency Detection of Smart Energy Systems0
safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning in RoboticsCode1
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods0
Learning-to-defer for sequential medical decision-making under uncertainty0
RADARS: Memory Efficient Reinforcement Learning Aided Differentiable Neural Architecture Search0
Reinforcement Learning for Load-balanced Parallel Particle Tracing0
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless Powered MEC Networks with Fairness Constraint0
Direct Random Search for Fine Tuning of Deep Reinforcement Learning PoliciesCode0
Federated Ensemble Model-based Reinforcement Learning in Edge Computing0
Concave Utility Reinforcement Learning with Zero-Constraint Violations0
EMVLight: A Decentralized Reinforcement Learning Framework for Efficient Passage of Emergency Vehicles0
Learning Selective Communication for Multi-Agent Path FindingCode1
A Socially Aware Reinforcement Learning Agent for The Single Track Road Problem0
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation0
Financial Trading with Feature Preprocessing and Recurrent Reinforcement Learning0
Physics-based Deep LearningCode2
Optimizing a domestic battery and solar photovoltaic system with deep reinforcement learning0
Projected State-action Balancing Weights for Offline Reinforcement Learning0
Multi-agent deep reinforcement learning (MADRL) meets multi-user MIMO systems0
Show:102550
← PrevPage 150 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified