SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 65016550 of 15113 papers

TitleStatusHype
DISPATCH: Design Space Exploration of Cyber-Physical Systems0
Dissipative residual layers for unsupervised implicit parameterization of data manifolds0
Distal Explanations for Model-free Explainable Reinforcement Learning0
Distill2Explain: Differentiable decision trees for explainable reinforcement learning in energy application controllers0
Distillation Strategies for Proximal Policy Optimization0
Distilled Agent DQN for Provable Adversarial Robustness0
Distilled Domain Randomization0
Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning0
Distilling Deep RL Models Into Interpretable Neuro-Fuzzy Systems0
Distilling Neuron Spike with High Temperature in Reinforcement Learning Agents0
Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression0
Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning0
Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?0
DisTop: Discovering a Topological representation to learn diverse and rewarding skills0
Distort-and-Recover: Color Enhancement using Deep Reinforcement Learning0
Distral: Robust Multitask Reinforcement Learning0
Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks0
Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping0
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph0
Distributed Deep Q-Learning0
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox0
Distributed Deep Reinforcement Learning: An Overview0
Distributed Deep Reinforcement Learning for Intelligent Load Scheduling in Residential Smart Grids0
Distributed Deep Reinforcement Learning for Functional Split Control in Energy Harvesting Virtualized Small Cells0
Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing0
Distributed Deep Reinforcement Learning for Intelligent Traffic Monitoring with a Team of Aerial Robots0
Combining Contention-Based Spectrum Access and Adaptive Modulation using Deep Reinforcement Learning0
Distributed Edge Caching via Reinforcement Learning in Fog Radio Access Networks0
Distributed Energy Management and Demand Response in Smart Grids: A Multi-Agent Deep Reinforcement Learning Framework0
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control0
Distributed Learning on Heterogeneous Resource-Constrained Devices0
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control0
Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise0
Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC0
Distributed Multitask Reinforcement Learning with Quadratic Convergence0
Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus0
Distributed Policy Gradient with Variance Reduction in Multi-Agent Reinforcement Learning0
Distributed Power Control for Large Energy Harvesting Networks: A Multi-Agent Deep Reinforcement Learning Approach0
Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation0
Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control0
Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching0
Distributed Reinforcement Learning for Robot Teams: A Review0
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators0
Distributed Reinforcement Learning via Gossip0
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management0
Distributed traffic light control at uncoupled intersections with real-world topology by deep reinforcement learning0
Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning0
Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning0
Distributed Wildfire Surveillance with Autonomous Aircraft using Deep Reinforcement Learning0
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control0
Show:102550
← PrevPage 131 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified