SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 98019850 of 15113 papers

TitleStatusHype
Stability-Certified Reinforcement Learning via Spectral Normalization0
Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device0
Deep Reinforcement Learning for Long-Short Portfolio OptimizationCode0
Towards Continual Reinforcement Learning: A Review and Perspectives0
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II0
Unsupervised deep clustering and reinforcement learning can accurately segment MRI brain tumors with very small training sets0
SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning0
A State Representation Dueling Network for Deep Reinforcement Learning0
Learning Vehicle Routing Problems using Policy Optimisation0
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search0
Assured RL: Reinforcement Learning with Almost Sure Constraints0
Deep Stock Trading: A Hierarchical Reinforcement Learning Framework for Portfolio Optimization and Order Execution0
Intelligent Reflecting Surface Assisted Anti-Jamming Communications Based on Reinforcement Learning0
Rethink AI-based Power Grid Control: Diving Into Algorithm Design0
Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer0
Self-Imitation Advantage Learning0
QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement LearningCode0
Intelligent Resource Allocation in Dense LoRa Networks using Deep Reinforcement Learning0
A Dynamic Penalty Function Approach for Constraints-Handling in Reinforcement Learning0
Difference Rewards Policy Gradients0
Explicitly Encouraging Low Fractional Dimensional Trajectories Via Reinforcement LearningCode0
myGym: Modular Toolkit for Visuomotor Robotic Tasks0
Mobile Robot Planner with Low-cost Cameras Using Deep Reinforcement Learning0
Reinforcement Learning-based Product Delivery Frequency Control0
Model-Based Actor-Critic with Chance Constraint for Stochastic System0
Quantum reinforcement learning in continuous action space0
Minimax Strikes Back0
Reinforcement Learning for Test Case Prioritization0
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
Hierarchical principles of embodied reinforcement learning: A review0
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning0
Improving the Efficient Neural Architecture Search via Rewarding ModificationsCode0
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey0
Interactive Question Clarification in Dialogue via Reinforcement Learning0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning0
Towards Optimal District Heating Temperature Control in China with Deep Reinforcement Learning0
ViNG: Learning Open-World Navigation with Visual Goals0
MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning0
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FELCode0
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation0
A comparative evaluation of machine learning methods for robot navigation through human crowds0
CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving0
Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Grounding Artificial Intelligence in the Origins of Human Behavior0
Gegelati: Lightweight Artificial Intelligence through Generic and Evolvable Tangled Program Graphs0
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes0
Super Reinforcement Bros: Playing Super Mario Bros with Reinforcement LearningCode0
Train a snake with reinforcement learning algorithms0
Learning for MPC with Stability & Safety Guarantees0
Show:102550
← PrevPage 197 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified