SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
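The loop described above (act, observe a reward, improve the policy) can be sketched on a toy problem. The example below is only an illustration under stated assumptions: it uses tabular Q-learning, which this page does not name, and a hypothetical five-state corridor in which the agent earns a reward of 1 for reaching the rightmost state. The names `step`, `train`, and `greedy_policy`, and all hyperparameter values, are made up for the sketch.

```python
import random

N_STATES = 5          # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment dynamics: return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy (assumed setup)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _episode in range(episodes):
        state = 0
        for _t in range(100):  # cap the episode length
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)  # explore, or break ties at random
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Bootstrapped target; a terminal state contributes no future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Pick the higher-valued action in every non-terminal state."""
    return [0 if q[s][0] > q[s][1] else 1 for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
```

On this corridor the learned greedy policy should move right in every state, since that is the shortest route to the only reward. The discount factor `gamma` is what turns "long-term reward" into a concrete quantity: a reward received k steps later is weighted by `gamma ** k`.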

Papers

Showing 9451-9500 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning | | 0 |
| Deep Reinforcement Learning of Transition States | | 0 |
| DeepMind Lab2D | Code | 1 |
| Active Reinforcement Learning: Observing Rewards at a Cost | | 0 |
| A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges | | 0 |
| Imposing Robust Structured Control Constraint on Reinforcement Learning of Linear Quadratic Regulator | | 0 |
| Gaussian RAM: Lightweight Image Classification via Stochastic Retina-Inspired Glimpse and Reinforcement Learning | Code | 1 |
| Hierarchical reinforcement learning for efficient exploration and transfer | | 0 |
| Griddly: A platform for AI research in games | | 0 |
| Self-supervised reinforcement learning for speaker localisation with the iCub humanoid robot | | 0 |
| Reinforcement Learning with Videos: Combining Offline Observations with Interaction | Code | 1 |
| Steady State Analysis of Episodic Reinforcement Learning | | 0 |
| Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network | Code | 1 |
| Adaptive Neural Architectures for Recommender Systems | | 0 |
| Non-local Optimization: Imposing Structure on Optimization Problems by Relaxation | | 0 |
| pymgrid: An Open-Source Python Microgrid Simulator for Applied Artificial Intelligence Research | Code | 1 |
| Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks | Code | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | | 0 |
| Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning | | 0 |
| Reinforcement Learning with Dual-Observation for General Video Game Playing | Code | 0 |
| Reinforcement Learning with Time-dependent Goals for Robotic Musicians | | 0 |
| Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement Learning | Code | 1 |
| On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension | | 0 |
| CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee | | 0 |
| Behaviorally Diverse Traffic Simulation via Reinforcement Learning | | 0 |
| Kinematics-Guided Reinforcement Learning for Object-Aware 3D Ego-Pose Estimation | | 0 |
| Dirichlet policies for reinforced factor portfolios | | 0 |
| Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network | | 0 |
| Perturbation-based exploration methods in deep reinforcement learning | | 0 |
| Model-based Reinforcement Learning from Signal Temporal Logic Specifications | | 0 |
| What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes | Code | 0 |
| Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms | | 0 |
| Optimizing Age of Information Through Aerial Reconfigurable Intelligent Surfaces: A Deep Reinforcement Learning Approach | | 0 |
| Challenges of Applying Deep Reinforcement Learning in Dynamic Dispatching | | 0 |
| Learning to Compose Hierarchical Object-Centric Controllers for Robotic Manipulation | | 0 |
| Combining Propositional Logic Based Decision Diagrams with Decision Making in Urban Systems | | 0 |
| Behavior Planning at Urban Intersections through Hierarchical Reinforcement Learning | | 0 |
| On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces | | 0 |
| Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning | Code | 1 |
| Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling | Code | 1 |
| Deep reinforcement learning for RAN optimization and control | | 0 |
| f-IRL: Inverse Reinforcement Learning via State Marginal Matching | Code | 1 |
| Automated Adversary Emulation for Cyber-Physical Systems via Reinforcement Learning | | 0 |
| Deep Reinforcement Learning for Navigation in AAA Video Games | | 0 |
| Safe Trajectory Planning Using Reinforcement Learning for Self Driving | | 0 |
| Trajectory Planning for Autonomous Vehicles Using Hierarchical Reinforcement Learning | Code | 1 |
| Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems | | 0 |
| Reinforcement Learning for Autonomous Driving with Latent State Inference and Spatial-Temporal Relationships | | 0 |
| Reinforcement Learning for Assignment problem | | 0 |
| Online Sparse Reinforcement Learning | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |