SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 97019750 of 15113 papers

TitleStatusHype
MLGOPerf: An ML Guided Inliner to Optimize Performance0
MLGym: A New Framework and Benchmark for Advancing AI Research Agents0
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning0
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning0
MOBA: Multi-teacher Model Based Reinforcement Learning0
Mobile Cellular-Connected UAVs: Reinforcement Learning for Sky Limits0
Mobile Networks for Computer Go0
Mobile Robot Planner with Low-cost Cameras Using Deep Reinforcement Learning0
Mobile Robots Autonomous Exploration with Reinforcement Learning0
Mobile Robots Exploration via Deep Reinforcement Learning0
Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control0
Mobility Management for Cellular-Connected UAVs: A Learning-Based Approach0
Modality-Buffet for Real-Time Object Detection0
Model-agnostic Counterfactual Synthesis Policy for Interactive Recommendation0
Model-Agnostic Learning to Meta-Learn0
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks0
Model-Based Actor-Critic with Chance Constraint for Stochastic System0
Model-based adaptation for sample efficient transfer in reinforcement learning control of parameter-varying systems0
Model-based Bayesian Reinforcement Learning for Dialogue Management0
Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian0
Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization0
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey0
Model-based Dynamic Shielding for Safe and Efficient Multi-Agent Reinforcement Learning0
Model-Based Episodic Memory Induces Dynamic Hybrid Controls0
Model-based imitation learning from state trajectories0
Model-Based Imitation Learning Using Entropy Regularization of Model and Policy0
Model-Based Inverse Reinforcement Learning from Visual Demonstrations0
Model-based Lookahead Reinforcement Learning0
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models0
Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping0
Model based Multi-agent Reinforcement Learning with Tensor Decompositions0
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects0
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity0
Model-Based Offline Meta-Reinforcement Learning with Regularization0
Model-Based Offline Planning0
Model-based Offline Reinforcement Learning with Local Misspecification0
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation0
Model Based Planning with Energy Based Models0
Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation0
Model-based Policy Search for Partially Measurable Systems0
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks0
Model-based Reinforcement Learning and the Eluder Dimension0
Model-based Reinforcement Learning: A Survey0
Model-Based Reinforcement Learning Exploiting State-Action Equivalence0
Model-based reinforcement learning for biological sequence design0
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books0
Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements0
Model-Based Reinforcement Learning for Approximate Optimal Control with Temporal Logic Specifications0
Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-level0
Model-Based Reinforcement Learning via Stochastic Hybrid Models0
Show:102550
← PrevPage 195 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified