SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1030110350 of 15113 papers

TitleStatusHype
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play0
Test-Cost Sensitive Methods for Identifying Nearby Points0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
Disentangling causal effects for hierarchical reinforcement learning0
Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning0
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban0
Interactive Reinforcement Learning for Feature Selection with Decision Tree in the Loop0
MADRaS : Multi Agent Driving Simulator0
Reinforcement Learning of Sequential Price Mechanisms0
Multi-Reward based Reinforcement Learning for Neural Machine Translation0
Emergent Social Learning via Multi-agent Reinforcement Learning0
Student-Initiated Action Advising via Advice NoveltyCode0
Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs0
Bayesian Meta-reinforcement Learning for Traffic Signal Control0
Recognition Method of Important Words in Korean Text based on Reinforcement Learning0
Deep Reinforcement Learning with Mixed Convolutional Network0
Bridging the gap between Markowitz planning and deep reinforcement learning0
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning0
Deep Reinforcement Learning for Efficient Measurement of Quantum DevicesCode0
Accelerating Optimization and Reinforcement Learning with Quasi-Stochastic Approximation0
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network0
AAMDRL: Augmented Asset Management with Deep Reinforcement Learning0
Entropy Regularization for Mean Field Games with Learning0
Toolpath design for additive manufacturing using deep reinforcement learning0
Teacher-Critical Training Strategies for Image Captioning0
Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks0
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network0
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive LearningCode0
Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments0
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current PolicyCode0
Cross Learning in Deep Q-Networks0
Trust-Region Method with Deep Reinforcement Learning in Analog Design Space Exploration0
Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions0
Agent Environment Cycle Games0
Jointly-Trained State-Action Embedding for Efficient Reinforcement Learning0
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon0
Deep Reinforcement Learning for DER Cyber-Attack Mitigation0
Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs0
The Emergence of Individuality in Multi-Agent Reinforcement Learning0
MDP Playground: Controlling Orthogonal Dimensions of Hardness in Toy Environments0
REPAINT: Knowledge Transfer in Deep Actor-Critic Reinforcement Learning0
Neuron Activation Analysis for Multi-Joint Robot Reinforcement Learning0
Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning0
Towards Heterogeneous Multi-Agent Reinforcement Learning with Graph Neural Networks0
What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator0
Transfer among Agents: An Efficient Multiagent Transfer Learning Framework0
Virtual Experience to Real World Application: Sidewalk Obstacle Avoidance Using Reinforcement Learning for Visually Impaired0
Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning0
Machine Learning in Event-Triggered Control: Recent Advances and Open Issues0
Scalable Deep Reinforcement Learning for Ride-Hailing0
Show:102550
← PrevPage 207 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified