SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 93019350 of 15113 papers

TitleStatusHype
Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction0
Metalearning Using Structure-rich Pipeline Representations for Better AutoML0
RL-Controller: a reinforcement learning framework for active structural control0
Hybrid computer approach to train a machine learning system0
A Survey of Forex and Stock Price Prediction Using Deep Learning0
Constrained Text Generation with Global Guidance -- Case Study on CommonGen0
Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters0
Adapting User Interfaces with Model-based Reinforcement Learning0
A Vision Based Deep Reinforcement Learning Algorithm for UAV Obstacle Avoidance0
Adversarial attacks in consensus-based multi-agent reinforcement learning0
A Quadratic Actor Network for Model-Free Reinforcement LearningCode0
Auto-COP: Adaptation Generation in Context-Oriented Programming using Reinforcement Learning Options0
A Reinforcement Learning Based Approach to Play Calling in Football0
Multi-Task Federated Reinforcement Learning with Adversaries0
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks0
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning0
Symbolic Reinforcement Learning for Safe RAN Control0
Policy Search with Rare Significant Events: Choosing the Right Partner to Cooperate withCode0
Streaming Linear System Identification with Reverse Experience Replay0
Using Cognitive Models to Train Warm Start Reinforcement Learning Agents for Human-Computer Interactions0
Maximum Entropy RL (Provably) Solves Some Robust RL Problems0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning0
Multi-Objective Reinforcement Learning based Multi-Microgrid System Optimisation Problem0
Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning0
A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems0
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning0
Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme0
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded as Weighted Finite Automata0
Learning to Infer Unseen Contexts in Causal Contextual Reinforcement Learning0
Automatic Goal Generation using Dynamical Distance Learning0
A Learning-Based Computational Impact Time Guidance0
Learning to Explore a Class of Multiple Reward-Free Environments0
Learning Task Informed Abstractions0
I am Robot: Neuromuscular Reinforcement Learning to Actuate Human Limbs through Functional Electrical Stimulation0
Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning0
Challenges for Reinforcement Learning in Healthcare0
Less Suboptimal Learning and Control in Variational POMDPs0
Learning State Representations via Temporal Cycle-Consistency Constraint in Model-Based Reinforcement Learning0
LOCO: Adaptive exploration in reinforcement learning via local estimation of contraction coefficients0
A Scavenger Hunt for Service RobotsCode0
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning0
Pretraining Reward-Free Representations for Data-Efficient Reinforcement Learning0
Parametrized quantum policies for reinforcement learning0
Solipsistic Reinforcement Learning0
Resolving Causal Confusion in Reinforcement Learning via Robust Exploration0
Minimum Description Length Skills for Accelerated Reinforcement Learning0
Out-of-distribution generalization of internal models is correlated with reward0
Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning0
Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation0
Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning0
Show:102550
← PrevPage 187 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified