SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 88018825 of 15113 papers

TitleStatusHype
Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module0
Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems0
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Multi-Task Reinforcement Learning with Context-based RepresentationsCode1
Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users0
Leveraging Reinforcement Learning for evaluating Robustness of KNN Search Algorithms0
Domain Adaptation In Reinforcement Learning Via Latent Unified State RepresentationCode1
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-SupervisionCode1
Derivative-Free Reinforcement Learning: A Review0
Learning Equational Theorem Proving0
Defense Against Reward Poisoning Attacks in Reinforcement Learning0
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning0
Personalization for Web-based Services using Offline Reinforcement Learning0
Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning AlgorithmsCode0
Risk-Averse Offline Reinforcement LearningCode1
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States0
Reinforcement Learning for Optimized Beam Training in Multi-Hop Terahertz Communications0
Patterns, predictions, and actions: A story about machine learning0
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach0
Risk-Averse Bayes-Adaptive Reinforcement Learning0
Scheduling the NASA Deep Space Network with Deep Reinforcement Learning0
Adaptive Pairwise Weights for Temporal Credit Assignment0
Measuring Progress in Deep Reinforcement Learning Sample Efficiency0
rl_reach: Reproducible Reinforcement Learning Experiments for Robotic Reaching TasksCode1
Show:102550
← PrevPage 353 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified