SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 96519700 of 15113 papers

TitleStatusHype
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function ApproximatorCode1
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning0
D2RL: Deep Dense Architectures in Reinforcement LearningCode1
A Reinforcement Learning Approach to Health Aware Control Strategy0
Knowledge-guided Open Attribute Value Extraction with Reinforcement LearningCode1
Evaluating the Safety of Deep Reinforcement Learning Models using Semi-Formal Verification0
Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous ControlCode1
Average-reward model-free reinforcement learning: a systematic review and literature mapping0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPsCode0
Model-Based Inverse Reinforcement Learning from Visual Demonstrations0
Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for Cellular Offloading0
Neural Algorithms for Graph Navigation0
Scalable Evolution Strategies Pipeline for Solving the Vehicle Routing Problem0
Learning Lower Bounds for Graph Exploration With Reinforcement Learning0
Learning Elimination Ordering for Tree Decomposition Problem0
Assessment of Reward Functions in Reinforcement Learning for Multi-Modal Urban Traffic Control under Real-World limitations0
Approximate information state for approximate planning and reinforcement learning in partially observed systemsCode1
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning0
Robot Navigation in Constrained Pedestrian Environments using Reinforcement LearningCode1
Reinforcement Learning for Efficient and Tuning-Free Link Adaptation0
Uncertainty-aware Contact-safe Model-based Reinforcement Learning0
Decomposability and Parallel Computation of Multi-Agent LQR0
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling0
Few-shot model-based adaptation in noisy conditions0
Autonomous Control of a Particle Accelerator using Deep Reinforcement Learning0
Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation0
Hyperparameter Auto-tuning in Self-Supervised Robotic LearningCode0
Interpretable Disease Prediction based on Reinforcement Path Reasoning over Knowledge Graphs0
Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning0
A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning0
Blending Search and Discovery: Tag-Based Query Refinement with Contextual Reinforcement Learning0
A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learningCode1
Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards0
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning ApproachCode0
An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards0
MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning AgentsCode0
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design0
Deep Learning of Koopman Representation for Control0
ALPaCA vs. GP-based Prior Learning: A Comparison between two Bayesian Meta-Learning AlgorithmsCode0
Optimal Dispatch in Emergency Service System via Reinforcement Learning0
Constrained Model-based Reinforcement Learning with Robust Cross-Entropy MethodCode1
Masked Contrastive Representation Learning for Reinforcement LearningCode1
Multi-Agent Trust Region Policy OptimizationCode0
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous ControlCode1
Local Differential Privacy for Regret Minimization in Reinforcement Learning0
Reinforcement Learning Based Temporal Logic Control with Maximum Probabilistic SatisfactionCode0
Self-Imitation Learning for Robot Tasks with Sparse and Delayed RewardsCode0
Modeling Protagonist Emotions for Emotion-Aware StorytellingCode1
UAV Path Planning using Global and Local Map Information with Deep Reinforcement LearningCode1
Show:102550
← PrevPage 194 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified