SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 81518200 of 15113 papers

TitleStatusHype
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL0
NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback0
Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design0
Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding0
Multi-Agent Path Planning Using Deep Reinforcement Learning0
Learning to Assist Agents by Observing Them0
Automating Privilege Escalation with Deep Reinforcement Learning0
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes0
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation0
Behaviour-conditioned policies for cooperative reinforcement learning tasks0
Decentralized Safe Reinforcement Learning for Voltage Control0
A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances0
DRL-Clusters: Buffer Management with Clustering based Deep Reinforcement Learning0
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations0
Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming EventsCode0
Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement LearningCode0
Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning0
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement LearningCode0
DNN-Opt: An RL Inspired Optimization for Analog Circuit Sizing using Deep Neural Networks0
Guiding Evolutionary Strategies by Differentiable Robot SimulatorsCode0
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement LearningCode0
Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning0
Divergence-Regularized Multi-Agent Actor-Critic0
Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning0
Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems0
Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning0
Multi-lane Cruising Using Hierarchical Planning and Reinforcement Learning0
Is Policy Learning Overrated?: Width-Based Planning and Active Learning for AtariCode0
Stability Constrained Reinforcement Learning for Real-Time Voltage Control0
Neural Network Verification in Control0
Surveillance Evasion Through Bayesian Reinforcement LearningCode0
Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance0
Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces0
Solving the Real Robot Challenge using Deep Reinforcement LearningCode0
Reinforcement Learning with Information-Theoretic Actuation0
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators0
Bitcoin Transaction Strategy Construction Based on Deep Reinforcement Learning0
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines0
A Privacy-preserving Distributed Training Framework for Cooperative Multi-agent Deep Reinforcement Learning0
Coordinated Reinforcement Learning for Optimizing Mobile Networks0
HLIC: Harmonizing Optimization Metrics in Learned Image Compression by Reinforcement Learning0
A Flexible Measurement of Diversity in Datasets with Random Network Distillation0
Generalisation in Lifelong Reinforcement Learning through Logical Composition0
Detecting Worst-case Corruptions via Loss Landscape Curvature in Deep Reinforcement Learning0
Coordinated Attacks Against Federated Learning: A Multi-Agent Reinforcement Learning Approach0
An Attempt to Model Human Trust with Reinforcement Learning0
Fully Decentralized Model-based Policy Optimization with Networked Agents0
Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning0
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning0
Show:102550
← PrevPage 164 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified