SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
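
The interaction loop described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from any paper listed on this page: the five-state corridor environment (`step`), the hyperparameters, and the choice of tabular Q-learning as the learning rule. It shows one common way an agent can learn a reward-maximizing policy from reward feedback alone.

```python
import random

N_STATES = 5       # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (0, 1)   # 0 = step left, 1 = step right
GAMMA = 0.9        # discount factor (assumed value)
ALPHA = 0.5        # learning rate (assumed value)
EPSILON = 0.1      # exploration probability (assumed value)


def step(state, action):
    """Hypothetical environment: returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal
    return next_state, reward, done


def choose_action(q, state, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, seed=0):
    """Tabular Q-learning: the agent acts, observes a reward, updates Q."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for every non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the learned greedy policy should choose "right" (action 1) in every non-terminal state, since that is the shortest path to the only reward. Real benchmarks replace the corridor with a much richer environment and the table with a function approximator.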

Papers

Showing 7301-7350 of 15113 papers

Title | Status | Hype
Learning Relative Return Policies With Upside-Down Reinforcement Learning | - | 0
Drawing Inductor Layout with a Reinforcement Learning Agent: Method and Application for VCO Inductors | - | 0
Comparative analysis of machine learning methods for active flow control | - | 0
Consistent Dropout for Policy Gradient Reinforcement Learning | - | 0
Reinforcement Learning in Practice: Opportunities and Challenges | - | 0
Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four | - | 0
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel | - | 0
Multi-fidelity reinforcement learning framework for shape optimization | - | 0
Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning | - | 0
Reward-Free Policy Space Compression for Reinforcement Learning | - | 0
A policy gradient approach for optimization of smooth risk measures | - | 0
Behaviour-Diverse Automatic Penetration Testing: A Curiosity-Driven Multi-Objective Deep Reinforcement Learning Approach | - | 0
Continual Auxiliary Task Learning | - | 0
A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning | - | 0
Behaviour-neutral Smart Charging of Plugin Electric Vehicles: Reinforcement learning approach | - | 0
Autonomous Warehouse Robot using Deep Q-Learning | - | 0
Learning Causal Overhypotheses through Exploration in Children and Computational Models | - | 0
CCPT: Automatic Gameplay Testing and Validation with Curiosity-Conditioned Proximal Trajectories | - | 0
Accelerating Primal-dual Methods for Regularized Markov Decision Processes | - | 0
Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks | - | 0
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets | Code | 0
Rule Mining over Knowledge Graphs via Reinforcement Learning | - | 0
Reinforcement Learning Framework for Server Placement and Workload Allocation in Multi-Access Edge Computing | - | 0
PooL: Pheromone-inspired Communication Framework for Large Scale Multi-Agent Reinforcement Learning | - | 0
Selective Credit Assignment | - | 0
Shaping Advice in Deep Reinforcement Learning | Code | 0
TransDreamer: Reinforcement Learning with Transformer World Models | - | 0
Who Are the Best Adopters? User Selection Model for Free Trial Item Promotion | - | 0
Transformation Coding: Simple Objectives for Equivariant Representations | - | 0
Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic | - | 0
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training | - | 0
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning | - | 0
Can Interpretable Reinforcement Learning Manage Prosperity Your Way? | - | 0
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices | - | 0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations | - | 0
Retrieval-Augmented Reinforcement Learning | - | 0
Should I send this notification? Optimizing push notifications decision making by modeling the future | - | 0
Robust Reinforcement Learning via Genetic Curriculum | - | 0
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization | - | 0
A Survey on Deep Reinforcement Learning-based Approaches for Adaptation and Generalization | - | 0
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs | - | 0
Improving Intrinsic Exploration with Language Abstractions | - | 0
A Survey of Explainable Reinforcement Learning | - | 0
An Intrusion Response System utilizing Deep Q-Networks and System Partitions | Code | 0
Branching Reinforcement Learning | - | 0
Domain Adaptive Fake News Detection via Reinforcement Learning | - | 0
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo | - | 0
User-Oriented Robust Reinforcement Learning | - | 0
Interpretable Reinforcement Learning with Multilevel Subgoal Discovery | - | 0
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning | Code | 0
Page 147 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified