SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 61516175 of 15113 papers

TitleStatusHype
Multi-fidelity reinforcement learning framework for shape optimization0
Reward-Free Policy Space Compression for Reinforcement Learning0
A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric VehiclesCode1
Continual Auxiliary Task Learning0
Behaviour-neutral Smart Charging of Plugin Electric Vehicles: Reinforcement learning approach0
A policy gradient approach for optimization of smooth risk measures0
Behaviour-Diverse Automatic Penetration Testing: A Curiosity-Driven Multi-Objective Deep Reinforcement Learning Approach0
A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning0
Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning0
Reinforcement Learning Framework for Server Placement and Workload Allocation in Multi-Access Edge Computing0
Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks0
Autonomous Warehouse Robot using Deep Q-Learning0
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided MarketsCode0
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement LearningCode1
Accelerating Primal-dual Methods for Regularized Markov Decision Processes0
CCPT: Automatic Gameplay Testing and Validation with Curiosity-Conditioned Proximal Trajectories0
Learning Causal Overhypotheses through Exploration in Children and Computational Models0
Rule Mining over Knowledge Graphs via Reinforcement Learning0
Selective Credit Assignment0
PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning0
Who Are the Best Adopters? User Selection Model for Free Trial Item Promotion0
Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic0
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training0
Shaping Advice in Deep Reinforcement LearningCode0
Transformation Coding: Simple Objectives for Equivariant Representations0
Show:102550
← PrevPage 247 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified