SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
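To make the agent-environment loop concrete, here is a minimal sketch using tabular Q-learning, one of many RL methods and not one prescribed by this page. The five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and all hyperparameter values are hypothetical and chosen only for illustration.

```python
import random

N_STATES = 5       # hypothetical chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)   # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment: reward 1.0 only when the agent reaches the last state."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (assumed settings)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-valued action, breaking ties at random.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action for each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

Under these assumptions the learned greedy policy should move right in every non-terminal state, since that is the only way to collect the reward. The discount factor `gamma` is what turns "long-term reward" into a single number: rewards that arrive later count for less.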

Papers

Showing 7201-7250 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Calibration of Derivative Pricing Models: a Multi-Agent Reinforcement Learning Perspective | | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | | 0 |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems | | 0 |
| Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation | | 0 |
| A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | | 0 |
| Deep Binary Reinforcement Learning for Scalable Verification | | 0 |
| Active Phase-Encode Selection for Slice-Specific Fast MR Scanning Using a Transformer-Based Deep Reinforcement Learning Framework | | 0 |
| Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control | | 0 |
| Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task | Code | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | | 0 |
| Reinforcement Learning for Linear Quadratic Control is Vulnerable Under Cost Manipulation | | 0 |
| Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control | Code | 0 |
| Random Ensemble Reinforcement Learning for Traffic Signal Control | | 0 |
| Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework | | 0 |
| Learning Torque Control for Quadrupedal Locomotion | | 0 |
| Artificial Intelligence in Vehicular Wireless Networks: A Case Study Using ns-3 | | 0 |
| Action-Constrained Reinforcement Learning for Frame-Level Bit Allocation in HEVC/H.265 through Frank-Wolfe Policy Optimization | | 0 |
| Gym-saturation: an OpenAI Gym environment for saturation provers | | 0 |
| Investigation of Factorized Optical Flows as Mid-Level Representations | | 0 |
| SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning | Code | 0 |
| Multi-robot Cooperative Pursuit via Potential Field-Enhanced Reinforcement Learning | | 0 |
| Neuro-symbolic Natural Logic with Introspective Revision for Natural Language Inference | Code | 0 |
| Robot Learning of Mobile Manipulation with Reachability Behavior Priors | | 0 |
| Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models | | 0 |
| Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks | | 0 |
| Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light Control | | 0 |
| Rényi State Entropy for Exploration Acceleration in Reinforcement Learning | | 0 |
| Designing Heterogeneous GNNs with Desired Permutation Properties for Wireless Resource Allocation | | 0 |
| A Complete Characterization of Linear Estimators for Offline Policy Evaluation | | 0 |
| Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery | | 0 |
| Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping | | 0 |
| Knowledge Transfer in Deep Reinforcement Learning for Slice-Aware Mobility Robustness Optimization | | 0 |
| Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations | | 0 |
| Cascaded Gaps: Towards Gap-Dependent Regret for Risk-Sensitive Reinforcement Learning | | 0 |
| A Survey on Reinforcement Learning Methods in Character Animation | | 0 |
| Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets | | 0 |
| Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network | | 0 |
| Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation | | 0 |
| Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility | | 0 |
| Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach | | 0 |
| Reinforcement Learning for Location-Aware Scheduling | | 0 |
| On Credit Assignment in Hierarchical Reinforcement Learning | Code | 0 |
| Recursive Reasoning Graph for Multi-Agent Reinforcement Learning | | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | | 0 |
| Watch from sky: machine-learning-based multi-UAV network for predictive police surveillance | | 0 |
| Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment | | 0 |
| Deep Reinforcement Learning based Model-free On-line Dynamic Multi-Microgrid Formation to Enhance Resilience | | 0 |
| Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations | | 0 |
| Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation | | 0 |
| A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |