SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 33013350 of 15113 papers

TitleStatusHype
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning0
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control0
A SUMO Framework for Deep Reinforcement Learning Experiments Solving Electric Vehicle Charging Dispatching Problem0
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems0
Decentralized Federated Reinforcement Learning for User-Centric Dynamic TFDD Control0
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach0
Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks0
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines0
Deep Reinforcement Learning for Small Bowel Path Tracking using Different Types of Annotations0
Decentralized model-free reinforcement learning in stochastic games with average-reward objective0
A Review of Deep Reinforcement Learning for Smart Building Energy Management0
A Succinct Summary of Reinforcement Learning0
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks0
Decentralized Multi-Agent Reinforcement Learning for Task Offloading Under Uncertainty0
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method0
Decentralized Multi-Robot Formation Control Using Reinforcement Learning0
Decentralized Reinforcement Learning for Multi-Target Search and Detection by a Team of Drones0
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions0
A Gentle Lecture Note on Filtrations in Reinforcement Learning0
Decentralized Safe Reinforcement Learning for Voltage Control0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems0
Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks0
AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning0
Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning0
Attention-Aware Face Hallucination via Deep Reinforcement Learning0
Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information0
Deception in Social Learning: A Multi-Agent Reinforcement Learning Perspective0
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots0
Deceptive Reinforcement Learning for Privacy-Preserving Planning0
Deceptive Reinforcement Learning in Model-Free Domains0
Deep Reinforcement Learning for Smart Home Energy Management0
CQM: Curriculum Reinforcement Learning with a Quantized World Model0
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning0
Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems0
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making0
Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning0
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon0
Decision Making in Non-Stationary Environments with Policy-Augmented Monte Carlo Tree Search0
Decision-making Strategy on Highway for Autonomous Vehicles using Deep Reinforcement Learning0
Attention-driven Robotic Manipulation0
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making0
Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels0
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning0
Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition0
Decoding Molecular Graph Embeddings with Reinforcement Learning0
Decoding Polar Codes with Reinforcement Learning0
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse0
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks0
Show:102550
← PrevPage 67 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified