SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
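The interaction loop described above can be sketched with tabular Q-learning on a toy problem. This is a minimal illustration, not a method from any paper listed here: the five-cell corridor environment, the reward values, and the names `step`, `train`, and `greedy_policy` are all hypothetical, and Q-learning is just one of many ways to learn a policy.

```python
import random

# Hypothetical toy environment (an assumption for illustration only):
# a corridor of 5 cells. The agent starts in cell 0; entering cell 4
# yields reward +1 and ends the episode. Every other step costs 0.01.
N_STATES = 5
ACTIONS = (-1, +1)  # move left, move right


def step(state, action):
    """Return (next_state, reward, done) for the toy corridor."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else -0.01
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Move the estimate toward reward + discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return [ACTIONS[0] if row[0] > row[1] else ACTIONS[1] for row in q[:-1]]
```

Calling `greedy_policy(train())` returns one action per non-terminal cell. In this corridor the learned policy should move right everywhere, since that is the shortest path to the only positive reward.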

Papers

Showing 2901 to 2950 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Attention-based Reinforcement Learning for Real-Time UAV Semantic Communication | | 0 |
| Attention-based QoE-aware Digital Twin Empowered Edge Computing for Immersive Virtual Reality | | 0 |
| AI-driven materials design: a mini-review | | 0 |
| Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks | | 0 |
| Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems | | 0 |
| AAMDRL: Augmented Asset Management with Deep Reinforcement Learning | | 0 |
| Attention-based Deep Reinforcement Learning for Multi-view Environments | | 0 |
| AI-based traffic analysis in digital twin networks | | 0 |
| Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States | | 0 |
| Decision-making Strategy on Highway for Autonomous Vehicles using Deep Reinforcement Learning | | 0 |
| Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling | | 0 |
| Decoding Polar Codes with Reinforcement Learning | | 0 |
| Deep Distributional Learning with Non-crossing Quantile Network | | 0 |
| Attention-Aware Face Hallucination via Deep Reinforcement Learning | | 0 |
| Attention-Aware Deep Reinforcement Learning for Video Face Recognition | | 0 |
| AI-based Robust Resource Allocation in End-to-End Network Slicing under Demand and CSI Uncertainties | | 0 |
| Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning | | 0 |
| AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments | | 0 |
| AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control | | 0 |
| Attend2Pack: Bin Packing through Deep Reinforcement Learning with Attention | | 0 |
| AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks | | 0 |
| Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles | | 0 |
| AI Assisted Annotator using Reinforcement Learning | | 0 |
| Adaptive Batch Size for Safe Policy Gradients | | 0 |
| Attacking and Defending Deep Reinforcement Learning Policies | | 0 |
| AI-as-a-Service Toolkit for Human-Centered Intelligence in Autonomous Driving | | 0 |
| AttackGNN: Red-Teaming GNNs in Hardware Security Using Reinforcement Learning | | 0 |
| A* Tree Search for Portfolio Management | | 0 |
| ACECODER: Acing Coder RL via Automated Test-Case Synthesis | | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | | 0 |
| A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules | | 0 |
| Constrained Reinforcement Learning Has Zero Duality Gap | | 0 |
| A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection | | 0 |
| Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning | | 0 |
| Deciding What's Fair: Challenges of Applying Reinforcement Learning in Online Marketplaces | | 0 |
| Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning | | 0 |
| Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making | | 0 |
| ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories | | 0 |
| A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning | | 0 |
| A Hybrid PAC Reinforcement Learning Algorithm | | 0 |
| A Hybrid Neuro-Symbolic approach for Text-Based Games using Inductive Logic Programming | | 0 |
| Adaptive Aggregation for Safety-Critical Control | | 0 |
| Deceptive Reinforcement Learning for Privacy-Preserving Planning | | 0 |
| INTAGS: Interactive Agent-Guided Simulation | | 0 |
| At Human Speed: Deep Reinforcement Learning with Action Delay | | 0 |
| Adaptive Adversarial Training for Meta Reinforcement Learning | | 0 |
| A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum | | 0 |
| A Fast Convergence Theory for Offline Decision Making | | 0 |
| ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search | | 0 |
| AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning | | 0 |
Page 59 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |