SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 44514500 of 15113 papers

TitleStatusHype
A Graph Neural Network-Based QUBO-Formulated Hamiltonian-Inspired Loss Function for Combinatorial Optimization using Reinforcement Learning0
A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems0
A gray-box approach for curriculum learning0
A Greedy Approximation of Bayesian Reinforcement Learning with Probably Optimistic Transition Model0
A Guider Network for Multi-Dual Learning0
A Guiding Principle for Causal Decision Problems0
A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement0
A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines0
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat0
A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning0
A Hierarchical Hybrid Learning Framework for Multi-agent Trajectory Prediction0
A Hierarchical Model for Device Placement0
A Hierarchical Reinforcement Learning Method for Persistent Time-Sensitive Tasks0
A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning0
A Homogenization Approach for Gradient-Dominated Stochastic Optimization0
A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming0
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors0
A Human Mixed Strategy Approach to Deep Reinforcement Learning0
A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression0
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum0
A Hybrid Neuro-Symbolic approach for Text-Based Games using Inductive Logic Programming0
A Hybrid PAC Reinforcement Learning Algorithm0
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities0
AI-as-a-Service Toolkit for Human-Centered Intelligence in Autonomous Driving0
AI Assisted Annotator using Reinforcement Learning0
AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks0
AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments0
AI-based Robust Resource Allocation in End-to-End Network Slicing under Demand and CSI Uncertainties0
AI-based traffic analysis in digital twin networks0
AI-driven materials design: a mini-review0
AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms0
AI-Driven Resource Allocation in Optical Wireless Communication Systems0
AIGB: Generative Auto-bidding via Conditional Diffusion Modeling0
AIGenC: An AI generalisation model via creativity0
AI Planning: A Primer and Survey (Preliminary Report)0
AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning0
AI Recommendation Systems for Lane-Changing Using Adherence-Aware Reinforcement Learning0
AirRL: A Reinforcement Learning Approach to Urban Air Quality Inference0
AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework0
AITuning: Machine Learning-based Tuning Tool for Run-Time Communication Libraries0
Effective Communications: A Joint Learning and Communication Framework for Multi-Agent Reinforcement Learning over Noisy Channels0
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces0
A Language Model based Evaluator for Sentence Compression0
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
A Learned Simulation Environment to Model Plant Growth in Indoor Farming0
A Learned Simulation Environment to Model Student Engagement and Retention in Automated Online Courses0
A Learning based Branch and Bound for Maximum Common Subgraph Problems0
A Learning-Exploring Method to Generate Diverse Paraphrases with Multi-Objective Deep Reinforcement Learning0
A Learning Framework for High Precision Industrial Assembly0
Show:102550
← PrevPage 90 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified