SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 101125 of 1918 papers

TitleStatusHype
Reinforcement Learning for Quantum Circuit Design: Using Matrix Representations0
Music Generation using Human-In-The-Loop Reinforcement Learning0
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework0
BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch0
Random-Key Algorithms for Optimizing Integrated Operating Room Scheduling0
SPEQ: Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning0
Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning0
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning0
Online inductive learning from answer sets for efficient reinforcement learning exploration0
An Empirical Study of Deep Reinforcement Learning in Continuing TasksCode0
Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis0
Deep Transfer Q-Learning for Offline Non-Stationary Reinforcement Learning0
β-DQN: Improving Deep Q-Learning By Evolving the Behavior0
Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients0
Dynamic Optimization of Storage Systems Using Reinforcement Learning Techniques0
Protein Structure Prediction in the 3D HP Model Using Deep Reinforcement Learning0
A Reinforcement Learning-Based Task Mapping Method to Improve the Reliability of Clustered Manycores0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks0
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning0
Decoding fairness: a reinforcement learning perspectiveCode0
MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal ControlCode0
Distribution-Free Uncertainty Quantification in Mechanical Ventilation Treatment: A Conformal Deep Q-Learning Framework0
Neural-Network-Driven Reward Prediction as a Heuristic: Advancing Q-Learning for Mobile Robot Path Planning0
PickLLM: Context-Aware RL-Assisted Large Language Model Routing0
Show:102550
← PrevPage 5 of 77Next →

No leaderboard results yet.