Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 601–650 of 1918 papers

Title	Date	Tasks	Status	Hype
Q-Learning based system for path planning with unmanned aerial vehicles swarms in obstacle environments	Mar 30, 2023	Q-Learning	Unverified	0
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise	Mar 28, 2023	Q-Learning	Unverified	0
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization	Mar 28, 2023	D4RL, Offline RL	Code Available	1
Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems	Mar 25, 2023	Q-Learning	Unverified	0
Specific investments under negotiated transfer pricing: effects of different surplus sharing parameters on managerial performance: An agent-based simulation with fuzzy Q-learning agents	Mar 25, 2023	Q-Learning	Unverified	0
Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning	Mar 24, 2023	Deep Reinforcement Learning, Q-Learning	Unverified	0
Artificial Intelligence and Dual Contract	Mar 22, 2023	Q-Learning	Unverified	0
Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms	Mar 17, 2023	Q-Learning, reinforcement-learning	Unverified	0
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control	Mar 17, 2023	Q-Learning, reinforcement-learning	Unverified	0
Self-Inspection Method of Unmanned Aerial Vehicles in Power Plants Using Deep Q-Network Reinforcement Learning	Mar 16, 2023	Autonomous Navigation, Q-Learning	Unverified	0
Smoothed Q-learning	Mar 15, 2023	Q-Learning, reinforcement-learning	Unverified	0
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera	Mar 13, 2023	Privacy Preserving, Q-Learning	Code Available	0
The tree reconstruction game: phylogenetic reconstruction using reinforcement learning	Mar 12, 2023	Q-Learning, reinforcement-learning	Unverified	0
Ignorance is Bliss: Robust Control via Information Gating	Mar 10, 2023	Inductive Bias, Q-Learning	Unverified	0
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning	Mar 10, 2023	Federated Learning, Knowledge Distillation	Unverified	0
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning	Mar 9, 2023	Offline RL, Q-Learning	Code Available	1
Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments	Mar 9, 2023	Form, Q-Learning	Unverified	0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning	Mar 7, 2023	Continuous Control, Offline RL	Unverified	0
Exploration via Epistemic Value Estimation	Mar 7, 2023	Decision Making, Efficient Exploration	Unverified	0
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control	Mar 4, 2023	MuJoCo, Q-Learning	Unverified	0
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games	Mar 4, 2023	Atari Games, Deep Reinforcement Learning	Unverified	0
Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning	Mar 3, 2023	Deep Reinforcement Learning, Q-Learning	Unverified	0
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning	Mar 2, 2023	Multi-agent Reinforcement Learning, Q-Learning	Code Available	0
A Deep Reinforcement Learning Trader without Offline Training	Mar 1, 2023	Deep Reinforcement Learning, Q-Learning	Unverified	0
The Point to Which Soft Actor-Critic Converges	Mar 1, 2023	Q-Learning	Unverified	0
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation	Mar 1, 2023	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning	Mar 1, 2023	Continuous Control, Imitation Learning	Code Available	1
Minimizing the Outage Probability in a Markov Decision Process	Feb 28, 2023	Q-Learning, reinforcement-learning	Unverified	0
A Finite Sample Complexity Bound for Distributionally Robust Q-learning	Feb 26, 2023	Q-Learning	Unverified	0
Q-Cogni: An Integrated Causal Reinforcement Learning Framework	Feb 26, 2023	Causal Inference, Decision Making	Unverified	0
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation	Feb 25, 2023	Offline RL, Q-Learning	Unverified	0
On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process	Feb 25, 2023	Q-Learning, reinforcement-learning	Unverified	0
Robust Auto-landing Control of an agile Regional Jet Using Fuzzy Q-learning	Feb 21, 2023	Q-Learning, reinforcement-learning	Unverified	0
Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes	Feb 21, 2023	Learning Theory, Medical Diagnosis	Unverified	0
Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning	Feb 21, 2023	Q-Learning, reinforcement-learning	Code Available	0
Forecasting and stabilizing chaotic regimes in two macroeconomic models via artificial intelligence technologies and control methods	Feb 20, 2023	Decision Making, Evolutionary Algorithms	Unverified	0
Logit-Q Dynamics for Efficient Learning in Stochastic Teams	Feb 20, 2023	Q-Learning	Unverified	0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications	Feb 15, 2023	Decision Making, Management	Unverified	0
Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data	Feb 15, 2023	Q-Learning, valid	Unverified	0
Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels	Feb 13, 2023	Q-Learning	Unverified	0
A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning	Feb 13, 2023	energy management, Management	Unverified	0
Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading	Feb 9, 2023	Edge-computing, Q-Learning	Unverified	0
MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework	Feb 7, 2023	Q-Learning	Unverified	0
Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms	Feb 7, 2023	Q-Learning	Unverified	0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage	Feb 5, 2023	Offline RL, Q-Learning	Unverified	0
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition	Feb 2, 2023	Diversity, Q-Learning	Unverified	0
Best Possible Q-Learning	Feb 2, 2023	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
Sample Complexity of Kernel-Based Q-Learning	Feb 1, 2023	Q-Learning, Reinforcement Learning (RL)	Unverified	0
Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense	Jan 28, 2023	Adversarial Attack, Deep Reinforcement Learning	Unverified	0
RCsearcher: Reaction Center Identification in Retrosynthesis via Deep Q-Learning	Jan 28, 2023	Deep Reinforcement Learning, Graph Neural Network	Unverified	0
