SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 551600 of 1918 papers

TitleStatusHype
Model-based versus model-free feeding control and water quality monitoring for fish growth tracking in aquaculture systems0
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care0
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach0
Approximate information state based convergence analysis of recurrent Q-learning0
Active Inference in Hebbian Learning Networks0
Agent Performing Autonomous Stock Trading under Good and Bad SituationsCode0
Reinforcement Learning-Based Control of CrazyFlie 2.X Quadrotor0
Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task0
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control0
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseCode0
VA-learning as a more efficient alternative to Q-learning0
Sample Complexity of Variance-reduced Distributionally Robust Q-learning0
A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market0
Reinforcement Learning With Reward Machines in Stochastic Games0
MADiff: Offline Multi-agent Learning with Diffusion ModelsCode1
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks0
RSRM: Reinforcement Symbolic Regression Machine0
OER: Offline Experience Replay for Continual Offline Reinforcement Learning0
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks0
Bayesian Risk-Averse Q-Learning with Streaming Observations0
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond0
Model-Free Robust Average-Reward Reinforcement Learning0
Smart Home Energy Management: VAE-GAN synthetic dataset generator and Q-learning0
Mastering Percolation-like Games with Deep LearningCode0
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm0
Deep Q-Learning-based Distribution Network Reconfiguration for Reliability Improvement0
Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Electric Vehicle Energy ManagementCode0
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable EnvironmentsCode0
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading0
Safe Q-learning for continuous-time linear systems0
Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach0
Learned Collusion0
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion PoliciesCode1
Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze ProblemsCode0
Graph Exploration for Effective Multi-agent Q-Learning0
Quantum deep Q learning with distributed prioritized experience replay0
A study on a Q-Learning algorithm application to a manufacturing assembly problem0
Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement LearningCode0
Exploring the Noise Resilience of Successor Features and Predecessor Features Algorithms in One and Two-Dimensional Environments0
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences0
RELS-DQN: A Robust and Efficient Local Search Framework for Combinatorial Optimization0
Reinforcement Learning Based Minimum State-flipped Control for the Reachability of Boolean Control Networks0
Automaton-Guided Curriculum Generation for Reinforcement Learning AgentsCode0
Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural NetworksCode0
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion0
Deep Reinforcement Learning Based Optimal Infinite-Horizon Control of Probabilistic Boolean Control Networks0
Quantitative Trading using Deep Q Learning0
A Tutorial Introduction to Reinforcement Learning0
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization0
Show:102550
← PrevPage 12 of 39Next →

No leaderboard results yet.