SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 801850 of 1918 papers

TitleStatusHype
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning0
A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing0
Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach0
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits0
G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning0
Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control0
Enhancing Q-Learning with Large Language Model Heuristics0
Challenging On Car Racing Problem from OpenAI gym0
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation0
Enhancing Classification Performance via Reinforcement Learning for Feature Selection0
GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization0
Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning0
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery0
Graph Exploration for Effective Multi-agent Q-Learning0
Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles0
Graph Q-Learning for Combinatorial Optimization0
Greedy-Step Off-Policy Reinforcement Learning0
Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning0
Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition0
Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks0
Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution0
Guiding Reinforcement Learning Exploration Using Natural Language0
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension0
Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time0
Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning0
A new multilayer optical film optimal method based on deep q-learning0
A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control0
Action Learning for 3D Point Cloud Based Organ Segmentation0
HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search0
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search0
Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q-learning0
Enhanced Deep Q-Learning for 2D Self-Driving Cars: Implementation and Evaluation on a Custom Track Environment0
Energy Sharing for Multiple Sensor Nodes with Finite Buffers0
Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process0
Hierarchical clustering with deep Q-learning0
Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision0
Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity0
Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem0
Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization0
High dimensional precision medicine from patient-derived xenografts0
High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning0
Highway Reinforcement Learning0
Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task0
How to discretize continuous state-action spaces in Q-learning: A symbolic control approach0
Human and Multi-Agent collaboration in a human-MARL teaming framework0
Hybridizing the 1/5-th Success Rule with Q-Learning for Controlling the Mutation Rate of an Evolutionary Algorithm0
Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication0
Hybrid Policies Using Inverse Rewards for Reinforcement Learning0
Hybrid Q-Learning Applied to Ubiquitous recommender system0
A new convergent variant of Q-learning with linear function approximation0
Show:102550
← PrevPage 17 of 39Next →

No leaderboard results yet.