SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

(Image credit: Playing Atari with Deep Reinforcement Learning)
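As a rough illustration of how a policy can be learned, here is a minimal sketch of tabular Q-learning. The corridor environment, the function names (`q_learning`, `greedy_policy`), and all hyperparameter values are assumptions made for this example. They are not taken from this page or from the credited paper, which uses a neural network rather than a table.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. Action 0 moves left (clamped at state 0),
    action 1 moves right. Entering the rightmost state yields reward 1
    and ends the episode; every other step yields reward 0.
    """
    rng = random.Random(seed)
    n_actions = 2
    goal = n_states - 1
    # Q-table: one row per state, one entry per action, initialised to 0.
    q = [[0.0] * n_actions for _ in range(n_states)]

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best]
                )

            # Deterministic transition of the toy environment.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update: move Q(s, a) toward the bootstrapped target
            # r + gamma * max_a' Q(s', a'). The goal row is never updated,
            # so it stays at 0 and acts as the terminal value.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` gives one action per state, which is the "what action to take under what circumstances" mapping described above. The entry for the goal state is meaningless, since no action is ever taken there.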

Papers

Showing 751–800 of 1918 papers

Title | Status | Hype
An Independent Study of Reinforcement Learning and Autonomous Driving |  | 0
Fidelity-based Probabilistic Q-learning for Control of Quantum Systems |  | 0
Final Adaptation Reinforcement Learning for N-Player Games |  | 0
Finding the best design parameters for optical nanostructures using reinforcement learning |  | 0
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids |  | 0
Finite-Sample Analysis for SARSA with Linear Function Approximation |  | 0
A Deep Reinforcement Learning Trader without Offline Training |  | 0
Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games |  | 0
Action-modulated midbrain dopamine activity arises from distributed control policies |  | 0
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation |  | 0
Accelerated Target Updates for Q-learning |  | 0
Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning |  | 0
Equivalence Between Policy Gradients and Soft Q-Learning |  | 0
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View |  | 0
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks |  | 0
Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning |  | 0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning |  | 0
Chrome Dino Run using Reinforcement Learning |  | 0
Finite-Time Analysis of Simultaneous Double Q-learning |  | 0
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation |  | 0
Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise |  | 0
Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model |  | 0
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach |  | 0
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations |  | 0
Fire Threat Detection From Videos with Q-Rough Sets |  | 0
Fitted Q-Learning for Relational Domains |  | 0
Learning in Discounted-cost and Average-cost Mean-field Games |  | 0
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning |  | 0
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning |  | 0
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model |  | 0
Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals |  | 0
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game |  | 0
Chemoreception and chemotaxis of a three-sphere swimmer |  | 0
FPGA Architecture for Deep Learning and its application to Planetary Robotics |  | 0
Ensemble Bootstrapping for Q-Learning |  | 0
Characterizing the Action-Generalization Gap in Deep Q-Learning |  | 0
From r to Q^*: Your Language Model is Secretly a Q-Function |  | 0
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning |  | 0
A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing |  | 0
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion |  | 0
Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach |  | 0
Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control |  | 0
Enhancing Q-Learning with Large Language Model Heuristics |  | 0
Gap-Dependent Bounds for Federated Q-learning |  | 0
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition |  | 0
Gap-Dependent Bounds for Two-Player Markov Games |  | 0
GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy |  | 0
Challenging On Car Racing Problem from OpenAI gym |  | 0
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation |  | 0
Enhancing Classification Performance via Reinforcement Learning for Feature Selection |  | 0

No leaderboard results yet.