SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 726750 of 1918 papers

TitleStatusHype
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning0
Exploratory Control with Tsallis Entropy for Latent Factor Models0
Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning0
A Deep Reinforcement Learning Trader without Offline Training0
Action-modulated midbrain dopamine activity arises from distributed control policies0
Accelerated Target Updates for Q-learning0
Equivalence Between Policy Gradients and Soft Q-Learning0
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks0
Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning0
Chrome Dino Run using Reinforcement Learning0
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning0
Faster Deep Q-learning using Neural Episodic Control0
Faster Non-asymptotic Convergence for Double Q-learning0
Faster Q-Learning Algorithms for Restless Bandits0
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model0
Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network0
Chemoreception and chemotaxis of a three-sphere swimmer0
Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks0
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices0
Federated Q-Learning: Linear Regret Speedup with Low Communication Cost0
Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost0
Federated Stochastic Approximation under Markov Noise and Heterogeneity: Applications in Reinforcement Learning0
FedHQL: Federated Heterogeneous Q-Learning0
Ensemble Bootstrapping for Q-Learning0
Show:102550
← PrevPage 30 of 77Next →

No leaderboard results yet.