SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 19011918 of 1918 papers

TitleStatusHype
Deep Reinforcement Learning for Control of Probabilistic Boolean NetworksCode0
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency modelCode0
Learn How to Cook a New Recipe in a New House: Using Map Familiarization, Curriculum Learning, and Bandit Feedback to Learn Families of Text-Based Adventure GamesCode0
The Impact of Data Distribution on Q-learning with Function ApproximationCode0
Sampled Policy Gradient for Learning to Play the Game Agar.ioCode0
Learning an Interpretable Traffic Signal Control PolicyCode0
Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learningCode0
Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement LearningCode0
Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time SystemsCode0
Continuous Control With Ensemble Deep Deterministic Policy GradientsCode0
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement LearningCode0
Agent Performing Autonomous Stock Trading under Good and Bad SituationsCode0
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction ApproachCode0
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function ApproximationCode0
QLBS: Q-Learner in the Black-Scholes(-Merton) WorldsCode0
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseCode0
Learning from Multiple Independent Advisors in Multi-agent Reinforcement LearningCode0
OmniEcon Nexus: Global Microeconomic Simulation EngineCode0
Show:102550
← PrevPage 39 of 39Next →

No leaderboard results yet.