SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 14011450 of 1918 papers

TitleStatusHype
Regret Bounds for Discounted MDPs0
Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis0
Learning State Abstractions for Transfer in Continuous ControlCode0
GLSearch: Maximum Common Subgraph Detection via Learning to Search0
Manipulating Reinforcement Learning: Poisoning Attacks on Cost Signals0
Safe Wasserstein Constrained Deep Q-Learning0
A Stochastic Game Framework for Efficient Energy Management in Microgrid NetworksCode1
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes0
Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning0
Autonomous Control of a Line Follower Robot Using a Q-Learning Controller0
Q-Learning in enormous action spaces via amortized approximate maximization0
Discriminator Soft Actor Critic without Extrinsic RewardsCode1
Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping0
A storage expansion planning framework using reinforcement learning and simulation-based optimization0
A Probabilistic Simulator of Spatial Demand for Product Allocation0
EEG-based Drowsiness Estimation for Driving Safety using Deep Q-Learning0
Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar0
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
SVQN: Sequential Variational Soft Q-Learning Networks0
Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog0
Information Theoretic Model Predictive Q-Learning0
The Gambler's Problem and Beyond0
Learning in Discounted-cost and Average-cost Mean-field Games0
Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time0
Learning an Interpretable Traffic Signal Control PolicyCode0
Soft Q Network0
Sepsis World Model: A MIMIC-based OpenAI Gym "World Model" Simulator for Sepsis Treatment0
Provably Efficient Reinforcement Learning with Aggregated States0
High dimensional precision medicine from patient-derived xenografts0
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation0
Value-of-Information based Arbitration between Model-based and Model-free Control0
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill DiscoveryCode0
Combining Q-Learning and Search with Amortized Value Estimates0
Reinforcement Learning with Non-Markovian Rewards0
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms0
Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks0
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning0
Neural Temporal-Difference Learning Converges to Global Optima0
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle0
Propagating Uncertainty in Reinforcement Learning via Wasserstein BarycentersCode0
Privacy-Preserving Q-Learning with Functional Noise in Continuous SpacesCode0
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach0
Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles0
QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc NetworksCode0
Join Query Optimization with Deep Reinforcement Learning AlgorithmsCode0
Control-Tutored Reinforcement Learning: an application to the Herding Problem0
Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks0
A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control0
Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning0
Which Channel to Ask My Question? Personalized Customer Service RequestStream Routing using DeepReinforcement Learning0
Show:102550
← PrevPage 29 of 39Next →

No leaderboard results yet.