SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 14511500 of 1918 papers

TitleStatusHype
Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time0
Soft Q Network0
Sepsis World Model: A MIMIC-based OpenAI Gym "World Model" Simulator for Sepsis Treatment0
High dimensional precision medicine from patient-derived xenografts0
Provably Efficient Reinforcement Learning with Aggregated States0
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation0
Value-of-Information based Arbitration between Model-based and Model-free Control0
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill DiscoveryCode0
Reinforcement Learning with Non-Markovian Rewards0
Combining Q-Learning and Search with Amortized Value Estimates0
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms0
Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks0
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning0
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach0
Propagating Uncertainty in Reinforcement Learning via Wasserstein BarycentersCode0
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle0
Privacy-Preserving Q-Learning with Functional Noise in Continuous SpacesCode0
Neural Temporal-Difference Learning Converges to Global Optima0
Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles0
QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc NetworksCode0
Control-Tutored Reinforcement Learning: an application to the Herding Problem0
Join Query Optimization with Deep Reinforcement Learning AlgorithmsCode0
Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks0
A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control0
Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning0
Which Channel to Ask My Question? Personalized Customer Service RequestStream Routing using DeepReinforcement Learning0
Efficient Drone Mobility Support Using Reinforcement Learning0
Quantum Observables for continuous control of the Quantum Approximate Optimization Algorithm via Reinforcement Learning0
Asymptotics of Reinforcement Learning with Neural Networks0
Modelling Bahdanau Attention using Election methods aided by Q-Learning0
Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach0
Challenging On Car Racing Problem from OpenAI gym0
On Solving the 2-Dimensional Greedy Shooter Problem for UAVsCode0
Generalized Speedy Q-learningCode0
Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning0
Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning0
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement LearningCode0
D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments0
ZPD Teaching Strategies for Deep Reinforcement Learning from DemonstrationsCode0
Deep Q-Learning for Same-Day Delivery with Vehicles and Drones0
Momentum-based Accelerated Q-learningCode0
Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation0
Policy Learning for Malaria ControlCode0
Reverse Experience Replay0
Automatic Data Augmentation by Learning the Deterministic PolicyCode0
Adaptive Discretization for Episodic Reinforcement Learning in Metric SpacesCode0
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central InferenceCode0
On the Reduction of Variance and Overestimation of Deep Q-Learning0
Zap Q-Learning With Nonlinear Function Approximation0
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments0
Show:102550
← PrevPage 30 of 39Next →

No leaderboard results yet.