SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 2650 of 1918 papers

TitleStatusHype
DisCor: Corrective Feedback in Reinforcement Learning via Distribution CorrectionCode1
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via DiscretisationCode1
Dropout Q-Functions for Doubly Efficient Reinforcement LearningCode1
Energy-based Surprise Minimization for Multi-Agent Value FactorizationCode1
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement LearningCode1
Conservative Q-Learning for Offline Reinforcement LearningCode1
Deep Recurrent Q-Learning for Partially Observable MDPsCode1
Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19Code1
Backprop-Free Reinforcement Learning with Active Neural Generative CodingCode1
When should we prefer Decision Transformers for Offline Reinforcement Learning?Code1
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement LearningCode1
Benchmarking Batch Deep Reinforcement Learning AlgorithmsCode1
Acting in Delayed Environments with Non-Stationary Markov PoliciesCode1
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-TuningCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Boosting Continuous Control with Consistency PolicyCode1
Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman ProblemCode1
Reinforcement Learning in High-frequency Market MakingCode1
Continuous control with deep reinforcement learningCode1
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement LearningCode1
Deep Active Inference for Partially Observable MDPsCode1
Deep Reinforcement Learning with Double Q-learningCode1
Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial DetectionCode1
A Recipe for Unbounded Data Augmentation in Visual Reinforcement LearningCode1
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
Show:102550
← PrevPage 2 of 77Next →

No leaderboard results yet.