
Q-Learning

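Q-learning is a model-free reinforcement learning algorithm whose goal is to learn a policy that tells an agent which action to take in each state. It does so by learning an action-value function Q(s, a), an estimate of the expected return from taking action a in state s.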
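As a rough illustration of how such a policy can be learned, here is a minimal sketch of tabular Q-learning. The environment is a hypothetical 1-D chain invented for this example (states 0 to 4, actions 0 = left and 1 = right, reward 1 on reaching the last state), and the function names and hyperparameter values (`alpha`, `gamma`, `epsilon`) are assumptions, not taken from any paper listed here.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical chain environment.

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Entering the last state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    terminal = n_states - 1

    for _ in range(episodes):
        s = 0
        while s != terminal:
            # Epsilon-greedy action selection; ties are broken at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic transition of the toy environment.
            s_next = max(s - 1, 0) if a == 0 else s + 1
            r = 1.0 if s_next == terminal else 0.0

            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            # The terminal row is never updated, so its maximum stays 0.
            td_target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (td_target - q[s][a])
            s = s_next
    return q


def greedy_policy(q):
    """Return the action with the highest Q-value for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    q_table = q_learning()
    print(greedy_policy(q_table))
```

The learned policy is read off the table by taking the highest-valued action in each state. The entry for the terminal state carries no meaning because no action is ever taken there.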

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 1501 to 1550 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | | 0 |
| Adaptive Knowledge-based Multi-Objective Evolutionary Algorithm for Hybrid Flow Shop Scheduling Problems with Multiple Parallel Batch Processing Stages | | 0 |
| Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks | | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | | 0 |
| Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach | | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | | 0 |
| Adaptive Structural Hyper-Parameter Configuration by Q-Learning | | 0 |
| A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems | | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | | 0 |
| Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning | | 0 |
| A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization | | 0 |
| A deep Q-Learning based Path Planning and Navigation System for Firefighting Environments | | 0 |
| A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids | | 0 |
| A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions | | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | | 0 |
| A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | | 0 |
| A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback | | 0 |
| A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks | | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | | 0 |
| Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach | | 0 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | | 0 |
| A Finite Sample Complexity Bound for Distributionally Robust Q-learning | | 0 |
| A finite time analysis of distributed Q-learning | | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | | 0 |
| A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation | | 0 |
| A Flexible Framework for Incorporating Patient Preferences Into Q-Learning | | 0 |
| A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks | | 0 |
| A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms | | 0 |
| A General Framework for Learning Mean-Field Games | | 0 |
| A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging | | 0 |
| Agent-state based policies in POMDPs: Beyond belief-state MDPs | | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | | 0 |
| Age-of-information minimization via opportunistic sampling by an energy harvesting source | | 0 |
| Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks | | 0 |
| A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm | | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | | 0 |
Page 31 of 39

No leaderboard results yet.