SOTAVerified

Q-Learning

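Q-learning is a model-free reinforcement learning algorithm. It learns an action-value function Q(s, a) that estimates the expected return of taking action a in state s; the policy, which tells an agent what action to take in each state, is obtained by choosing the action with the highest estimated value.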

(Image credit: Playing Atari with Deep Reinforcement Learning)
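
To make the description above concrete, here is a minimal sketch of tabular Q-learning with epsilon-greedy exploration. The environment is a hypothetical five-state chain invented for illustration (it is not taken from any paper listed on this page): the agent starts in state 0, moves left or right, and receives a reward of 1 on reaching the rightmost state. The function names and hyperparameter values are likewise assumptions.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy chain (hypothetical environment).

    Actions: 0 = move left, 1 = move right. The episode ends when the
    agent reaches the last state, which yields a reward of 1.
    """
    rng = random.Random(seed)
    # q[s][a] holds the current estimate of the action value Q(s, a).
    q = [[0.0, 0.0] for _ in range(n_states)]
    terminal = n_states - 1

    for _ in range(episodes):
        s = 0
        while s != terminal:
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[s])
                a = rng.choice([i for i in range(2) if q[s][i] == best])

            # Deterministic chain dynamics (illustrative assumption).
            s_next = max(s - 1, 0) if a == 0 else s + 1
            reward = 1.0 if s_next == terminal else 0.0

            # Q-learning update: move Q(s, a) toward the bootstrapped
            # target r + gamma * max_a' Q(s', a').
            target = reward + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


def greedy_policy(q):
    """Return the index of the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` gives the action the learned table prefers in each state; on this toy chain the non-terminal states should all prefer moving right. The terminal state's entry is never updated, so its value in the policy is not meaningful.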

Papers

Showing 251-300 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | | 0 |
| A Tutorial Introduction to Reinforcement Learning | | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | | 0 |
| Applying Reinforcement Learning to Option Pricing and Hedging | | 0 |
| Active Inference in Hebbian Learning Networks | | 0 |
| Causal Mean Field Multi-Agent Reinforcement Learning | | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | | 0 |
| Application of Deep Q-Network in Portfolio Management | | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | | 0 |
| Application of Deep Q Learning with Simulation Results for Elevator Optimization | | 0 |
| APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games | | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | | 0 |
| Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples | | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | | 0 |
| AoI Minimization in Status Update Control with Energy Harvesting Sensors | | 0 |
| Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks | | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | | 0 |
| Accelerating Goal-Directed Reinforcement Learning by Model Characterization | | 0 |
| Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems | | 0 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | | 0 |
| Combining policy gradient and Q-learning | | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | | 0 |
| A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach | | 0 |
| A Novel Reinforcement Learning Model for Post-Incident Malware Investigations | | 0 |
| Active Deep Q-learning with Demonstration | | 0 |
| CAQL: Continuous Action Q-Learning | | 0 |
| A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game | | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | | 0 |
| Accelerated Value Iteration via Anderson Mixing | | 0 |
| A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle | | 0 |
| An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems | | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | | 0 |
| Accelerated Target Updates for Q-learning | | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | | 0 |
| A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms | | 0 |
| Anomaly Detection via Learning-Based Sequential Controlled Sensing | | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | | 0 |
| Action-modulated midbrain dopamine activity arises from distributed control policies | | 0 |
| An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions | | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | | 0 |

No leaderboard results yet.