SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 251300 of 1918 papers

TitleStatusHype
Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning0
A Tutorial Introduction to Reinforcement Learning0
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities0
Adaptive Stochastic Resource Control: A Machine Learning Approach0
Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning0
Applying Reinforcement Learning to Option Pricing and Hedging0
Active Inference in Hebbian Learning Networks0
Catalytic evolution of cooperation in a population with behavioural bimodality0
Application of Deep Reinforcement Learning to Payment Fraud0
Application of Deep Q-Network in Portfolio Management0
Adversarial Agents For Attacking Inaudible Voice Activated Devices0
Application of Deep Q Learning with Simulation Results for Elevator Optimization0
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games0
Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement0
Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples0
A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens0
An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning0
Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval0
AoI Minimization in Status Update Control with Energy Harvesting Sensors0
Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks0
Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models0
Accelerating Goal-Directed Reinforcement Learning by Model Characterization0
Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems0
Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms0
An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models0
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Can Q-learning solve Multi Armed Bantids?0
A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach0
A Novel Reinforcement Learning Model for Post-Incident Malware Investigations0
Active Deep Q-learning with Demonstration0
A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game0
A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments0
Accelerated Value Iteration via Anderson Mixing0
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle0
An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems0
A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint0
Accelerated Target Updates for Q-learning0
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory0
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning0
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms0
A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms0
Anomaly Detection via Learning-Based Sequential Controlled Sensing0
Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query0
Action-modulated midbrain dopamine activity arises from distributed control policies0
An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions0
A Differentiable Physics Engine for Deep Learning in Robotics0
A Deep Reinforcement Learning Trader without Offline Training0
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking0
An Independent Study of Reinforcement Learning and Autonomous Driving0
Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning0
Show:102550
← PrevPage 6 of 39Next →

No leaderboard results yet.