| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| A Tutorial Introduction to Reinforcement Learning | Apr 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | Jan 15, 2014 | BIG-bench Machine LearningClustering | —Unverified | 0 |
| Applying Reinforcement Learning to Option Pricing and Hedging | Oct 6, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Active Inference in Hebbian Learning Networks | Jun 8, 2023 | OpenAI GymQ-Learning | —Unverified | 0 |
| Causal Mean Field Multi-Agent Reinforcement Learning | Feb 20, 2025 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Application of Deep Q-Network in Portfolio Management | Mar 13, 2020 | Deep Reinforcement LearningFace Recognition | —Unverified | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | Jul 23, 2023 | CyberBattleSimQ-Learning | —Unverified | 0 |
| Application of Deep Q Learning with Simulation Results for Elevator Optimization | Sep 30, 2022 | Q-Learning | —Unverified | 0 |
| APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games | Mar 17, 2025 | Atari GamesQ-Learning | —Unverified | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | Apr 12, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples | Jun 28, 2020 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | Jul 13, 2021 | Q-Learning | —Unverified | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | MisconceptionsMulti-agent Reinforcement Learning | —Unverified | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | Jan 10, 2024 | Q-LearningRhythm | —Unverified | 0 |
| AoI Minimization in Status Update Control with Energy Harvesting Sensors | Sep 9, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks | Feb 22, 2020 | Q-Learning | —Unverified | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | Nov 9, 2023 | Algorithmic TradingQ-Learning | —Unverified | 0 |
| Accelerating Goal-Directed Reinforcement Learning by Model Characterization | Jan 4, 2019 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems | Aug 2, 2024 | Deep Reinforcement LearningMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | Oct 12, 2022 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Combining policy gradient and Q-learning | Nov 5, 2016 | Atari GamesQ-Learning | —Unverified | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | May 9, 2024 | Hierarchical Reinforcement LearningManagement | —Unverified | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Dec 1, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach | Aug 10, 2022 | Bayesian InferenceQ-Learning | —Unverified | 0 |
| A Novel Reinforcement Learning Model for Post-Incident Malware Investigations | Oct 19, 2024 | Malware DetectionQ-Learning | —Unverified | 0 |
| Active Deep Q-learning with Demonstration | Dec 6, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| CAQL: Continuous Action Q-Learning | Sep 26, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game | Mar 9, 2025 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Accelerated Value Iteration via Anderson Mixing | Sep 27, 2018 | Atari GamesQ-Learning | —Unverified | 0 |
| A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle | Mar 22, 2022 | Q-Learning | —Unverified | 0 |
| An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems | Jun 9, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | Sep 11, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | Sep 24, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | Mar 27, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms | Feb 20, 2025 | Q-Learning | —Unverified | 0 |
| Anomaly Detection via Learning-Based Sequential Controlled Sensing | Nov 30, 2023 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | Jun 24, 2023 | Atari GamesDecision Making | —Unverified | 0 |
| Action-modulated midbrain dopamine activity arises from distributed control policies | Jul 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions | Feb 2, 2025 | Q-Learning | —Unverified | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPUDeep Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | Mar 1, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-LearningScheduling | —Unverified | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous DrivingOpenAI Gym | —Unverified | 0 |