| Deep Q-Network-Driven Catheter Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning and Dual-UNet | Jun 25, 2020 | Q-Learning | —Unverified | 0 |
| Deep Q-Network for Stochastic Process Environments | Aug 7, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | —Unverified | 0 |
| Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot | Oct 1, 2022 | Q-Learning | —Unverified | 0 |
| Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense | Jan 28, 2023 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Fuzzing | Jan 14, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | Jan 15, 2014 | BIG-bench Machine LearningClustering | —Unverified | 0 |
| Deep reinforcement learning applied to an assembly sequence planning problem with user preferences | Apr 13, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation | Oct 7, 2023 | Q-Learning | —Unverified | 0 |
| Analytics of Business Time Series Using Machine Learning and Bayesian Inference | May 25, 2022 | Bayesian InferenceBIG-bench Machine Learning | —Unverified | 0 |
| Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | Jan 28, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARCDecision Making | —Unverified | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit | Nov 18, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Automatic Reward Shaping from Confounded Offline Data | May 16, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL | May 28, 2025 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch | Jan 23, 2025 | Graph AttentionGraph Sampling | —Unverified | 0 |
| Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent | Jul 15, 2020 | Atari GamesQ-Learning | —Unverified | 0 |
| Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games | May 27, 2024 | Q-Learning | —Unverified | 0 |
| Blackwell Online Learning for Markov Decision Processes | Dec 28, 2020 | Learning TheoryQ-Learning | —Unverified | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | Apr 30, 2019 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Differentiable Quantum Architecture Search for Quantum Reinforcement Learning | Sep 19, 2023 | Q-LearningQuantum Machine Learning | —Unverified | 0 |
| Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning | Oct 28, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| An Adiabatic Theorem for Policy Tracking with TD-learning | Oct 24, 2020 | Q-Learning | —Unverified | 0 |
| Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making | May 12, 2025 | Bayesian InferenceDecision Making | —Unverified | 0 |
| A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation | Sep 10, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions | May 15, 2020 | Q-Learning | —Unverified | 0 |
| 3D Simulation for Robot Arm Control with Deep Q-Learning | Sep 13, 2016 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading | Feb 9, 2023 | Edge-computingQ-Learning | —Unverified | 0 |
| Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning | Aug 6, 1999 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Best Possible Q-Learning | Feb 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | Jun 6, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Benchmarking projective simulation in navigation problems | Apr 23, 2018 | BenchmarkingQ-Learning | —Unverified | 0 |
| A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids | Jan 5, 2024 | Q-LearningScheduling | —Unverified | 0 |
| A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning | Sep 27, 2018 | Atari GamesQ-Learning | —Unverified | 0 |
| Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways | Dec 6, 2020 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Amortized Noisy Channel Neural Machine Translation | Dec 16, 2021 | Imitation LearningKnowledge Distillation | —Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach For Safe and Efficient Behavior Planning Of Connected Autonomous Vehicles | Mar 9, 2020 | Autonomous VehiclesMulti-agent Reinforcement Learning | —Unverified | 0 |
| A deep Q-Learning based Path Planning and Navigation System for Firefighting Environments | Nov 12, 2020 | Q-Learning | —Unverified | 0 |
| DGFN: Double Generative Flow Networks | Oct 30, 2023 | Drug DiscoveryQ-Learning | —Unverified | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| β-DQN: Improving Deep Q-Learning By Evolving the Behavior | Jan 1, 2025 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Aug 17, 2016 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes | Oct 4, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |