| Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity | Feb 17, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | Dec 1, 2020 | Q-Learning | —Unverified | 0 | 0 |
| A Graph Attention Learning Approach to Antenna Tilt Optimization | Dec 27, 2021 | Graph AttentionQ-Learning | —Unverified | 0 | 0 |
| A Hybrid PAC Reinforcement Learning Algorithm | Sep 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | —Unverified | 0 | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions | Sep 20, 2023 | Deep Reinforcement LearningHyperparameter Optimization | —Unverified | 0 | 0 |
| AI on the Water: Applying DRL to Autonomous Vessel Navigation | Oct 23, 2023 | Collision AvoidanceDecision Making | —Unverified | 0 | 0 |
| A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks | Mar 10, 2021 | Q-LearningScheduling | —Unverified | 0 | 0 |
| A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows | May 9, 2025 | Combinatorial OptimizationLanguage Modeling | —Unverified | 0 | 0 |
| A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management | Mar 2, 2022 | ManagementQ-Learning | —Unverified | 0 | 0 |
| Algorithmic Collusion and Price Discrimination: The Over-Usage of Data | Mar 10, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Algorithmic Collusion under Observed Demand Shocks | Feb 20, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Algorithmic Trading with Fitted Q Iteration and Heston Model | May 18, 2018 | Algorithmic TradingQ-Learning | —Unverified | 0 | 0 |
| A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning | Feb 13, 2023 | energy managementManagement | —Unverified | 0 | 0 |
| Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | Nov 20, 2024 | Q-Learning | —Unverified | 0 | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | Feb 2, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | Mar 11, 2022 | BIG-bench Machine LearningManagement | —Unverified | 0 | 0 |
| A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks | Jul 20, 2020 | BIG-bench Machine LearningEdge-computing | —Unverified | 0 | 0 |
| A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning | Aug 1, 2022 | Asset ManagementDeep Reinforcement Learning | —Unverified | 0 | 0 |
| A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes | Oct 4, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Amortized Noisy Channel Neural Machine Translation | Dec 16, 2021 | Imitation LearningKnowledge Distillation | —Unverified | 0 | 0 |
| Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways | Dec 6, 2020 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | Jun 6, 2020 | Q-LearningScheduling | —Unverified | 0 | 0 |
| A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation | Sep 10, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| An Adiabatic Theorem for Policy Tracking with TD-learning | Oct 24, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games | May 27, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent | Jul 15, 2020 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit | Nov 18, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Analytics of Business Time Series Using Machine Learning and Bayesian Inference | May 25, 2022 | Bayesian InferenceBIG-bench Machine Learning | —Unverified | 0 | 0 |
| Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense | Jan 28, 2023 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 | 0 |
| An Attempt to Model Human Trust with Reinforcement Learning | Sep 29, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | Nov 26, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| An Efficient and Uncertainty-aware Reinforcement Learning Framework for Quality Assurance in Extrusion Additive Manufacturing | Mar 2, 2025 | Q-LearningUncertainty Quantification | —Unverified | 0 | 0 |
| An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems | Dec 6, 2023 | Q-Learning | —Unverified | 0 | 0 |
| An Elementary Proof that Q-learning Converges Almost Surely | Aug 5, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | Jan 6, 2024 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning | Oct 15, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | May 3, 2024 | Q-Learning | —Unverified | 0 | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | Sep 24, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| A new multilayer optical film optimal method based on deep q-learning | Dec 7, 2018 | Q-Learning | —Unverified | 0 | 0 |
| An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation | May 25, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 RegularizationOpenAI Gym | —Unverified | 0 | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous DrivingOpenAI Gym | —Unverified | 0 | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-LearningScheduling | —Unverified | 0 | 0 |