| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Cooperative Deep Q-learning Framework for Environments Providing Image Feedback | Oct 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Cooperative Control of Mobile Robots with Stackelberg Learning | Aug 3, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A Probabilistic Simulator of Spatial Demand for Product Allocation | Jan 9, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Approximation of Convex Envelope Using Reinforcement Learning | Nov 24, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Finite Sample Complexity Bound for Distributionally Robust Q-learning | Feb 26, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Active Perception and Representation for Robotic Manipulation | Mar 15, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning | Jul 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| An Agile Adaptation Method for Multi-mode Vehicle Communication Networks | Jul 18, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Convex Q-Learning, Part 1: Deterministic Optimal Control | Aug 8, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Convex Q Learning in a Stochastic Environment: Extended Version | Sep 10, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Convert Language Model into a Value-based Strategic Planner | May 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation | Dec 1, 2009 | Q-Learning | —Unverified | 0 | 0 |
| Does DQN Learn? | May 26, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | Jan 16, 2022 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Convergent Reinforcement Learning with Function Approximation: A Bilevel Optimization Perspective | Sep 27, 2018 | Bilevel OptimizationQ-Learning | —Unverified | 0 | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Approximate Nash Equilibrium Learning for n-Player Markov Games in Dynamic Pricing | Jul 13, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Convergence Results For Q-Learning With Experience Replay | Dec 8, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence | Mar 25, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Approximate Kalman Filter Q-Learning for Continuous State-Space MDPs | Sep 26, 2013 | Q-Learning | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | Mar 22, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Approximate information state based convergence analysis of recurrent Q-learning | Jun 9, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Approximate Global Convergence of Independent Learning in Multi-Agent Systems | May 30, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning | Nov 1, 2018 | Dependency ParsingImitation Learning | —Unverified | 0 | 0 |
| Applying Reinforcement Learning to Option Pricing and Hedging | Oct 6, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach | Dec 21, 2021 | PositionQ-Learning | —Unverified | 0 | 0 |
| Active Inference in Hebbian Learning Networks | Jun 8, 2023 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Continuous-time q-learning for mean-field control problems | Jun 28, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Application of Deep Q-Network in Portfolio Management | Mar 13, 2020 | Deep Reinforcement LearningFace Recognition | —Unverified | 0 | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | Jul 23, 2023 | CyberBattleSimQ-Learning | —Unverified | 0 | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Application of Deep Q Learning with Simulation Results for Elevator Optimization | Sep 30, 2022 | Q-Learning | —Unverified | 0 | 0 |
| APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games | Mar 17, 2025 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | Apr 12, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples | Jun 28, 2020 | Active LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-ExpertsOpenAI Gym | —Unverified | 0 | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | Jul 13, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| Constructing narrative using a generative model and continuous action policies | Sep 1, 2017 | Paraphrase IdentificationQ-Learning | —Unverified | 0 | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | MisconceptionsMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |