| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RL, Q-Learning | —Unverified | 0 |
| Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning | Oct 10, 2023 | Imitation Learning, Q-Learning | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles | Apr 2, 2025 | Autonomous Navigation, Autonomous Vehicles | —Unverified | 0 |
| Investigating Reinforcement Learning Agents for Continuous State Space Environments | Aug 8, 2017 | OpenAI Gym, Q-Learning | —Unverified | 0 |
| Investigating the Edge of Stability Phenomenon in Reinforcement Learning | Jul 9, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Investigating the Properties of Neural Network Representations in Reinforcement Learning | Mar 30, 2022 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture | Sep 15, 2022 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | Jun 1, 2023 | D4RL, Model-based Reinforcement Learning | —Unverified | 0 |
| Is Q-learning an Ill-posed Problem? | Feb 20, 2025 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Is Q-Learning Provably Efficient? An Extended Analysis | Sep 22, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Is Risk-Sensitive Reinforcement Learning Properly Resolved? | Jul 2, 2023 | Distributional Reinforcement Learning, Management | —Unverified | 0 |
| "Jam Me If You Can": Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications | Apr 8, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Joint Inference of Reward Machines and Policies for Reinforcement Learning | Sep 12, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator | Apr 1, 2018 | Information Retrieval, Q-Learning | —Unverified | 0 |
| Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics | Apr 20, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications | Dec 8, 2023 | Q-Learning, Scheduling | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RL, Kolmogorov-Arnold Networks | —Unverified | 0 |
| Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes | Feb 21, 2023 | Learning Theory, Medical Diagnosis | —Unverified | 0 |
| Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RL, Data Augmentation | —Unverified | 0 |
| K-spin Hamiltonian for quantum-resolvable Markov decision processes | Apr 13, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Language Inference with Multi-head Automata through Reinforcement Learning | Oct 20, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning | Aug 10, 2019 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement Learning | Mar 29, 2025 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning agents with prioritization and parameter noise in continuous state and action space | May 1, 2019 | Autonomous Vehicles, Q-Learning | —Unverified | 0 |
| Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space | Oct 15, 2024 | Autonomous Vehicles, Q-Learning | —Unverified | 0 |
| Learning Augmented Index Policy for Optimal Service Placement at the Network Edge | Jan 10, 2021 | Q-Learning | —Unverified | 0 |
| Learning Automata Based Q-learning for Content Placement in Cooperative Caching | Mar 30, 2019 | Q-Learning | —Unverified | 0 |
| Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network | May 28, 2020 | Q-Learning | —Unverified | 0 |
| Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | Sep 6, 2021 | Q-Learning | —Unverified | 0 |
| Learning Best Response Strategies for Agents in Ad Exchanges | Feb 10, 2019 | Q-Learning | —Unverified | 0 |
| Learning Control for Air Hockey Striking using Deep Reinforcement Learning | Feb 26, 2017 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | —Unverified | 0 |
| Learning Dialog Policies from Weak Demonstrations | Apr 23, 2020 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | May 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Learning Explicit Credit Assignment for Multi-agent Joint Q-learning | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing | Sep 16, 2021 | Fairness, Management | —Unverified | 0 |
| Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network | Feb 1, 2025 | continuous-control, Continuous Control | —Unverified | 0 |
| Learning Gaussian Policies from Smoothed Action Value Functions | Jan 1, 2018 | continuous-control, Continuous Control | —Unverified | 0 |
| Learning Hard Alignments with Variational Inference | May 16, 2017 | Hard Attention, Image Captioning | —Unverified | 0 |
| Learning in complex action spaces without policy gradients | Oct 8, 2024 | Policy Gradient Methods, Q-Learning | —Unverified | 0 |
| Learning medical triage from clinicians using Deep Q-Learning | Mar 28, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Learning Movement Strategies for Moving Target Defense | Jan 1, 2021 | Q-Learning | —Unverified | 0 |
| Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation | Sep 1, 2020 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning Negotiating Behavior Between Cars in Intersections using Deep Q-Learning | Oct 24, 2018 | Q-Learning | —Unverified | 0 |
| Learning Neural Control Barrier Functions from Offline Data with Conservatism | May 1, 2025 | Q-Learning | —Unverified | 0 |
| Learning Sampling Policies for Domain Adaptation | May 19, 2018 | Classification, Domain Adaptation | —Unverified | 0 |
| Learning Self-Awareness Models for Physical Layer Security in Cognitive and AI-enabled Radios | Nov 23, 2022 | Q-Learning | —Unverified | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | —Unverified | 0 |