| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration | Sep 29, 2021 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles | Apr 2, 2025 | Autonomous Navigation, Autonomous Vehicles | —Unverified | 0 |
| Investigating Reinforcement Learning Agents for Continuous State Space Environments | Aug 8, 2017 | OpenAI Gym, Q-Learning | —Unverified | 0 |
| Investigating the Edge of Stability Phenomenon in Reinforcement Learning | Jul 9, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach | Mar 6, 2021 | energy management, energy trading | —Unverified | 0 |
| Investigating the Properties of Neural Network Representations in Reinforcement Learning | Mar 30, 2022 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Decentralized model-free reinforcement learning in stochastic games with average-reward objective | Jan 13, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture | Sep 15, 2022 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing | Jun 10, 2019 | Deep Reinforcement Learning, Management | —Unverified | 0 |
| Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | Classification, Deep Reinforcement Learning | —Unverified | 0 |
| Decentralized Multi-Robot Formation Control Using Reinforcement Learning | Jun 26, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Is Q-learning an Ill-posed Problem? | Feb 20, 2025 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | —Unverified | 0 |
| Is Q-Learning Provably Efficient? An Extended Analysis | Sep 22, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Is Risk-Sensitive Reinforcement Learning Properly Resolved? | Jul 2, 2023 | Distributional Reinforcement Learning, Management | —Unverified | 0 |
| "Jam Me If You Can": Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications | Apr 8, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Jun 26, 2024 | Autonomous Vehicles, Decision Making | —Unverified | 0 |
| Joint Inference of Reward Machines and Policies for Reinforcement Learning | Sep 12, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator | Apr 1, 2018 | Information Retrieval, Q-Learning | —Unverified | 0 |
| Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics | Apr 20, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning | Aug 14, 2020 | Autonomous Vehicles, Decision Making | —Unverified | 0 |
| Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications | Dec 8, 2023 | Q-Learning, Scheduling | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RL, Kolmogorov-Arnold Networks | —Unverified | 0 |
| Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes | Feb 21, 2023 | Learning Theory, Medical Diagnosis | —Unverified | 0 |
| Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RL, Data Augmentation | —Unverified | 0 |
| K-spin Hamiltonian for quantum-resolvable Markov decision processes | Apr 13, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Language Inference with Multi-head Automata through Reinforcement Learning | Oct 20, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning | Aug 10, 2019 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions | Dec 3, 2015 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement Learning | Mar 29, 2025 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles | Nov 30, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning agents with prioritization and parameter noise in continuous state and action space | May 1, 2019 | Autonomous Vehicles, Q-Learning | —Unverified | 0 |
| Autonomous Vehicle Decision-Making Framework for Considering Malicious Behavior at Unsignalized Intersections | Sep 11, 2024 | Autonomous Vehicles, Decision Making | —Unverified | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | Apr 24, 2024 | Q-Learning, Scheduling | —Unverified | 0 |
| Learning Augmented Index Policy for Optimal Service Placement at the Network Edge | Jan 10, 2021 | Q-Learning | —Unverified | 0 |
| Learning Automata Based Q-learning for Content Placement in Cooperative Caching | Mar 30, 2019 | Q-Learning | —Unverified | 0 |
| Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network | May 28, 2020 | Q-Learning | —Unverified | 0 |
| Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | Sep 6, 2021 | Q-Learning | —Unverified | 0 |
| Learning Best Response Strategies for Agents in Ad Exchanges | Feb 10, 2019 | Q-Learning | —Unverified | 0 |
| Learning Control for Air Hockey Striking using Deep Reinforcement Learning | Feb 26, 2017 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | —Unverified | 0 |
| Learning Dialog Policies from Weak Demonstrations | Apr 23, 2020 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | May 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Learning Explicit Credit Assignment for Multi-agent Joint Q-learning | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Deep hierarchical reinforcement agents for automated penetration testing | Sep 14, 2021 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing | Sep 16, 2021 | Fairness, Management | —Unverified | 0 |
| Algorithmic Trading with Fitted Q Iteration and Heston Model | May 18, 2018 | Algorithmic Trading, Q-Learning | —Unverified | 0 |
| Autonomous Penetration Testing using Reinforcement Learning | May 15, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 |