| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle | Oct 27, 2020 | energy management, Management | —Unverified | 0 | 0 |
| Investigating Reinforcement Learning Agents for Continuous State Space Environments | Aug 8, 2017 | OpenAI Gym, Q-Learning | —Unverified | 0 | 0 |
| Investigating the Edge of Stability Phenomenon in Reinforcement Learning | Jul 9, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Causal Mean Field Multi-Agent Reinforcement Learning | Feb 20, 2025 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Investigating the Properties of Neural Network Representations in Reinforcement Learning | Mar 30, 2022 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning | Jun 1, 2021 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture | Sep 15, 2022 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning | Oct 26, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Causal Deep Reinforcement Learning Using Observational Data | Nov 28, 2022 | Autonomous Driving, Causal Inference | —Unverified | 0 | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | Sep 24, 2020 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | Jul 1, 2024 | Deep Reinforcement Learning, energy management | —Unverified | 0 | 0 |
| EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning | Mar 29, 2024 | Navigate, Q-Learning | —Unverified | 0 | 0 |
| Encoders and Decoders for Quantum Expander Codes Using Machine Learning | Sep 6, 2019 | BIG-bench Machine Learning, Decoder | —Unverified | 0 | 0 |
| Is Risk-Sensitive Reinforcement Learning Properly Resolved? | Jul 2, 2023 | Distributional Reinforcement Learning, Management | —Unverified | 0 | 0 |
| "Jam Me If You Can": Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications | Apr 8, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms | Feb 7, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPU, Offline RL | —Unverified | 0 | 0 |
| Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator | Apr 1, 2018 | Information Retrieval, Q-Learning | —Unverified | 0 | 0 |
| Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics | Apr 20, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Empirical Q-Value Iteration | Nov 30, 2014 | Q-Learning | —Unverified | 0 | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RL, Kolmogorov-Arnold Networks | —Unverified | 0 | 0 |
| Empirically Evaluating Multiagent Learning Algorithms | Jan 31, 2014 | Q-Learning | —Unverified | 0 | 0 |
| Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RL, Data Augmentation | —Unverified | 0 | 0 |
| K-spin Hamiltonian for quantum-resolvable Markov decision processes | Apr 13, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Language Inference with Multi-head Automata through Reinforcement Learning | Oct 20, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning | Aug 10, 2019 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring | Jul 18, 2017 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement Learning | Mar 29, 2025 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Catalytic evolution of cooperation in a population with behavioural bimodality | Jun 17, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Emergence of cooperation under punishment: A reinforcement learning perspective | Jan 29, 2024 | Imitation Learning, Q-Learning | —Unverified | 0 | 0 |
| Emergence of Addictive Behaviors in Reinforcement Learning Agents | Nov 14, 2018 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network | May 2, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | May 3, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Learning Automata Based Q-learning for Content Placement in Cooperative Caching | Mar 30, 2019 | Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | May 11, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | Sep 6, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Learning Best Response Strategies for Agents in Ad Exchanges | Feb 10, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Learning Control for Air Hockey Striking using Deep Reinforcement Learning | Feb 26, 2017 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | Jul 22, 2018 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Learning Dialog Policies from Weak Demonstrations | Apr 23, 2020 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | May 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Learning Explicit Credit Assignment for Multi-agent Joint Q-learning | Sep 29, 2021 | Q-Learning | —Unverified | 0 | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RL, Decision Making | —Unverified | 0 | 0 |
| Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing | Sep 16, 2021 | Fairness, Management | —Unverified | 0 | 0 |
| Elastic Decision Transformer | Jul 5, 2023 | Atari Games, D4RL | —Unverified | 0 | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | Sep 11, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |