| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Privacy Risks in Reinforcement Learning for Household Robots | Jun 15, 2023 | Decision MakingFederated Learning | —Unverified | 0 | 0 |
| Zap Q-Learning | Dec 1, 2017 | Q-Learning | —Unverified | 0 | 0 |
| Zap Q-Learning for Optimal Stopping Time Problems | Apr 25, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Zap Q-Learning With Nonlinear Function Approximation | Oct 11, 2019 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics | Apr 6, 2020 | ClusteringQ-Learning | —Unverified | 0 | 0 |
| Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach | May 3, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Specific investments under negotiated transfer pricing: effects of different surplus sharing parameters on managerial performance: An agent-based simulation with fuzzy Q-learning agents | Mar 25, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Pretrain Soft Q-Learning with Imperfect Demonstrations | May 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem | May 9, 2019 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 | 0 |
| MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning | May 24, 2019 | Decision MakingManagement | —Unverified | 0 | 0 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Feature-Based Q-Learning for Two-Player Stochastic Games | Jun 2, 2019 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| Reinforcement Learning with Non-Markovian Rewards | Dec 5, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| RSRM: Reinforcement Symbolic Regression Machine | May 24, 2023 | MathQ-Learning | —Unverified | 0 | 0 |
| Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems | Aug 2, 2024 | Deep Reinforcement LearningMulti-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| An Agile Adaptation Method for Multi-mode Vehicle Communication Networks | Jul 18, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response | Aug 4, 2024 | Decision MakingMalware Analysis | —Unverified | 0 | 0 |
| QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction | Aug 6, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Nucleolus Credit Assignment for Effective Coalitions in Multi-agent Reinforcement Learning | Mar 1, 2025 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| ShiQ: Bringing back Bellman to LLMs | May 16, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Automatic Reward Shaping from Confounded Offline Data | May 16, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| 3D Simulation for Robot Arm Control with Deep Q-Learning | Sep 13, 2016 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm | Sep 2, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | Jul 22, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Accelerated Value Iteration via Anderson Mixing | Sep 27, 2018 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Accelerating Goal-Directed Reinforcement Learning by Model Characterization | Jan 4, 2019 | modelModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning | Jul 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RLQ-Learning | —Unverified | 0 | 0 |
| A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles | Aug 4, 2020 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 | 0 |
| A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market | May 27, 2023 | Portfolio OptimizationQ-Learning | —Unverified | 0 | 0 |
| A Comparative Study of AI-based Intrusion Detection Techniques in Critical Infrastructures | Jul 24, 2020 | Intrusion DetectionManagement | —Unverified | 0 | 0 |
| A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control | Aug 10, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling | May 19, 2017 | ManagementQ-Learning | —Unverified | 0 | 0 |
| A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Aug 15, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| A General Markov Decision Process Framework for Directly Learning Optimal Control Policies | May 28, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning | Sep 27, 2018 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills | Apr 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | Jun 14, 2018 | Organ SegmentationQ-Learning | —Unverified | 0 | 0 |
| Action-modulated midbrain dopamine activity arises from distributed control policies | Jul 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | Jun 24, 2023 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Active Deep Q-learning with Demonstration | Dec 6, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples | Jun 28, 2020 | Active LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Active Inference in Hebbian Learning Networks | Jun 8, 2023 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Active Perception and Representation for Robotic Manipulation | Mar 15, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning | Aug 24, 2023 | Motion PlanningNavigate | —Unverified | 0 | 0 |