| Music Generation using Human-In-The-Loop Reinforcement Learning | Jan 25, 2025 | Music Generation, Q-Learning | Unverified | 0 |
| Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework | Jan 24, 2025 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch | Jan 23, 2025 | Graph Attention, Graph Sampling | Unverified | 0 |
| Random-Key Algorithms for Optimizing Integrated Operating Room Scheduling | Jan 17, 2025 | Combinatorial Optimization, Decoder | Unverified | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RL, Q-Learning | Unverified | 0 |
| SPEQ: Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning | Jan 15, 2025 | Computational Efficiency, continuous-control | Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | Benchmarking, Management | Unverified | 0 |
| Online inductive learning from answer sets for efficient reinforcement learning exploration | Jan 13, 2025 | Inductive Learning, Inductive logic programming | Unverified | 0 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis | Jan 11, 2025 | Q-Learning | Unverified | 0 |
| Deep Transfer Q-Learning for Offline Non-Stationary Reinforcement Learning | Jan 8, 2025 | Decision Making, Inductive Learning | Unverified | 0 |
| β-DQN: Improving Deep Q-Learning By Evolving the Behavior | Jan 1, 2025 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients | Dec 30, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Dynamic Optimization of Storage Systems Using Reinforcement Learning Techniques | Dec 29, 2024 | CPU, Q-Learning | Unverified | 0 |
| Protein Structure Prediction in the 3D HP Model Using Deep Reinforcement Learning | Dec 29, 2024 | Deep Reinforcement Learning, Protein Structure Prediction | Unverified | 0 |
| A Reinforcement Learning-Based Task Mapping Method to Improve the Reliability of Clustered Manycores | Dec 26, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian Optimization, Hyperparameter Optimization | Unverified | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RL, Q-Learning | Unverified | 0 |
| Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks | Dec 22, 2024 | Q-Learning | Unverified | 0 |
| Decoding fairness: a reinforcement learning perspective | Dec 20, 2024 | Fairness, Imitation Learning | Code Available | 0 |
| MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control | Dec 20, 2024 | Graph Attention, Q-Learning | Code Available | 0 |
| Distribution-Free Uncertainty Quantification in Mechanical Ventilation Treatment: A Conformal Deep Q-Learning Framework | Dec 17, 2024 | Conformal Prediction, Deep Reinforcement Learning | Unverified | 0 |
| Neural-Network-Driven Reward Prediction as a Heuristic: Advancing Q-Learning for Mobile Robot Path Planning | Dec 17, 2024 | Q-Learning | Unverified | 0 |
| Integrated trucks assignment and scheduling problem with mixed service mode docks: A Q-learning based adaptive large neighborhood search algorithm | Dec 12, 2024 | Q-Learning, Scheduling | Unverified | 0 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios | Dec 9, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Demonstration Selection for In-Context Learning via Reinforcement Learning | Dec 5, 2024 | Classification, Diversity | Unverified | 0 |
| Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support | Dec 3, 2024 | Computational Efficiency, Fairness | Unverified | 0 |
| Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning | Dec 1, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Q-learning-based Model-free Safety Filter | Nov 29, 2024 | model, Q-Learning | Unverified | 0 |
| Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management | Nov 27, 2024 | Decision Making, Management | Unverified | 0 |
| Time-Scale Separation in Q-Learning: Extending TD(Δ) for Action-Value Function Decomposition | Nov 21, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | Nov 20, 2024 | Q-Learning | Unverified | 0 |
| Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Nov 18, 2024 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning | Nov 17, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Coverage Analysis for Digital Cousin Selection -- Improving Multi-Environment Q-Learning | Nov 13, 2024 | Q-Learning | Unverified | 0 |
| Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization | Nov 12, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning | Nov 12, 2024 | Imitation Learning, Offline RL | Unverified | 0 |
| Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind | Nov 11, 2024 | Q-Learning | Code Available | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Reinforcement Learning for Adaptive Resource Scheduling in Complex System Environments | Nov 8, 2024 | Cloud Computing, Edge-computing | Unverified | 0 |
| Asymptotic regularity of a generalised stochastic Halpern scheme with applications | Nov 7, 2024 | Q-Learning, Stochastic Optimization | Unverified | 0 |
| Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning | Nov 7, 2024 | Offline RL, Policy Gradient Methods | Unverified | 0 |
| Maximizing User Connectivity in AI-Enabled Multi-UAV Networks: A Distributed Strategy Generalized to Arbitrary User Distributions | Nov 7, 2024 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning | Nov 7, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Temporal-Difference Learning Using Distributed Error Signals | Nov 6, 2024 | Q-Learning | Code Available | 0 |
| Simulation of Nanorobots with Artificial Intelligence and Reinforcement Learning for Advanced Cancer Cell Detection and Tracking | Nov 4, 2024 | Cell Detection, Navigate | Code Available | 0 |
| Regret of exploratory policy improvement and q-learning | Nov 2, 2024 | Q-Learning | Unverified | 0 |
| HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search | Nov 1, 2024 | Q-Learning | Unverified | 0 |