| Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm | Sep 2, 2024 | Q-Learning | —Unverified | 0 |
| Imitating Language via Scalable Inverse Reinforcement Learning | Sep 2, 2024 | Diversity, Imitation Learning | —Unverified | 0 |
| The Sample-Communication Complexity Trade-off in Federated Q-Learning | Aug 30, 2024 | Q-Learning | —Unverified | 0 |
| Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization | Aug 29, 2024 | Q-Learning | —Unverified | 0 |
| On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes | Aug 29, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Dynamic operator management in meta-heuristics using reinforcement learning: an application to permutation flowshop scheduling problems | Aug 27, 2024 | Management, Q-Learning | —Unverified | 0 |
| Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning | Aug 26, 2024 | Contrastive Learning, Q-Learning | —Unverified | 0 |
| Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Aug 23, 2024 | Hallucination, Prompt Engineering | —Unverified | 0 |
| Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning | Aug 21, 2024 | Q-Learning | —Unverified | 0 |
| GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits | Aug 19, 2024 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 |
| Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication | Aug 17, 2024 | Collision Avoidance, Q-Learning | —Unverified | 0 |
| A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Aug 15, 2024 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 |
| Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity | Aug 13, 2024 | Q-Learning | —Unverified | 0 |
| A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm | Aug 9, 2024 | Q-Learning | —Unverified | 0 |
| Crowd Intelligence for Early Misinformation Prediction on Social Media | Aug 8, 2024 | Fact Checking, Misinformation | Code Available | 0 |
| QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction | Aug 6, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach | Aug 6, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Whittle's index-based age-of-information minimization in multi-energy harvesting source networks | Aug 5, 2024 | Q-Learning, Scheduling | —Unverified | 0 |
| Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response | Aug 4, 2024 | Decision Making, Malware Analysis | —Unverified | 0 |
| Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems | Aug 2, 2024 | Deep Reinforcement Learning, Multi-Objective Reinforcement Learning | —Unverified | 0 |
| Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network | Jul 31, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Evolution of cooperation with Q-learning: the impact of information perception | Jul 29, 2024 | Diversity, Q-Learning | —Unverified | 0 |
| Evolution of cooperation in the public goods game with Q-learning | Jul 29, 2024 | Decision Making, Imitation Learning | —Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning for Energy Efficient Multi-Hop STAR-RIS-Assisted Transmissions | Jul 26, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| QT-TDM: Planning With Transformer Dynamics Model and Autoregressive Q-Learning | Jul 26, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Principal-Agent Reinforcement Learning: Orchestrating AI Agents with Contracts | Jul 25, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Long-term Fairness in Ride-Hailing Platform | Jul 25, 2024 | Fairness, Q-Learning | —Unverified | 0 |
| In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning | Jul 23, 2024 | Multi-Objective Reinforcement Learning, Q-Learning | —Unverified | 0 |
| MODRL-TA: A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search | Jul 22, 2024 | Data Augmentation, Deep Reinforcement Learning | —Unverified | 0 |
| Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN | Jul 22, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL | Jul 20, 2024 | Few-Shot Text Classification, Q-Learning | Code Available | 0 |
| Coverage-aware and Reinforcement Learning Using Multi-agent Approach for HD Map QoS in a Realistic Environment | Jul 19, 2024 | Q-Learning | —Unverified | 0 |
| An Agile Adaptation Method for Multi-mode Vehicle Communication Networks | Jul 18, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement Learning, General Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Multi-Objective Optimization: Enhancing Wind Turbine Energy Generation while Mitigating Noise Emissions | Jul 18, 2024 | Deep Reinforcement Learning, Pitch control | —Unverified | 0 |
| Solving the Model Unavailable MARE using Q-Learning Algorithm | Jul 18, 2024 | Q-Learning | —Unverified | 0 |
| Optimistic Q-learning for average reward and episodic reinforcement learning | Jul 18, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Misspecified Q-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error | Jul 18, 2024 | Q-Learning | —Unverified | 0 |
| Exploration in Knowledge Transfer Utilizing Reinforcement Learning | Jul 15, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Cooperative Reward Shaping for Multi-Agent Pathfinding | Jul 15, 2024 | Collision Avoidance, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning in High-frequency Market Making | Jul 14, 2024 | Q-Learning, reinforcement-learning | Code Available | 1 |
| PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization | Jul 12, 2024 | Deep Reinforcement Learning, Imitation Learning | —Unverified | 0 |
| PID Accelerated Temporal Difference Algorithms | Jul 11, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Periodic agent-state based Q-learning for POMDPs | Jul 8, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Simplifying Deep Temporal Difference Learning | Jul 5, 2024 | Q-Learning, Reinforcement Learning (RL) | Code Available | 3 |
| Unified continuous-time q-learning for mean-field game and mean-field control problems | Jul 5, 2024 | Q-Learning | —Unverified | 0 |
| A Multi-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games | Jul 5, 2024 | Q-Learning | Code Available | 0 |
| Robust Q-Learning for finite ambiguity sets | Jul 5, 2024 | Q-Learning | Code Available | 0 |
| Configuring Transmission Thresholds in IIoT Alarm Scenarios for Energy-Efficient Event Reporting | Jul 4, 2024 | Q-Learning, Scheduling | —Unverified | 0 |
| Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation | Jul 4, 2024 | Q-Learning, reinforcement-learning | Code Available | 1 |