| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | Unverified | 0 |
| Artificial Intelligence and Algorithmic Price Collusion in Two-sided Markets | Jul 4, 2024 | Q-Learning | Unverified | 0 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data Augmentation, LIDAR Semantic Segmentation | Code Available | 2 |
| Two-Step Q-Learning | Jul 2, 2024 | Q-Learning | Unverified | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | Jul 1, 2024 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RL, Offline RL | Unverified | 0 |
| Towards Secure and Efficient Data Scheduling for Vehicular Social Networks | Jun 28, 2024 | Q-Learning, Scheduling | Unverified | 0 |
| Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors | Jun 28, 2024 | Car Racing, Q-Learning | Code Available | 0 |
| Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Jun 26, 2024 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Boosting Soft Q-Learning by Bounding | Jun 26, 2024 | Q-Learning | Code Available | 0 |
| MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention | Jun 24, 2024 | Imitation Learning, Q-Learning | Unverified | 0 |
| A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms | Jun 20, 2024 | Learning Theory, Q-Learning | Unverified | 0 |
| Learning to Select Goals in Automated Planning with Deep-Q Learning | Jun 20, 2024 | Q-Learning | Unverified | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | Unverified | 0 |
| EduQate: Generating Adaptive Curricula through RMABs in Education Settings | Jun 20, 2024 | Multi-Armed Bandits, Q-Learning | Unverified | 0 |
| Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry | Jun 18, 2024 | Q-Learning | Code Available | 0 |
| Catalytic evolution of cooperation in a population with behavioural bimodality | Jun 17, 2024 | Q-Learning | Unverified | 0 |
| Optimal Transport-Assisted Risk-Sensitive Q-Learning | Jun 17, 2024 | Decision Making, Q-Learning | Unverified | 0 |
| Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Jun 14, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors | Jun 12, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker-Planck Equation | Jun 12, 2024 | Q-Learning | Code Available | 0 |
| PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Jun 10, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network | Jun 7, 2024 | Q-Learning | Unverified | 0 |
| Online Frequency Scheduling by Learning Parallel Actions | Jun 7, 2024 | Graph Neural Network, Q-Learning | Unverified | 0 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RL, Offline RL | Code Available | 0 |
| Strategically Conservative Q-Learning | Jun 6, 2024 | D4RL, Offline RL | Code Available | 1 |
| Bootstrapping Expectiles in Reinforcement Learning | Jun 6, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks | Jun 4, 2024 | Philosophy, Q-Learning | Unverified | 0 |
| Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Towards Universal and Black-Box Query-Response Only Attack on LLMs with QROA | Jun 4, 2024 | Q-Learning | Code Available | 1 |
| Tabular and Deep Learning for the Whittle Index | Jun 4, 2024 | Deep Learning, Q-Learning | Unverified | 0 |
| How to discretize continuous state-action spaces in Q-learning: A symbolic control approach | Jun 3, 2024 | Q-Learning | Unverified | 0 |
| Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation | May 31, 2024 | Q-Learning | Code Available | 0 |
| Approximate Global Convergence of Independent Learning in Multi-Agent Systems | May 30, 2024 | Q-Learning | Unverified | 0 |
| Q-learning as a monotone scheme | May 30, 2024 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RL, Denoising | Code Available | 1 |
| Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost | May 29, 2024 | Q-Learning | Unverified | 0 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation Learning, MuJoCo | Code Available | 0 |
| Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | May 28, 2024 | Autonomous Driving, Bilevel Optimization | Code Available | 2 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RL, Offline RL | Code Available | 0 |
| Highway Reinforcement Learning | May 28, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Mutation-Bias Learning in Games | May 28, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning | May 27, 2024 | Data Augmentation, Q-Learning | Code Available | 1 |
| Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games | May 27, 2024 | Q-Learning | Unverified | 0 |
| Reinforcement Learning for Jump-Diffusions, with Financial Applications | May 26, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision Making, Q-Learning | Unverified | 0 |
| SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning | May 24, 2024 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning | May 24, 2024 | Language Modelling, Large Language Model | Unverified | 0 |