| Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation: Minimizing makespan in permutation flow shop scheduling problems | Nov 10, 2023 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | Nov 9, 2023 | Algorithmic Trading, Q-Learning | Unverified | 0 |
| Pointer Networks with Q-Learning for Combinatorial Optimization | Nov 5, 2023 | Combinatorial Optimization, Graph Embedding | Unverified | 0 |
| Optimistic Multi-Agent Policy Gradient | Nov 3, 2023 | MuJoCo, Q-Learning | Code Available | 1 |
| Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments | Oct 31, 2023 | Q-Learning, Quantization | Unverified | 0 |
| DGFN: Double Generative Flow Networks | Oct 30, 2023 | Drug Discovery, Q-Learning | Unverified | 0 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision Making, Offline RL | Code Available | 1 |
| Weakly Coupled Deep Q-Networks | Oct 28, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Model-free Posterior Sampling via Learning Rate Randomization | Oct 27, 2023 | model, Q-Learning | Unverified | 0 |
| Lifting the Veil: Unlocking the Power of Depth in Q-learning | Oct 27, 2023 | Learning Theory, Management | Unverified | 0 |
| Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations | Oct 25, 2023 | Q-Learning | Unverified | 0 |
| Reinforcement learning based local path planning for mobile robot | Oct 24, 2023 | Q-Learning, reinforcement-learning | Unverified | 0 |
| On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ε-Greedy Exploration | Oct 24, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| AI on the Water: Applying DRL to Autonomous Vessel Navigation | Oct 23, 2023 | Collision Avoidance, Decision Making | Unverified | 0 |
| Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions | Oct 19, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RL, Q-Learning | Code Available | 1 |
| Bad Values but Good Behavior: Learning Highly Misspecified Bandits and MDPs | Oct 13, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach | Oct 12, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Integrated Sensing and Communication Neighbor Discovery for MANET with Gossip Mechanism | Oct 11, 2023 | Integrated sensing and communication, ISAC | Unverified | 0 |
| Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning | Oct 10, 2023 | Imitation Learning, Q-Learning | Unverified | 0 |
| Suppressing Overestimation in Q-Learning through Adversarial Behaviors | Oct 10, 2023 | Q-Learning | Unverified | 0 |
| Boosting Continuous Control with Consistency Policy | Oct 10, 2023 | continuous-control, Continuous Control | Code Available | 1 |
| Dynamic value alignment through preference aggregation of multiple objectives | Oct 9, 2023 | Q-Learning | Unverified | 0 |
| DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Oct 8, 2023 | Autonomous Driving, Q-Learning | Code Available | 0 |
| Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network | Oct 7, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation | Oct 7, 2023 | Q-Learning | Unverified | 0 |
| Applying Reinforcement Learning to Option Pricing and Hedging | Oct 6, 2023 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC | Oct 5, 2023 | Model Predictive Control, Q-Learning | Unverified | 0 |
| PGDQN: Preference-Guided Deep Q-Network | Oct 3, 2023 | Atari Games, Benchmarking | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback | Oct 3, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed Bandits, Q-Learning | Unverified | 0 |
| Using Reinforcement Learning to Optimize Responses in Care Processes: A Case Study on Aggression Incidents | Oct 2, 2023 | Q-Learning | Unverified | 0 |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Oct 1, 2023 | D4RL, Deep Reinforcement Learning | Code Available | 0 |
| Reinforcement learning adaptive fuzzy controller for lighting systems: application to aircraft cabin | Sep 30, 2023 | Management, Q-Learning | Unverified | 0 |
| Multi-Bellman operator for convergence of Q-learning with linear function approximation | Sep 28, 2023 | Q-Learning | Unverified | 0 |
| Decoding trust: A reinforcement learning perspective | Sep 26, 2023 | Decision Making, Q-Learning | Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactual, Multi-agent Reinforcement Learning | Code Available | 1 |
| UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning | Sep 21, 2023 | Q-Learning | Unverified | 0 |
| Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions | Sep 20, 2023 | Deep Reinforcement Learning, Hyperparameter Optimization | Unverified | 0 |
| Differentiable Quantum Architecture Search for Quantum Reinforcement Learning | Sep 19, 2023 | Q-Learning, Quantum Machine Learning | Unverified | 0 |
| Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications | Sep 18, 2023 | Q-Learning | Unverified | 0 |
| Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | Sep 18, 2023 | Imitation Learning, Offline RL | Unverified | 0 |
| Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications | Sep 18, 2023 | Continual Learning, Deep Reinforcement Learning | Unverified | 0 |
| Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems | Sep 16, 2023 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration | Sep 13, 2023 | Decision Making, Q-Learning | Unverified | 0 |
| Dynamic control of self-assembly of quasicrystalline structures through reinforcement learning | Sep 13, 2023 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Sep 12, 2023 | D4RL, Offline RL | Code Available | 1 |
| A Q-learning Approach for Adherence-Aware Recommendations | Sep 12, 2023 | Q-Learning | Unverified | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | Sep 11, 2023 | Q-Learning, reinforcement-learning | Unverified | 0 |