| Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning | Feb 13, 2024 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Enhanced Deep Q-Learning for 2D Self-Driving Cars: Implementation and Evaluation on a Custom Track Environment | Feb 13, 2024 | Q-LearningSelf-Driving Cars | —Unverified | 0 |
| Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks | Feb 12, 2024 | Ensemble LearningManagement | CodeCode Available | 0 |
| Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks | Feb 10, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| ORIENT: A Priority-Aware Energy-Efficient Approach for Latency-Sensitive Applications in 6G | Feb 10, 2024 | Q-Learning | —Unverified | 0 |
| Federated Deep Q-Learning and 5G load balancing | Feb 10, 2024 | Q-Learning | —Unverified | 0 |
| Value function interference and greedy action selection in value-based multi-objective reinforcement learning | Feb 9, 2024 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| Attention-Enhanced Prioritized Proximal Policy Optimization for Adaptive Edge Caching | Feb 8, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning | Feb 8, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated LearningOffline RL | —Unverified | 0 |
| Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization | Feb 8, 2024 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks | Feb 7, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning | Feb 5, 2024 | D4RLQ-Learning | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | Feb 5, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| SQT -- std Q-target | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 |
| MinMaxMin Q-learning | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 |
| DRL-Based Dynamic Channel Access and SCLAR Maximization for Networks Under Jamming | Feb 2, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching | Feb 1, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway | Feb 1, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game | Feb 1, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles | Jan 31, 2024 | BlockingManagement | —Unverified | 0 |
| Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning | Jan 31, 2024 | Efficient ExplorationModel-based Reinforcement Learning | —Unverified | 0 |
| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Emergence of cooperation under punishment: A reinforcement learning perspective | Jan 29, 2024 | Imitation LearningQ-Learning | —Unverified | 0 |
| Regularized Q-Learning with Linear Function Approximation | Jan 26, 2024 | Decision Making Under UncertaintyQ-Learning | —Unverified | 0 |
| Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation | Jan 25, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability | Jan 21, 2024 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Information-Theoretic State Variable Selection for Reinforcement Learning | Jan 21, 2024 | Decision Makingfeature selection | CodeCode Available | 0 |
| REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes | Jan 16, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Jan 12, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | Jan 11, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | Jan 10, 2024 | Q-LearningRhythm | —Unverified | 0 |
| Deep Reinforcement Multi-agent Learning framework for Information Gathering with Local Gaussian Processes for Water Monitoring | Jan 9, 2024 | Deep Reinforcement LearningGaussian Processes | —Unverified | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | Jan 6, 2024 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids | Jan 5, 2024 | Q-LearningScheduling | —Unverified | 0 |
| The Best Time for an Update: Risk-Sensitive Minimization of Age-Based Metrics | Jan 3, 2024 | Q-Learning | —Unverified | 0 |
| Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | Jan 1, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach | Dec 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic | Dec 23, 2023 | ManagementQ-Learning | —Unverified | 0 |
| Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism | Dec 23, 2023 | Distributional Reinforcement LearningQ-Learning | —Unverified | 0 |
| Federated Q-Learning: Linear Regret Speedup with Low Communication Cost | Dec 22, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Maximum entropy GFlowNets with soft Q-learning | Dec 21, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Optimal coordination of resources: A solution from reinforcement learning | Dec 20, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos | Dec 19, 2023 | Q-Learning | —Unverified | 0 |
| Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments | Dec 19, 2023 | OpenAI GymPathfinder | CodeCode Available | 0 |
| Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge | Dec 19, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |