| Convex Q Learning in a Stochastic Environment: Extended Version | Sep 10, 2023 | Q-Learning | —Unverified | 0 |
| Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences | Aug 28, 2023 | Image ReconstructionQ-Learning | CodeCode Available | 0 |
| Traffic Light Control with Reinforcement Learning | Aug 28, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm | Aug 28, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning | Aug 24, 2023 | Motion PlanningNavigate | —Unverified | 0 |
| Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi | Aug 20, 2023 | Game of HanabiMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games | Aug 17, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Battery Management in Dairy Farming | Aug 17, 2023 | ManagementQ-Learning | —Unverified | 0 |
| On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing | Aug 15, 2023 | Cloud ComputingCPU | —Unverified | 0 |
| A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control | Aug 10, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Variations on the Reinforcement Learning performance of Blackjack | Aug 9, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Q-Network for Stochastic Process Environments | Aug 7, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence | Aug 7, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Minimax Optimal Q Learning with Nearest Neighbors | Aug 3, 2023 | Q-Learning | —Unverified | 0 |
| Robust Multi-Agent Reinforcement Learning with State Uncertainty | Jul 30, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Stability of Multi-Agent Learning: Convergence in Network Games with Many Players | Jul 26, 2023 | Q-Learning | —Unverified | 0 |
| Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation | Jul 24, 2023 | GPUQ-Learning | CodeCode Available | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | Jul 23, 2023 | CyberBattleSimQ-Learning | —Unverified | 0 |
| A Flexible Framework for Incorporating Patient Preferences Into Q-Learning | Jul 22, 2023 | Q-Learning | —Unverified | 0 |
| Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment | Jul 20, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks | Jul 18, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Meta-Value Learning: a General Framework for Learning with Learning Awareness | Jul 17, 2023 | Q-Learning | CodeCode Available | 0 |
| Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents | Jul 16, 2023 | Learning TheoryQ-Learning | —Unverified | 0 |
| Deep reinforcement learning for the dynamic vehicle dispatching problem: An event-based approach | Jul 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Realtime Spectrum Monitoring via Reinforcement Learning -- A Comparison Between Q-Learning and Heuristic Methods | Jul 11, 2023 | ManagementQ-Learning | —Unverified | 0 |
| Investigating the Edge of Stability Phenomenon in Reinforcement Learning | Jul 9, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| The Value of Chess Squares | Jul 8, 2023 | Game of ChessQ-Learning | —Unverified | 0 |
| Active Collection of Well-Being and Health Data in Mobile Devices | Jul 7, 2023 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Offline Reinforcement Learning with Imbalanced Datasets | Jul 6, 2023 | D4RLOffline RL | —Unverified | 0 |
| Elastic Decision Transformer | Jul 5, 2023 | Atari GamesD4RL | —Unverified | 0 |
| Stability of Q-Learning Through Design and Optimism | Jul 5, 2023 | Q-Learning | —Unverified | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | Jul 5, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning | Jul 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Is Risk-Sensitive Reinforcement Learning Properly Resolved? | Jul 2, 2023 | Distributional Reinforcement LearningManagement | —Unverified | 0 |
| Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective | Jun 29, 2023 | Feature EngineeringQ-Learning | CodeCode Available | 0 |
| Evaluation of Reinforcement Learning Techniques for Trading on a Diverse Portfolio | Jun 28, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Continuous-time q-learning for mean-field control problems | Jun 28, 2023 | Q-Learning | —Unverified | 0 |
| Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning | Jun 27, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| RansomAI: AI-powered Ransomware for Stealthy Encryption | Jun 27, 2023 | Q-LearningRaspberry Pi 4 | —Unverified | 0 |
| Decentralized Multi-Robot Formation Control Using Reinforcement Learning | Jun 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | Jun 24, 2023 | Atari GamesDecision Making | —Unverified | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation | Jun 20, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm | Jun 17, 2023 | Atari GamesQ-Learning | —Unverified | 0 |
| Algorithmic Collusion in Auctions: Evidence from Controlled Laboratory Experiments | Jun 15, 2023 | Q-Learning | —Unverified | 0 |
| Joint Path planning and Power Allocation of a Cellular-Connected UAV using Apprenticeship Learning via Deep Inverse Reinforcement Learning | Jun 15, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Residual Q-Learning: Offline and Online Policy Customization without Value | Jun 15, 2023 | Imitation LearningQ-Learning | —Unverified | 0 |
| Privacy Risks in Reinforcement Learning for Household Robots | Jun 15, 2023 | Decision MakingFederated Learning | —Unverified | 0 |