| Applying Reinforcement Learning to Option Pricing and Hedging | Oct 6, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC | Oct 5, 2023 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback | Oct 3, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Using Reinforcement Learning to Optimize Responses in Care Processes: A Case Study on Aggression Incidents | Oct 2, 2023 | Q-Learning | —Unverified | 0 |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Oct 1, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement learning adaptive fuzzy controller for lighting systems: application to aircraft cabin | Sep 30, 2023 | ManagementQ-Learning | —Unverified | 0 |
| Multi-Bellman operator for convergence of Q-learning with linear function approximation | Sep 28, 2023 | Q-Learning | —Unverified | 0 |
| Decoding trust: A reinforcement learning perspective | Sep 26, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning | Sep 21, 2023 | Q-Learning | —Unverified | 0 |
| Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions | Sep 20, 2023 | Deep Reinforcement LearningHyperparameter Optimization | —Unverified | 0 |
| Differentiable Quantum Architecture Search for Quantum Reinforcement Learning | Sep 19, 2023 | Q-LearningQuantum Machine Learning | —Unverified | 0 |
| Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | Sep 18, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications | Sep 18, 2023 | Q-Learning | —Unverified | 0 |
| Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications | Sep 18, 2023 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems | Sep 16, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration | Sep 13, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Dynamic control of self-assembly of quasicrystalline structures through reinforcement learning | Sep 13, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| A Q-learning Approach for Adherence-Aware Recommendations | Sep 12, 2023 | Q-Learning | —Unverified | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | Sep 11, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convex Q Learning in a Stochastic Environment: Extended Version | Sep 10, 2023 | Q-Learning | —Unverified | 0 |
| Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm | Aug 28, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences | Aug 28, 2023 | Image ReconstructionQ-Learning | CodeCode Available | 0 |
| Traffic Light Control with Reinforcement Learning | Aug 28, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning | Aug 24, 2023 | Motion PlanningNavigate | —Unverified | 0 |
| Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi | Aug 20, 2023 | Game of HanabiMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games | Aug 17, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Battery Management in Dairy Farming | Aug 17, 2023 | ManagementQ-Learning | —Unverified | 0 |
| On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing | Aug 15, 2023 | Cloud ComputingCPU | —Unverified | 0 |
| A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control | Aug 10, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Variations on the Reinforcement Learning performance of Blackjack | Aug 9, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Q-Network for Stochastic Process Environments | Aug 7, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence | Aug 7, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Minimax Optimal Q Learning with Nearest Neighbors | Aug 3, 2023 | Q-Learning | —Unverified | 0 |
| Stability of Multi-Agent Learning: Convergence in Network Games with Many Players | Jul 26, 2023 | Q-Learning | —Unverified | 0 |
| Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation | Jul 24, 2023 | GPUQ-Learning | CodeCode Available | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | Jul 23, 2023 | CyberBattleSimQ-Learning | —Unverified | 0 |
| A Flexible Framework for Incorporating Patient Preferences Into Q-Learning | Jul 22, 2023 | Q-Learning | —Unverified | 0 |
| Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment | Jul 20, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks | Jul 18, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Meta-Value Learning: a General Framework for Learning with Learning Awareness | Jul 17, 2023 | Q-Learning | CodeCode Available | 0 |
| Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents | Jul 16, 2023 | Learning TheoryQ-Learning | —Unverified | 0 |
| Deep reinforcement learning for the dynamic vehicle dispatching problem: An event-based approach | Jul 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Realtime Spectrum Monitoring via Reinforcement Learning -- A Comparison Between Q-Learning and Heuristic Methods | Jul 11, 2023 | ManagementQ-Learning | —Unverified | 0 |
| Investigating the Edge of Stability Phenomenon in Reinforcement Learning | Jul 9, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| The Value of Chess Squares | Jul 8, 2023 | Game of ChessQ-Learning | —Unverified | 0 |
| Active Collection of Well-Being and Health Data in Mobile Devices | Jul 7, 2023 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |