| Exploratory Control with Tsallis Entropy for Latent Factor Models | Nov 14, 2022 | Q-Learning | Unverified | 0 |
| On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization | Nov 14, 2022 | Decision Making, Q-Learning | Unverified | 0 |
| Reinforcement Learning in Non-Markovian Environments | Nov 3, 2022 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari Games, Offline RL | Unverified | 0 |
| Deep Reinforcement Learning for Power Control in Next-Generation WiFi Network Systems | Nov 2, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| DynamicLight: Two-Stage Dynamic Traffic Signal Timing | Nov 2, 2022 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision Making, Q-Learning | Unverified | 0 |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Solving Continuous Control via Q-learning | Oct 22, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Sufficient Exploration for Convex Q-learning | Oct 17, 2022 | OpenAI Gym, Q-Learning | Unverified | 0 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RL, Offline RL | Code Available | 0 |
| Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time | Oct 14, 2022 | Q-Learning | Unverified | 0 |
| Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings | Oct 13, 2022 | Deep Reinforcement Learning, Indoor Localization | Unverified | 0 |
| Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient | Oct 13, 2022 | Montezuma's Revenge, Q-Learning | Code Available | 1 |
| Sustainable Online Reinforcement Learning for Auto-bidding | Oct 13, 2022 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | Oct 12, 2022 | Autonomous Vehicles, Q-Learning | Unverified | 0 |
| DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System | Oct 12, 2022 | Deep Learning, Fault Detection | Unverified | 0 |
| Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials | Oct 11, 2022 | Offline RL, Q-Learning | Code Available | 1 |
| Factors of Influence of the Overestimation Bias of Q-Learning | Oct 11, 2022 | Q-Learning | Code Available | 0 |
| Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | Oct 7, 2022 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | Oct 3, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision Making, Offline RL | Unverified | 0 |
| Bayesian Q-learning With Imperfect Expert Demonstrations | Oct 1, 2022 | Atari Games, Q-Learning | Unverified | 0 |
| Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot | Oct 1, 2022 | Q-Learning | Unverified | 0 |
| Application of Deep Q Learning with Simulation Results for Elevator Optimization | Sep 30, 2022 | Q-Learning | Unverified | 0 |
| Efficient LSTM Training with Eligibility Traces | Sep 30, 2022 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Robust Q-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty | Sep 30, 2022 | Q-Learning | Code Available | 1 |
| On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs | Sep 30, 2022 | Q-Learning | Unverified | 0 |
| Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning | Sep 28, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations | Sep 28, 2022 | Autonomous Driving, Edge-computing | Unverified | 0 |
| Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective | Sep 26, 2022 | Imitation Learning, Multi-Goal Reinforcement Learning | Unverified | 0 |
| Revisiting Discrete Soft Actor-Critic | Sep 21, 2022 | Atari Games, Q-Learning | Code Available | 1 |
| MAN: Multi-Action Networks Learning | Sep 19, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Comparative Study of Q-Learning and NeuroEvolution of Augmenting Topologies for Self Driving Agents | Sep 19, 2022 | Autonomous Driving, Evolutionary Algorithms | Unverified | 0 |
| MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning | Sep 17, 2022 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Learning-Based Cooperative P2P Power Trading between DC Nanogrid Clusters with Wind and PV Energy Resources | Sep 16, 2022 | energy trading, Management | Unverified | 0 |
| M^2DQN: A Robust Method for Accelerating Deep Q-learning Network | Sep 16, 2022 | Q-Learning, reinforcement-learning | Code Available | 0 |
| IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture | Sep 15, 2022 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Deep Reinforcement Learning for Task Offloading in UAV-Aided Smart Farm Networks | Sep 15, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Structured Q-learning For Antibody Design | Sep 10, 2022 | Combinatorial Optimization, Molecular Docking | Unverified | 0 |
| Route Planning for Last-Mile Deliveries Using Mobile Parcel Lockers: A Hybrid Q-Learning Network Approach | Sep 9, 2022 | Q-Learning | Code Available | 0 |
| Reward Delay Attacks on Deep Reinforcement Learning | Sep 8, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RL, Offline RL | Unverified | 0 |
| Double Q-Learning for Citizen Relocation During Natural Hazards | Sep 8, 2022 | Q-Learning | Unverified | 0 |
| On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs | Sep 7, 2022 | Open-Ended Question Answering, Q-Learning | Unverified | 0 |
| SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions | Sep 5, 2022 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning | Sep 1, 2022 | Board Games, Q-Learning | Unverified | 0 |
| Partial Counterfactual Identification for Infinite Horizon Partially Observable Markov Decision Process | Aug 31, 2022 | counterfactual, Q-Learning | Unverified | 0 |
| Direct Data-Driven Discrete-time Bilinear Biquadratic Regulator | Aug 29, 2022 | Q-Learning | Unverified | 0 |