| Double Deep Q-Learning in Opponent Modeling | Nov 24, 2022 | Mixture-of-ExpertsQ-Learning | —Unverified | 0 |
| Learning Self-Awareness Models for Physical Layer Security in Cognitive and AI-enabled Radios | Nov 23, 2022 | Q-Learning | —Unverified | 0 |
| Reinforcement Causal Structure Learning on Order Graph | Nov 22, 2022 | Causal DiscoveryQ-Learning | —Unverified | 0 |
| Simultaneously Updating All Persistence Values in Reinforcement Learning | Nov 21, 2022 | AllAtari Games | —Unverified | 0 |
| Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks | Nov 21, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Credit-cognisant reinforcement learning for multi-agent cooperation | Nov 18, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit | Nov 18, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Reinforcement Learning Approach for Process Parameter Optimization in Additive Manufacturing | Nov 17, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Planning Irregular Object Packing via Hierarchical Reinforcement Learning | Nov 17, 2022 | Hierarchical Reinforcement LearningObject | —Unverified | 0 |
| Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning | Nov 16, 2022 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Exploratory Control with Tsallis Entropy for Latent Factor Models | Nov 14, 2022 | Q-Learning | —Unverified | 0 |
| On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization | Nov 14, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Reinforcement Learning in Non-Markovian Environments | Nov 3, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari GamesOffline RL | —Unverified | 0 |
| DynamicLight: Two-Stage Dynamic Traffic Signal Timing | Nov 2, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Deep Reinforcement Learning for Power Control in Next-Generation WiFi Network Systems | Nov 2, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Sufficient Exploration for Convex Q-learning | Oct 17, 2022 | OpenAI GymQ-Learning | —Unverified | 0 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time | Oct 14, 2022 | Q-Learning | —Unverified | 0 |
| Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings | Oct 13, 2022 | Deep Reinforcement LearningIndoor Localization | —Unverified | 0 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | Oct 12, 2022 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System | Oct 12, 2022 | Deep LearningFault Detection | —Unverified | 0 |
| Factors of Influence of the Overestimation Bias of Q-Learning | Oct 11, 2022 | Q-Learning | CodeCode Available | 0 |
| Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | Oct 7, 2022 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | Oct 3, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot | Oct 1, 2022 | Q-Learning | —Unverified | 0 |
| Bayesian Q-learning With Imperfect Expert Demonstrations | Oct 1, 2022 | Atari GamesQ-Learning | —Unverified | 0 |
| On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs | Sep 30, 2022 | Q-Learning | —Unverified | 0 |
| Application of Deep Q Learning with Simulation Results for Elevator Optimization | Sep 30, 2022 | Q-Learning | —Unverified | 0 |
| Efficient LSTM Training with Eligibility Traces | Sep 30, 2022 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning | Sep 28, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations | Sep 28, 2022 | Autonomous DrivingEdge-computing | —Unverified | 0 |
| Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective | Sep 26, 2022 | Imitation LearningMulti-Goal Reinforcement Learning | —Unverified | 0 |
| Comparative Study of Q-Learning and NeuroEvolution of Augmenting Topologies for Self Driving Agents | Sep 19, 2022 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 |
| MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning | Sep 17, 2022 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| M^2DQN: A Robust Method for Accelerating Deep Q-learning Network | Sep 16, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Reinforcement Learning-Based Cooperative P2P Power Trading between DC Nanogrid Clusters with Wind and PV Energy Resources | Sep 16, 2022 | energy tradingManagement | —Unverified | 0 |
| IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture | Sep 15, 2022 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Deep Reinforcement Learning for Task Offloading in UAV-Aided Smart Farm Networks | Sep 15, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Structured Q-learning For Antibody Design | Sep 10, 2022 | Combinatorial OptimizationMolecular Docking | —Unverified | 0 |
| Route Planning for Last-Mile Deliveries Using Mobile Parcel Lockers: A Hybrid Q-Learning Network Approach | Sep 9, 2022 | Q-Learning | CodeCode Available | 0 |
| Reward Delay Attacks on Deep Reinforcement Learning | Sep 8, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| Double Q-Learning for Citizen Relocation During Natural Hazards | Sep 8, 2022 | Q-Learning | —Unverified | 0 |
| On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs | Sep 7, 2022 | Open-Ended Question AnsweringQ-Learning | —Unverified | 0 |
| SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions | Sep 5, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |