| Numeric Reward Machines | Apr 30, 2024 | Q-Learning | —Unverified | 0 |
| Reinforcement Learning Problem Solving with Large Language Models | Apr 29, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Using Deep Q-Learning to Dynamically Toggle between Push/Pull Actions in Computational Trust Mechanisms | Apr 28, 2024 | Q-Learning | —Unverified | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Recursive Backwards Q-Learning in Deterministic Environments | Apr 24, 2024 | Q-Learning | —Unverified | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | Apr 24, 2024 | Q-LearningScheduling | —Unverified | 0 |
| AFU: Actor-Free critic Updates in off-policy RL for continuous control | Apr 24, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Unified ODE Analysis of Smooth Q-Learning Algorithms | Apr 20, 2024 | Q-Learning | —Unverified | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual LearningOffline RL | —Unverified | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPUOffline RL | —Unverified | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | Apr 12, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA | Apr 9, 2024 | Q-Learning | —Unverified | 0 |
| Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | Apr 9, 2024 | Q-LearningTraffic Signal Control | —Unverified | 0 |
| Deep Reinforcement Learning Control for Disturbance Rejection in a Nonlinear Dynamic System with Parametric Uncertainty | Apr 6, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Superior Genetic Algorithms for the Target Set Selection Problem Based on Power-Law Parameter Choices and Simple Greedy Heuristics | Apr 5, 2024 | Q-Learning | CodeCode Available | 0 |
| Data-Driven Knowledge Transfer in Batch Q^* Learning | Apr 1, 2024 | Decision MakingMarketing | —Unverified | 0 |
| Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning | Mar 31, 2024 | Atari GamesQ-Learning | —Unverified | 0 |
| EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning | Mar 29, 2024 | NavigateQ-Learning | —Unverified | 0 |
| From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Mar 27, 2024 | Autonomous NavigationDecision Making | CodeCode Available | 0 |
| Compressed Federated Reinforcement Learning with a Generative Model | Mar 26, 2024 | modelQ-Learning | CodeCode Available | 0 |
| DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | Mar 25, 2024 | AvgQ-Learning | —Unverified | 0 |
| Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints | Mar 25, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Mar 23, 2024 | FairnessQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Mar 20, 2024 | Autonomous DrivingQ-Learning | —Unverified | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Neural-Kernel Conditional Mean Embeddings | Mar 16, 2024 | Deep LearningDensity Estimation | —Unverified | 0 |
| A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning | Mar 14, 2024 | ManagementQ-Learning | —Unverified | 0 |
| Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | Mar 13, 2024 | Q-Learning | —Unverified | 0 |
| Strategizing against Q-learners: A Control-theoretical Approach | Mar 13, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | Mar 12, 2024 | Q-Learning | —Unverified | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Scalable Online Exploration via Coverability | Mar 11, 2024 | Efficient ExplorationQ-Learning | CodeCode Available | 0 |
| Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach | Mar 11, 2024 | Q-Learning | —Unverified | 0 |
| Algorithmic Collusion and Price Discrimination: The Over-Usage of Data | Mar 10, 2024 | Q-Learning | —Unverified | 0 |
| Enhancing Classification Performance via Reinforcement Learning for Feature Selection | Mar 9, 2024 | Classificationfeature selection | —Unverified | 0 |
| Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations | Mar 6, 2024 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | Mar 4, 2024 | Hierarchical Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| QF-tuner: Breaking Tradition in Reinforcement Learning | Feb 26, 2024 | OpenAI GymQ-Learning | —Unverified | 0 |
| SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning | Feb 20, 2024 | Imitation LearningQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning for Optimal Execution when Liquidity is Time-Varying | Feb 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling | Feb 19, 2024 | AvgMulti-agent Reinforcement Learning | —Unverified | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-LearningScheduling | —Unverified | 0 |
| Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | Feb 19, 2024 | modelQ-Learning | —Unverified | 0 |
| Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization | Feb 19, 2024 | counterfactualOpenAI Gym | —Unverified | 0 |
| Reinforcement learning to maximise wind turbine energy generation | Feb 17, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks | Feb 14, 2024 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Intelligent Agricultural Management Considering N_2O Emission and Climate Variability with Uncertainties | Feb 13, 2024 | Decision MakingManagement | —Unverified | 0 |