| A Statistical Analysis of Polyak-Ruppert Averaged Q-learning | Dec 29, 2021 | Q-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 | 5 |
| Control with adaptive Q-learning | Nov 3, 2020 | OpenAI GymQ-Learning | CodeCode Available | 0 | 5 |
| Adversarial Learning of a Sampler Based on an Unnormalized Distribution | Jan 3, 2019 | FormQ-Learning | CodeCode Available | 0 | 5 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Control of Probabilistic Boolean Networks | Sep 7, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Mar 23, 2024 | FairnessQ-Learning | CodeCode Available | 0 | 5 |
| Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation | Jun 12, 2024 | Q-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Provably efficient RL with Rich Observations via Latent State Decoding | Jan 25, 2019 | ClusteringQ-Learning | CodeCode Available | 0 | 5 |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Oct 27, 2024 | D4RLQ-Learning | CodeCode Available | 0 | 5 |
| QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | BenchmarkingModel-based Reinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 | 5 |
| Q-Learning Lagrange Policies for Multi-Action Restless Bandits | Jun 22, 2021 | Multi-Armed BanditsQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Quality-Value (DQV) Learning | Sep 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Oct 8, 2023 | Autonomous DrivingQ-Learning | CodeCode Available | 0 | 5 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision MakingQ-Learning | CodeCode Available | 0 | 5 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Deep Q-learning from Demonstrations | Apr 12, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft | Mar 11, 2019 | MinecraftQ-Learning | CodeCode Available | 0 | 5 |
| Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences | Aug 28, 2023 | Image ReconstructionQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 | 5 |
| Deep Q-learning: a robust control approach | Jan 21, 2022 | OpenAI GymQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q learning for fooling neural networks | Nov 13, 2018 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 | 5 |
| A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | May 25, 2021 | PositionQ-Learning | CodeCode Available | 0 | 5 |
| Deep Ordinal Reinforcement Learning | May 6, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Revisiting Prioritized Experience Replay: A Value Perspective | Feb 5, 2021 | Atari GamesQ-Learning | CodeCode Available | 0 | 5 |
| Revisiting the Softmax Bellman Operator: New Benefits and New Perspective | Dec 2, 2018 | Atari GamesQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q-Learning for Nash Equilibria: Nash-DQN | Apr 23, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Automatic Data Augmentation by Learning the Deterministic Policy | Oct 18, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Crowd Intelligence for Early Misinformation Prediction on Social Media | Aug 8, 2024 | Fact CheckingMisinformation | CodeCode Available | 0 | 5 |
| A Kernel Loss for Solving the Bellman Equation | May 25, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots | Aug 3, 2021 | Model Predictive ControlMotion Planning | CodeCode Available | 0 | 5 |
| CytonRL: an Efficient Reinforcement Learning Open-source Toolkit Implemented in C++ | Apr 14, 2018 | GPUQ-Learning | CodeCode Available | 0 | 5 |
| Automata Learning meets Shielding | Dec 4, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Adaptive Symmetric Reward Noising for Reinforcement Learning | May 24, 2019 | Autonomous DrivingQ-Learning | CodeCode Available | 0 | 5 |
| Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera | Mar 13, 2023 | Privacy PreservingQ-Learning | CodeCode Available | 0 | 5 |
| SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference | Oct 15, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution | Jul 2, 2015 | Q-LearningSelf-Learning | CodeCode Available | 0 | 5 |
| Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings | Oct 29, 2020 | Change Point DetectionOff-policy evaluation | CodeCode Available | 0 | 5 |
| Decoding fairness: a reinforcement learning perspective | Dec 20, 2024 | FairnessImitation Learning | CodeCode Available | 0 | 5 |
| Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem | Sep 9, 2021 | Car RacingQ-Learning | CodeCode Available | 0 | 5 |
| Dynamic-Weighted Simplex Strategy for Learning Enabled Cyber Physical Systems | Feb 6, 2019 | Autonomous DrivingQ-Learning | CodeCode Available | 0 | 5 |
| Augmented Q Imitation Learning (AQIL) | Mar 31, 2020 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 | 5 |
| Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning | Nov 30, 2021 | Autonomous VehiclesQ-Learning | CodeCode Available | 0 | 5 |
| Deep Coordination Graphs | Sep 27, 2019 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |