| Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments | Oct 9, 2020 | Incremental LearningQ-Learning | CodeCode Available | 0 |
| Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and Methods | May 9, 2017 | Decision MakingQ-Learning | CodeCode Available | 0 |
| NARS vs. Reinforcement learning: ONA vs. Q-Learning | Dec 23, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces | Dec 1, 2019 | Privacy PreservingQ-Learning | CodeCode Available | 0 |
| Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces | Jan 30, 2019 | Privacy PreservingQ-Learning | CodeCode Available | 0 |
| A Multi-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games | Jul 5, 2024 | Q-Learning | CodeCode Available | 0 |
| Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation | Jun 12, 2024 | Q-Learning | CodeCode Available | 0 |
| Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning | Nov 19, 2018 | Active LearningQ-Learning | CodeCode Available | 0 |
| A Machine with Short-Term, Episodic, and Semantic Memory Systems | Dec 5, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis | Mar 25, 2022 | Medical Image AnalysisQ-Learning | CodeCode Available | 0 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari GamesBayesian Inference | CodeCode Available | 0 |
| Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters | Dec 1, 2019 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Robust Q-Learning for finite ambiguity sets | Jul 5, 2024 | Q-Learning | CodeCode Available | 0 |
| Cooperation between Independent Market Makers | Jun 11, 2022 | Q-Learning | CodeCode Available | 0 |
| Robust Q-Learning under Corrupted Rewards | Sep 5, 2024 | Q-Learning | CodeCode Available | 0 |
| Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks | Feb 10, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Active exploration in parameterized reinforcement learning | Oct 6, 2016 | Meta-LearningQ-Learning | CodeCode Available | 0 |
| Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero | May 28, 2019 | Combinatorial OptimizationGraph Neural Network | CodeCode Available | 0 |
| Control with adaptive Q-learning | Nov 3, 2020 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| The Mean-Squared Error of Double Q-Learning | Jul 9, 2020 | Q-Learning | CodeCode Available | 0 |
| Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning | Dec 10, 2023 | Q-Learning | CodeCode Available | 0 |
| Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs | May 26, 2025 | Imitation LearningQ-Learning | CodeCode Available | 0 |
| SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots | Aug 3, 2021 | Model Predictive ControlMotion Planning | CodeCode Available | 0 |
| Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning | Nov 30, 2021 | Autonomous VehiclesQ-Learning | CodeCode Available | 0 |
| Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning | Nov 24, 2020 | NavigateQ-Learning | CodeCode Available | 0 |
| Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments | Dec 19, 2023 | OpenAI GymPathfinder | CodeCode Available | 0 |
| Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | May 24, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives | Dec 12, 2023 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Assessing the Potential of Classical Q-learning in General Game Playing | Oct 14, 2018 | Board GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 |
| A Deep Recurrent Q Network towards Self-adapting Distributed Microservices architecture | Jan 13, 2019 | Decision MakingQ-Learning | CodeCode Available | 0 |
| ISL: A novel approach for deep exploration | Sep 13, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry | Jun 18, 2024 | Q-Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 |
| Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning | Nov 7, 2024 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Provably efficient RL with Rich Observations via Latent State Decoding | Jan 25, 2019 | ClusteringQ-Learning | CodeCode Available | 0 |
| Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems | Apr 19, 2018 | Q-LearningStochastic Optimization | CodeCode Available | 0 |
| Join Query Optimization with Deep Reinforcement Learning Algorithms | Nov 26, 2019 | AttributeDeep Reinforcement Learning | CodeCode Available | 0 |
| Visual Exploration and Energy-aware Path Planning via Reinforcement Learning | Sep 26, 2019 | Autonomous Vehiclesobject-detection | CodeCode Available | 0 |
| Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target | Jan 22, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Joint Path planning and Power Allocation of a Cellular-Connected UAV using Apprenticeship Learning via Deep Inverse Reinforcement Learning | Jun 15, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning | May 6, 2016 | Atari GamesFPS Games | CodeCode Available | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | ClassificationClassification with Costly Features | CodeCode Available | 0 |
| Active Collection of Well-Being and Health Data in Mobile Devices | Jul 7, 2023 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection | May 16, 2019 | feature selectionMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Taming the Noise in Reinforcement Learning via Soft Updates | Dec 28, 2015 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Topological Experience Replay | Mar 29, 2022 | Q-Learning | CodeCode Available | 0 |
| A Kernel Loss for Solving the Bellman Equation | May 25, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling | Sep 16, 2024 | Combinatorial Optimizationcounterfactual | CodeCode Available | 0 |