| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 |
| Optimizing Returns Using the Hurst Exponent and Q Learning on Momentum and Mean Reversion Strategies | May 23, 2022 | Q-LearningTime Series | —Unverified | 0 |
| Reinforced Pedestrian Attribute Recognition with Group Optimization Reward | May 21, 2022 | AttributeDecision Making | —Unverified | 0 |
| Parallel bandit architecture based on laser chaos for reinforcement learning | May 19, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing | May 14, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Representation Learning for Context-Dependent Decision-Making | May 12, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Final Iteration Convergence Bound of Q-Learning: Switching System Approach | May 11, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Characterizing the Action-Generalization Gap in Deep Q-Learning | May 11, 2022 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Neuromimetic Linear Systems -- Resilience and Learning | May 10, 2022 | Combinatorial OptimizationQ-Learning | —Unverified | 0 |
| Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods | May 8, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Vehicle management in a modular production context using Deep Q-Learning | May 6, 2022 | Deep Reinforcement LearningJob Shop Scheduling | —Unverified | 0 |
| Chemoreception and chemotaxis of a three-sphere swimmer | May 5, 2022 | Q-Learning | —Unverified | 0 |
| Q-Learning Scheduler for Multi Task Learning Through the use of Histogram of Task Uncertainty | May 1, 2022 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Learning Value Functions from Undirected State-only Experience | Apr 26, 2022 | Future predictionImitation Learning | —Unverified | 0 |
| Graph Neural Network based Agent in Google Research Football | Apr 23, 2022 | Graph Neural NetworkQ-Learning | —Unverified | 0 |
| Provably Efficient Kernelized Q-Learning | Apr 21, 2022 | Q-Learning | —Unverified | 0 |
| Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics | Apr 20, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Efficient and practical quantum compiler towards multi-qubit systems with deep reinforcement learning | Apr 14, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Optimizing the Long-Term Behaviour of Deep Reinforcement Learning for Pushing and Grasping | Apr 7, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-learning with online random forests | Apr 7, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-learning of global optimizer of multiply model parameters for viscoelastic imaging | Apr 1, 2022 | Decision MakingDiagnostic | —Unverified | 0 |
| Neural Q-learning for solving PDEs | Mar 31, 2022 | Q-Learning | —Unverified | 0 |
| Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory | Mar 31, 2022 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| Investigating the Properties of Neural Network Representations in Reinforcement Learning | Mar 30, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Topological Experience Replay | Mar 29, 2022 | Q-Learning | CodeCode Available | 0 |
| Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis | Mar 25, 2022 | Medical Image AnalysisQ-Learning | CodeCode Available | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| The state-of-the-art review on resource allocation problem using artificial intelligence methods on various computing paradigms | Mar 23, 2022 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |
| A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle | Mar 22, 2022 | Q-Learning | —Unverified | 0 |
| Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks | Mar 22, 2022 | Q-Learning | CodeCode Available | 0 |
| Distributed Learning for Vehicular Dynamic Spectrum Access in Autonomous Driving | Mar 22, 2022 | Autonomous Drivingchannel selection | —Unverified | 0 |
| Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning | Mar 18, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Orchestrated Value Mapping for Reinforcement Learning | Mar 14, 2022 | Ensemble LearningQ-Learning | CodeCode Available | 0 |
| The Efficacy of Pessimism in Asynchronous Q-Learning | Mar 14, 2022 | Q-Learning | —Unverified | 0 |
| Reinforcement Learning for Optimal Control of a District Cooling Energy Plant | Mar 14, 2022 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | Mar 11, 2022 | BIG-bench Machine LearningManagement | —Unverified | 0 |
| Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery | Mar 8, 2022 | global-optimizationMotion Planning | —Unverified | 0 |
| Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility | Mar 7, 2022 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Target Network and Truncation Overcome The Deadly Triad in Q-Learning | Mar 5, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise | Mar 2, 2022 | Atari GamesDiversity | —Unverified | 0 |
| A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management | Mar 2, 2022 | ManagementQ-Learning | —Unverified | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation | Feb 26, 2022 | Edge-computingQ-Learning | —Unverified | 0 |
| Autonomous Warehouse Robot using Deep Q-Learning | Feb 21, 2022 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning | Feb 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations | Feb 17, 2022 | ClusteringQ-Learning | —Unverified | 0 |
| Goal Recognition as Reinforcement Learning | Feb 13, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Artificial Intelligence and Auction Design | Feb 12, 2022 | Q-Learning | —Unverified | 0 |
| Regularized Q-learning | Feb 11, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |