SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 851900 of 1918 papers

TitleStatusHype
A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management0
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity0
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation0
Autonomous Warehouse Robot using Deep Q-Learning0
PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations0
Goal Recognition as Reinforcement LearningCode0
Artificial Intelligence and Auction Design0
Microservice Deployment in Edge Computing Based on Deep Q LearningCode1
Regularized Q-learning0
Transferred Q-learning0
Intelligent Autonomous Intersection Management0
Multiple Correlated Jammers Nullification using LSTM-based Deep Dueling Neural Network0
Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning0
A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise0
Deep Q-learning: a robust control approachCode0
Optimal variance-reduced stochastic approximation in Banach spaces0
Deep Reinforcement Learning with Spiking Q-learning0
Addressing Maximization Bias in Reinforcement Learning with Two-Sample TestingCode1
A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning0
Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning0
Task Independent Capsule-Based Agents for Deep Q-Learning0
Age-of-information minimization via opportunistic sampling by an energy harvesting source0
Sales Time Series Analytics Using Deep Q-Learning0
Reinforcement Learning for Task Specifications with Action-Constraints0
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning0
A Statistical Analysis of Polyak-Ruppert Averaged Q-learningCode0
A Graph Attention Learning Approach to Antenna Tilt Optimization0
Task and Model Agnostic Adversarial Attack on Graph Neural NetworksCode0
Safety and Liveness Guarantees through Reach-Avoid Reinforcement LearningCode1
Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach0
Amortized Noisy Channel Neural Machine Translation0
Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games0
Teaching a Robot to Walk Using Reinforcement Learning0
Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control0
Quantum Architecture Search via Continual Reinforcement Learning0
High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning0
Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market0
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical PerspectivesCode1
Convergence Results For Q-Learning With Experience Replay0
Application of Deep Reinforcement Learning to Payment Fraud0
Replay For Safety0
Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi0
A Risk-Averse Preview-based Q-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles0
Regularized Softmax Deep Multi-Agent Q-LearningCode1
Finite Sample Analysis of Average-Reward TD Learning and Q-Learning0
Faster Non-asymptotic Convergence for Double Q-learning0
Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learningCode0
Continuous Control With Ensemble Deep Deterministic Policy GradientsCode0
DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks0
Show:102550
← PrevPage 18 of 39Next →

No leaderboard results yet.