Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 1918 papers

Title	Date	Tasks	Status	Hype
Convex Q Learning in a Stochastic Environment: Extended Version	Sep 10, 2023	Q-Learning	—Unverified	0
Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks	Aug 31, 2023	Deep Reinforcement LearningQ-Learning	—Unverified	0
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning	Aug 31, 2023	Deep Reinforcement LearningQ-Learning	—Unverified	0
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences	Aug 28, 2023	Image ReconstructionQ-Learning	CodeCode Available	0
Traffic Light Control with Reinforcement Learning	Aug 28, 2023	Q-Learningreinforcement-learning	CodeCode Available	0
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm	Aug 28, 2023	Deep Reinforcement LearningQ-Learning	CodeCode Available	0
Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning	Aug 24, 2023	Motion PlanningNavigate	—Unverified	0
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi	Aug 20, 2023	Game of HanabiMulti-agent Reinforcement Learning	CodeCode Available	0
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games	Aug 17, 2023	Multi-agent Reinforcement LearningQ-Learning	—Unverified	0
Reinforcement Learning for Battery Management in Dairy Farming	Aug 17, 2023	ManagementQ-Learning	—Unverified	0
On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing	Aug 15, 2023	Cloud ComputingCPU	—Unverified	0
A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control	Aug 10, 2023	Deep Reinforcement LearningQ-Learning	—Unverified	0
Variations on the Reinforcement Learning performance of Blackjack	Aug 9, 2023	Q-Learningreinforcement-learning	CodeCode Available	0
Deep Q-Network for Stochastic Process Environments	Aug 7, 2023	Q-Learningreinforcement-learning	—Unverified	0
Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence	Aug 7, 2023	Multi-agent Reinforcement LearningQ-Learning	—Unverified	0
Minimax Optimal Q Learning with Nearest Neighbors	Aug 3, 2023	Q-Learning	—Unverified	0
Robust Multi-Agent Reinforcement Learning with State Uncertainty	Jul 30, 2023	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available	1
Stability of Multi-Agent Learning: Convergence in Network Games with Many Players	Jul 26, 2023	Q-Learning	—Unverified	0
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation	Jul 24, 2023	GPUQ-Learning	CodeCode Available	0
Adversarial Agents For Attacking Inaudible Voice Activated Devices	Jul 23, 2023	CyberBattleSimQ-Learning	—Unverified	0
A Flexible Framework for Incorporating Patient Preferences Into Q-Learning	Jul 22, 2023	Q-Learning	—Unverified	0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment	Jul 20, 2023	continuous-controlContinuous Control	CodeCode Available	0
Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks	Jul 18, 2023	Q-LearningReinforcement Learning (RL)	—Unverified	0
Meta-Value Learning: a General Framework for Learning with Learning Awareness	Jul 17, 2023	Q-Learning	CodeCode Available	0
Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents	Jul 16, 2023	Learning TheoryQ-Learning	—Unverified	0
Deep reinforcement learning for the dynamic vehicle dispatching problem: An event-based approach	Jul 13, 2023	Deep Reinforcement LearningQ-Learning	—Unverified	0
Realtime Spectrum Monitoring via Reinforcement Learning -- A Comparison Between Q-Learning and Heuristic Methods	Jul 11, 2023	ManagementQ-Learning	—Unverified	0
Investigating the Edge of Stability Phenomenon in Reinforcement Learning	Jul 9, 2023	Q-Learningreinforcement-learning	—Unverified	0
The Value of Chess Squares	Jul 8, 2023	Game of ChessQ-Learning	—Unverified	0
Active Collection of Well-Being and Health Data in Mobile Devices	Jul 7, 2023	Q-LearningReinforcement Learning (RL)	CodeCode Available	0
Offline Reinforcement Learning with Imbalanced Datasets	Jul 6, 2023	D4RLOffline RL	—Unverified	0
Elastic Decision Transformer	Jul 5, 2023	Atari GamesD4RL	—Unverified	0
Stability of Q-Learning Through Design and Optimism	Jul 5, 2023	Q-Learning	—Unverified	0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning	Jul 5, 2023	Offline RLQ-Learning	—Unverified	0
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning	Jul 3, 2023	Q-Learningreinforcement-learning	—Unverified	0
Is Risk-Sensitive Reinforcement Learning Properly Resolved?	Jul 2, 2023	Distributional Reinforcement LearningManagement	—Unverified	0
Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective	Jun 29, 2023	Feature EngineeringQ-Learning	CodeCode Available	0
Evaluation of Reinforcement Learning Techniques for Trading on a Diverse Portfolio	Jun 28, 2023	Q-Learningreinforcement-learning	—Unverified	0
Continuous-time q-learning for mean-field control problems	Jun 28, 2023	Q-Learning	—Unverified	0
Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning	Jun 27, 2023	Decision MakingQ-Learning	—Unverified	0
RansomAI: AI-powered Ransomware for Stealthy Encryption	Jun 27, 2023	Q-LearningRaspberry Pi 4	—Unverified	0
Decentralized Multi-Robot Formation Control Using Reinforcement Learning	Jun 26, 2023	Q-Learningreinforcement-learning	—Unverified	0
Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query	Jun 24, 2023	Atari GamesDecision Making	—Unverified	0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback	Jun 20, 2023	MuJoCoQ-Learning	—Unverified	0
Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation	Jun 20, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm	Jun 17, 2023	Atari GamesQ-Learning	—Unverified	0
Algorithmic Collusion in Auctions: Evidence from Controlled Laboratory Experiments	Jun 15, 2023	Q-Learning	—Unverified	0
Joint Path planning and Power Allocation of a Cellular-Connected UAV using Apprenticeship Learning via Deep Inverse Reinforcement Learning	Jun 15, 2023	Deep Reinforcement LearningQ-Learning	CodeCode Available	0
Residual Q-Learning: Offline and Online Policy Customization without Value	Jun 15, 2023	Imitation LearningQ-Learning	—Unverified	0
Privacy Risks in Reinforcement Learning for Household Robots	Jun 15, 2023	Decision MakingFederated Learning	—Unverified	0

Show:10 25 50

← PrevPage 11 of 39Next →

No leaderboard results yet.