SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 851875 of 1918 papers

TitleStatusHype
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision0
Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization0
Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication0
A new convergent variant of Q-learning with linear function approximation0
Imagination-Limited Q-Learning for Offline Reinforcement Learning0
Imitating Language via Scalable Inverse Reinforcement Learning0
Implementing Inductive bias for different navigation tasks through diverse RNN attrractors0
Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle0
Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning0
Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication0
Causal Mean Field Multi-Agent Reinforcement Learning0
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search0
Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning0
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons0
Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning0
Improving Search through A3C Reinforcement Learning based Conversational Agent0
Causal Deep Reinforcement Learning Using Observational Data0
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action0
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward0
A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization0
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning0
EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning0
Encoders and Decoders for Quantum Expander Codes Using Machine Learning0
Show:102550
← PrevPage 35 of 77Next →

No leaderboard results yet.