SOTAVerified

Q-Learning

Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
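To make the idea of learning a policy concrete, here is a minimal sketch of tabular Q-learning. It is an illustration only, not taken from any paper listed on this page. The corridor environment, the function names `q_learning` and `greedy_policy`, and all hyperparameter values are assumptions chosen for the example.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Reaching the rightmost state ends the episode with reward 1.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy: explore with probability epsilon (or on ties),
            # otherwise take the action with the larger Q-value.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # The terminal state has no future value.
            best_next = 0.0 if next_state == goal else max(q[next_state])
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` gives the learned policy as one action per state. In this toy setting the agent should prefer moving right in every non-terminal state; the entry for the goal state is never updated and carries no meaning.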

Papers

Showing 851-900 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning | | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | | 0 |
| Enhanced Deep Q-Learning for 2D Self-Driving Cars: Implementation and Evaluation on a Custom Track Environment | | 0 |
| Energy Sharing for Multiple Sensor Nodes with Finite Buffers | | 0 |
| Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision | | 0 |
| Imagination-Limited Q-Learning for Offline Reinforcement Learning | | 0 |
| Imitating Language via Scalable Inverse Reinforcement Learning | | 0 |
| Implementing Inductive bias for different navigation tasks through diverse RNN attrractors | | 0 |
| Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization | | 0 |
| Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning | | 0 |
| Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication | | 0 |
| Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication | | 0 |
| Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search | | 0 |
| A new convergent variant of Q-learning with linear function approximation | | 0 |
| Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons | | 0 |
| Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle | | 0 |
| Improving Search through A3C Reinforcement Learning based Conversational Agent | | 0 |
| Causal Mean Field Multi-Agent Reinforcement Learning | | 0 |
| I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action | | 0 |
| Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning | | 0 |
| Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning | | 0 |
| Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning | | 0 |
| Causal Deep Reinforcement Learning Using Observational Data | | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | | 0 |
| Information Theoretic Model Predictive Q-Learning | | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | | 0 |
| In Hindsight: A Smooth Reward for Steady Exploration | | 0 |
| In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning | | 0 |
| Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning | | 0 |
| EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning | | 0 |
| Encoders and Decoders for Quantum Expander Codes Using Machine Learning | | 0 |
| Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations | | 0 |
| Integrated Sensing and Communication Neighbor Discovery for MANET with Gossip Mechanism | | 0 |
| Integrated trucks assignment and scheduling problem with mixed service mode docks: A Q-learning based adaptive large neighborhood search algorithm | | 0 |
| Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments | | 0 |
| Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms | | 0 |
| Intelligent Agricultural Management Considering N₂O Emission and Climate Variability with Uncertainties | | 0 |
| Intelligent Autonomous Intersection Management | | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | | 0 |
| Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning | | 0 |
| Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping | | 0 |
| Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving | | 0 |
| Interactive Learning from Natural Language and Demonstrations using Signal Temporal Logic | | 0 |
| Empirical Q-Value Iteration | | 0 |
| Internet of Things Applications: Animal Monitoring with Unmanned Aerial Vehicle | | 0 |
| Deep Constrained Q-learning | | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | | 0 |

No leaderboard results yet.