SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 901950 of 1918 papers

TitleStatusHype
Inverse Policy Evaluation for Value-based Sequential Decision-making0
A new convergent variant of Q-learning with linear function approximation0
Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle0
Investigating Reinforcement Learning Agents for Continuous State Space Environments0
Investigating the Edge of Stability Phenomenon in Reinforcement Learning0
Causal Mean Field Multi-Agent Reinforcement Learning0
Investigating the Properties of Neural Network Representations in Reinforcement Learning0
Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning0
IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture0
Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning0
Causal Deep Reinforcement Learning Using Observational Data0
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward0
A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization0
EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning0
Encoders and Decoders for Quantum Expander Codes Using Machine Learning0
Is Risk-Sensitive Reinforcement Learning Properly Resolved?0
"Jam Me If You Can'': Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications0
Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL0
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator0
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics0
Empirical Q-Value Iteration0
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS0
KAN v.s. MLP for Offline Reinforcement Learning0
Empirically Evaluating Multiagent Learning Algorithms0
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine0
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics0
K-spin Hamiltonian for quantum-resolvable Markov decision processes0
Language Inference with Multi-head Automata through Reinforcement Learning0
Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning0
Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring0
Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement Learning0
Catalytic evolution of cooperation in a population with behavioural bimodality0
Emergence of cooperation under punishment: A reinforcement learning perspective0
Emergence of Addictive Behaviors in Reinforcement Learning Agents0
CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network0
A Network Simulation of OTC Markets with Multiple Agents0
Learning Automata Based Q-learning for Content Placement in Cooperative Caching0
A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support0
Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia0
Learning Best Response Strategies for Agents in Ad Exchanges0
Learning Control for Air Hockey Striking using Deep Reinforcement Learning0
Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors0
Learning Dialog Policies from Weak Demonstrations0
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD0
Learning Explicit Credit Assignment for Multi-agent Joint Q-learning0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL0
Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing0
Elastic Decision Transformer0
Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach0
Show:102550
← PrevPage 19 of 39Next →

No leaderboard results yet.