SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 751800 of 1918 papers

TitleStatusHype
Logit-Q Dynamics for Efficient Learning in Stochastic Teams0
Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels0
A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning0
Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading0
MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework0
Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage0
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition0
Best Possible Q-Learning0
Sample Complexity of Kernel-Based Q-Learning0
The impact of surplus sharing on the outcomes of specific investments under negotiated transfer pricing: An agent-based simulation with fuzzy Q-learning agents0
Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense0
RCsearcher: Reaction Center Identification in Retrosynthesis via Deep Q-Learning0
Single-Trajectory Distributionally Robust Reinforcement Learning0
Learning from Multiple Independent Advisors in Multi-agent Reinforcement LearningCode0
FedHQL: Federated Heterogeneous Q-Learning0
Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics0
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets0
Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures0
Decentralized model-free reinforcement learning in stochastic games with average-reward objective0
Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity0
Multi-Power Level Q-Learning Algorithm for Random Access in NOMA mMTC Systems0
Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning0
Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning0
Contextual Conservative Q-Learning for Offline Reinforcement Learning0
Deep Spectral Q-learning with Application to Mobile Health0
NARS vs. Reinforcement learning: ONA vs. Q-LearningCode0
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse0
Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement LearningCode0
Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation0
Taming Lagrangian Chaos with Multi-Objective Reinforcement Learning0
Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling0
Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNetCode0
VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation0
Frugal Reinforcement-based Active Learning0
Reinforcement Learning for Resilient Power Grids0
PALMER: Perception-Action Loop with Memory for Long-Horizon Planning0
EASpace: Enhanced Action Space for Policy TransferCode0
A Machine with Short-Term, Episodic, and Semantic Memory SystemsCode0
Automata Learning meets ShieldingCode0
Welfare and Fairness in Multi-objective Reinforcement LearningCode0
Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning0
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning0
QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols0
Causal Deep Reinforcement Learning Using Observational Data0
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes0
UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms0
Explainable and Safe Reinforcement Learning for Autonomous Air MobilityCode0
Show:102550
← PrevPage 16 of 39Next →

No leaderboard results yet.