SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy that tells an agent which action to take in each state.
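As a rough illustration of what learning such a policy can look like, here is a minimal sketch of tabular Q-learning. The environment is a hypothetical five-state corridor invented for this example (action 0 moves left, action 1 moves right, and reaching the last state gives reward 1 and ends the episode). The function names, the hyperparameters (`alpha`, `gamma`, `epsilon`), and the episode count are all assumptions, not taken from any paper listed on this page.

```python
import random


def q_learning(n_states=5, n_actions=2, episodes=500,
               alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy corridor (hypothetical environment)."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]  # Q-table, all zeros
    goal = n_states - 1

    def step(state, action):
        # Action 0 moves left, action 1 moves right; walls clamp the position.
        next_state = max(0, state - 1) if action == 0 else min(goal, state + 1)
        reward = 1.0 if next_state == goal else 0.0
        return next_state, reward, next_state == goal

    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best next-state value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for every state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

With these settings, the greedy policy should pick action 1 (move right) in every non-terminal state. The entry for the terminal state is arbitrary because its Q-values are never updated.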

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 1551–1575 of 1918 papers

Title | Status | Hype
Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity | | 0
Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | | 0
A Graph Attention Learning Approach to Antenna Tilt Optimization | | 0
A Hybrid PAC Reinforcement Learning Algorithm | | 0
A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | | 0
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | | 0
Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions | | 0
AI on the Water: Applying DRL to Autonomous Vessel Navigation | | 0
A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks | | 0
A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows | | 0
A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management | | 0
Algorithmic Collusion and Price Discrimination: The Over-Usage of Data | | 0
Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning | | 0
Algorithmic Collusion under Observed Demand Shocks | | 0
Algorithmic Trading with Fitted Q Iteration and Heston Model | | 0
A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning | | 0
Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | | 0
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | | 0
A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | | 0
A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks | | 0
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning | | 0
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | | 0
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes | | 0
Amortized Noisy Channel Neural Machine Translation | | 0
Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways | | 0

No leaderboard results yet.