SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 601625 of 1918 papers

TitleStatusHype
Applying Reinforcement Learning to Option Pricing and Hedging0
Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC0
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation0
A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback0
Using Reinforcement Learning to Optimize Responses in Care Processes: A Case Study on Aggression Incidents0
Pre-training with Synthetic Data Helps Offline Reinforcement LearningCode0
Reinforcement learning adaptive fuzzy controller for lighting systems: application to aircraft cabin0
Multi-Bellman operator for convergence of Q-learning with linear function approximation0
Decoding trust: A reinforcement learning perspective0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning0
Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions0
Differentiable Quantum Architecture Search for Quantum Reinforcement Learning0
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions0
Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications0
Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications0
Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems0
Dynamic control of self-assembly of quasicrystalline structures through reinforcement learningCode0
Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration0
A Q-learning Approach for Adherence-Aware Recommendations0
Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach0
Convex Q Learning in a Stochastic Environment: Extended Version0
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning0
Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks0
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic ArmCode0
Show:102550
← PrevPage 25 of 77Next →

No leaderboard results yet.