SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 601650 of 1918 papers

TitleStatusHype
Applying Reinforcement Learning to Option Pricing and Hedging0
Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC0
A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback0
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation0
Using Reinforcement Learning to Optimize Responses in Care Processes: A Case Study on Aggression Incidents0
Pre-training with Synthetic Data Helps Offline Reinforcement LearningCode0
Reinforcement learning adaptive fuzzy controller for lighting systems: application to aircraft cabin0
Multi-Bellman operator for convergence of Q-learning with linear function approximation0
Decoding trust: A reinforcement learning perspective0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning0
Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions0
Differentiable Quantum Architecture Search for Quantum Reinforcement Learning0
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions0
Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications0
Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications0
Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems0
Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration0
Dynamic control of self-assembly of quasicrystalline structures through reinforcement learningCode0
A Q-learning Approach for Adherence-Aware Recommendations0
Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach0
Convex Q Learning in a Stochastic Environment: Extended Version0
Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks0
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning0
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic ArmCode0
Reinforcement Learning for Sampling on Temporal Medical Imaging SequencesCode0
Traffic Light Control with Reinforcement LearningCode0
Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning0
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of HanabiCode0
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games0
Reinforcement Learning for Battery Management in Dairy Farming0
On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing0
A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control0
Variations on the Reinforcement Learning performance of BlackjackCode0
Deep Q-Network for Stochastic Process Environments0
Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence0
Minimax Optimal Q Learning with Nearest Neighbors0
Stability of Multi-Agent Learning: Convergence in Network Games with Many Players0
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel SimulationCode0
Adversarial Agents For Attacking Inaudible Voice Activated Devices0
A Flexible Framework for Incorporating Patient Preferences Into Q-Learning0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environmentCode0
Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks0
Meta-Value Learning: a General Framework for Learning with Learning AwarenessCode0
Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents0
Deep reinforcement learning for the dynamic vehicle dispatching problem: An event-based approach0
Realtime Spectrum Monitoring via Reinforcement Learning -- A Comparison Between Q-Learning and Heuristic Methods0
Investigating the Edge of Stability Phenomenon in Reinforcement Learning0
The Value of Chess Squares0
Active Collection of Well-Being and Health Data in Mobile DevicesCode0
Show:102550
← PrevPage 13 of 39Next →

No leaderboard results yet.