SOTAVerified

Q-Learning

Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
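
To make the description above concrete, here is a minimal sketch of tabular Q-learning. It is not taken from any paper listed on this page. The corridor environment, the function names `q_learning` and `greedy_policy`, and the hyperparameter values are all assumptions chosen for illustration.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor (illustrative only).

    States are 0..n_states-1; actions are 0 (left) and 1 (right).
    Entering the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Deterministic transition; moving left from state 0 stays put.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning target: reward plus the discounted best next value.
            # The terminal state contributes no future value.
            if next_state == goal:
                target = reward
            else:
                target = reward + gamma * max(q[next_state])

            q[state][action] += alpha * (target - q[state][action])
            state = next_state

    return q


def greedy_policy(q):
    """Return the index of the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` yields one action per state. In this corridor the learned policy should move right from every non-terminal state; the entry for the terminal state is never trained and carries no meaning.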

Papers

Showing 401–450 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Code | 0 |
| Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | | 0 |
| Graph Q-Learning for Combinatorial Optimization | | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | | 0 |
| Deep Reinforcement Multi-agent Learning framework for Information Gathering with Local Gaussian Processes for Water Monitoring | | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Code | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Code | 0 |
| An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | | 0 |
| A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids | | 0 |
| The Best Time for an Update: Risk-Sensitive Minimization of Age-Based Metrics | | 0 |
| Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | | 0 |
| Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach | | 0 |
| Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism | | 0 |
| Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic | | 0 |
| Federated Q-Learning: Linear Regret Speedup with Low Communication Cost | | 0 |
| Maximum entropy GFlowNets with soft Q-learning | | 0 |
| Optimal coordination of resources: A solution from reinforcement learning | | 0 |
| Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments | Code | 0 |
| Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge | Code | 0 |
| Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos | | 0 |
| Deep-Dispatch: A Deep Reinforcement Learning-Based Vehicle Dispatch Algorithm for Advanced Air Mobility | | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | | 0 |
| Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks | | 0 |
| I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives | Code | 0 |
| Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning | Code | 0 |
| Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization | Code | 0 |
| Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator | Code | 1 |
| Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications | | 0 |
| Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games | | 0 |
| Efficient Parallel Reinforcement Learning Framework using the Reactor Model | Code | 0 |
| An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems | | 0 |
| Algorithmic collusion under competitive design | | 0 |
| Provable Reinforcement Learning for Networked Control Systems with Stochastic Packet Disordering | | 0 |
| A Q-learning approach to the continuous control problem of robot inverted pendulum balancing | | 0 |
| Anomaly Detection via Learning-Based Sequential Controlled Sensing | | 0 |
| Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control | | 0 |
| OpenSense: An Open-World Sensing Framework for Incremental Learning and Dynamic Sensor Scheduling on Embedded Edge Devices | | 0 |
| Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks | | 0 |
| Reinforcement Learning from Diffusion Feedback: Q* for Image Search | | 0 |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | | 0 |
| FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots | | 0 |
| Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning | | 0 |
| Approximation of Convex Envelope Using Reinforcement Learning | | 0 |
| Learning to Cooperate and Communicate Over Imperfect Channels | | 0 |
| On optimal tracking portfolio in incomplete markets: The reinforcement learning approach | | 0 |
| Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery | | 0 |
| Multi-intention Inverse Q-learning for Interpretable Behavior Representation | Code | 0 |
| Machine learning-based decentralized TDMA for VLC IoT networks | | 0 |
| Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion | | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | | 0 |
Page 9 of 39

No leaderboard results yet.