SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 426450 of 1918 papers

TitleStatusHype
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and RegularizationCode0
Multi-Agent Reinforcement Learning via Distributed MPC as a Function ApproximatorCode1
Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications0
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games0
Efficient Parallel Reinforcement Learning Framework using the Reactor ModelCode0
An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems0
Algorithmic collusion under competitive design0
Provable Reinforcement Learning for Networked Control Systems with Stochastic Packet Disordering0
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing0
Anomaly Detection via Learning-Based Sequential Controlled Sensing0
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control0
OpenSense: An Open-World Sensing Framework for Incremental Learning and Dynamic Sensor Scheduling on Embedded Edge Devices0
Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks0
Reinforcement Learning from Diffusion Feedback: Q* for Image Search0
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation0
FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots0
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning0
Approximation of Convex Envelope Using Reinforcement Learning0
Learning to Cooperate and Communicate Over Imperfect Channels0
On optimal tracking portfolio in incomplete markets: The reinforcement learning approach0
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery0
Multi-intention Inverse Q-learning for Interpretable Behavior RepresentationCode0
Machine learning-based decentralized TDMA for VLC IoT networks0
Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion0
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets0
Show:102550
← PrevPage 18 of 77Next →

No leaderboard results yet.