SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 551600 of 1918 papers

TitleStatusHype
Deep-Dispatch: A Deep Reinforcement Learning-Based Vehicle Dispatch Algorithm for Advanced Air Mobility0
On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning0
Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks0
I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets InitiativesCode0
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and RegularizationCode0
Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement LearningCode0
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games0
Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications0
Efficient Parallel Reinforcement Learning Framework using the Reactor ModelCode0
An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems0
Provable Reinforcement Learning for Networked Control Systems with Stochastic Packet Disordering0
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing0
Algorithmic collusion under competitive design0
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control0
Anomaly Detection via Learning-Based Sequential Controlled Sensing0
Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks0
OpenSense: An Open-World Sensing Framework for Incremental Learning and Dynamic Sensor Scheduling on Embedded Edge Devices0
Reinforcement Learning from Diffusion Feedback: Q* for Image Search0
FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots0
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation0
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning0
Approximation of Convex Envelope Using Reinforcement Learning0
Learning to Cooperate and Communicate Over Imperfect Channels0
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery0
On optimal tracking portfolio in incomplete markets: The reinforcement learning approach0
Multi-intention Inverse Q-learning for Interpretable Behavior RepresentationCode0
Machine learning-based decentralized TDMA for VLC IoT networks0
Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion0
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets0
Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation : Minimizing makespan in permutation flow shop scheduling problems0
Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models0
Pointer Networks with Q-Learning for Combinatorial Optimization0
Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments0
DGFN: Double Generative Flow Networks0
Weakly Coupled Deep Q-Networks0
Lifting the Veil: Unlocking the Power of Depth in Q-learning0
Model-free Posterior Sampling via Learning Rate Randomization0
Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations0
Reinforcement learning based local path planning for mobile robot0
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ε-Greedy Exploration0
AI on the Water: Applying DRL to Autonomous Vessel Navigation0
Bad Values but Good Behavior: Learning Highly Misspecified Bandits and MDPs0
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy ApproachCode0
Integrated Sensing and Communication Neighbor Discovery for MANET with Gossip Mechanism0
Suppressing Overestimation in Q-Learning through Adversarial Behaviors0
Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning0
Dynamic value alignment through preference aggregation of multiple objectives0
DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather DataCode0
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network0
Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation0
Show:102550
← PrevPage 12 of 39Next →

No leaderboard results yet.