rQdia: Regularizing Q-Value Distributions With Image Augmentation

2025-06-26Unverified0· sign in to hype

Sam Lerman, Jing Bi

Unverified — Be the first to reproduce this paper.

Abstract

rQdia regularizes Q-value distributions with augmented images in pixel-based deep reinforcement learning. With a simple auxiliary loss, that equalizes these distributions via MSE, rQdia boosts DrQ and SAC on 9/12 and 10/12 tasks respectively in the MuJoCo Continuous Control Suite from pixels, and Data-Efficient Rainbow on 18/26 Atari Arcade environments. Gains are measured in both sample efficiency and longer-term training. Moreover, the addition of rQdia finally propels model-free continuous control from pixels over the state encoding baseline.

Tasks

continuous-control Continuous Control Deep Reinforcement Learning Image Augmentation MuJoCo

rQdia: Regularizing Q-Value Distributions With Image Augmentation

Abstract

Tasks

Reproductions