Bregman Centroid Guided Cross-Entropy Method

2025-06-02Unverified0· sign in to hype

Yuliang Gu, Hongpeng Cao, Marco Caccamo, Naira Hovakimyan

Unverified — Be the first to reproduce this paper.

Abstract

The Cross-Entropy Method (CEM) is a widely adopted trajectory optimizer in model-based reinforcement learning (MBRL), but its unimodal sampling strategy often leads to premature convergence in multimodal landscapes. In this work, we propose Bregman Centroid Guided CEM (BC-EvoCEM), a lightweight enhancement to ensemble CEM that leverages Bregman centroids for principled information aggregation and diversity control. BC-EvoCEM computes a performance-weighted Bregman centroid across CEM workers and updates the least contributing ones by sampling within a trust region around the centroid. Leveraging the duality between Bregman divergences and exponential family distributions, we show that BC-EvoCEM integrates seamlessly into standard CEM pipelines with negligible overhead. Empirical results on synthetic benchmarks, a cluttered navigation task, and full MBRL pipelines demonstrate that BC-EvoCEM enhances both convergence and solution quality, providing a simple yet effective upgrade for CEM.

Tasks

Diversity Model-based Reinforcement Learning

Bregman Centroid Guided Cross-Entropy Method

Abstract

Tasks

Reproductions