The Differentiable Cross-Entropy Method

2019-09-27ICML 2020Code Available0· sign in to hype

Brandon Amos, Denis Yarats

Code Available — Be the first to reproduce this paper.

Code

github.com/facebookresearch/dcem
OfficialIn paperpytorch★ 0

Abstract

We study the cross-entropy method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting this brings CEM inside of the end-to-end learning pipeline where this has otherwise been impossible. We show applications in a synthetic energy-based structured prediction task and in non-convex continuous control. In the control setting we show how to embed optimal action sequences into a lower-dimensional space. DCEM enables us to fine-tune CEM-based controllers with policy optimization.

Tasks

BIG-bench Machine Learning continuous-control Continuous Control Structured Prediction

The Differentiable Cross-Entropy Method

Code

Abstract

Tasks

Reproductions