Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

2020-06-15NeurIPS 2020Code Available1· sign in to hype

Yaodong Yu, Kwan Ho Ryan Chan, Chong You, Chaobing Song, Yi Ma

Code Available — Be the first to reproduce this paper.

Code

github.com/ryanchankh/mcr2
OfficialIn paperpytorch★ 203
github.com/Ma-Lab-Berkeley/MCR2
pytorch★ 86

Abstract

To learn intrinsic low-dimensional structures from high-dimensional data that most discriminate between classes, we propose the principle of Maximal Coding Rate Reduction (MCR^2), an information-theoretic measure that maximizes the coding rate difference between the whole dataset and the sum of each individual class. We clarify its relationships with most existing frameworks such as cross-entropy, information bottleneck, information gain, contractive and contrastive learning, and provide theoretical guarantees for learning diverse and discriminative features. The coding rate can be accurately computed from finite samples of degenerate subspace-like distributions and can learn intrinsic representations in supervised, self-supervised, and unsupervised settings in a unified manner. Empirically, the representations learned using this principle alone are significantly more robust to label corruptions in classification than those using cross-entropy, and can lead to state-of-the-art results in clustering mixed data from self-learned invariant features.

Tasks

Clustering Contrastive Learning Image Clustering

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
STL-10	MCR2	Accuracy	0.49	—	Unverified

Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

Code

Abstract

Tasks

Benchmark Results

Reproductions