SOTAVerified

Self-Supervised Image Classification

This is the task of image classification using representations learnt with self-supervised learning. Self-supervised methods generally involve a pretext task that is solved to learn a good representation and a loss function to learn with. One example of a loss function is an autoencoder based loss where the goal is reconstruction of an image pixel-by-pixel. A more popular recent example is a contrastive loss, which measure the similarity of sample pairs in a representation space, and where there can be a varying target instead of a fixed target to reconstruct (as in the case of autoencoders).

A common evaluation protocol is to train a linear classifier on top of (frozen) representations learnt by self-supervised methods. The leaderboards for the linear evaluation protocol can be found below. In practice, it is more common to fine-tune features on a downstream task. An alternative evaluation protocol therefore uses semi-supervised learning and finetunes on a % of the labels. The leaderboards for the finetuning protocol can be accessed here.

You may want to read some blog posts before reading the papers and checking the leaderboards:

Contrastive Self-Supervised Learning - Ankesh Anand
The Illustrated Self-Supervised Learning - Amit Chaudhary
Self-supervised learning and computer vision - Jeremy Howard
Self-Supervised Representation Learning - Lilian Weng

There is also Yann LeCun's talk at AAAI-20 which you can watch here (35:00+).

( Image credit: A Simple Framework for Contrastive Learning of Visual Representations )

Title	Date	Tasks	Status
Exploring Target Representations for Masked Autoencoders	Sep 8, 2022	Image ClassificationInstance Segmentation	CodeCode Available
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers	Aug 12, 2022	image-classificationImage Classification	CodeCode Available
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas	Jul 16, 2022	Contrastive LearningGraph Representation Learning	CodeCode Available
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping	Jul 13, 2022	ClusteringContrastive Learning	CodeCode Available
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision	Feb 16, 2022	Action ClassificationAction Recognition	CodeCode Available
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework	Feb 7, 2022	Image Captioningimage-classification	CodeCode Available
Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?	Jan 13, 2022	image-classificationImage Classification	CodeCode Available
Efficient Self-supervised Vision Transformers for Representation Learning	Jun 17, 2021	Representation LearningSelf-Supervised Image Classification	CodeCode Available
Large-Scale Unsupervised Person Re-Identification with Contrastive Learning	May 17, 2021	Contrastive LearningDomain Adaptation	—Unverified
Divide and Contrast: Self-supervised Learning from Uncurated Data	May 17, 2021	ClusteringContrastive Learning	—Unverified

Title

Status

Hype

Exploring Target Representations for Masked Autoencoders

CodeCode Available

BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers

CodeCode Available

Model-Aware Contrastive Learning: Towards Escaping the Dilemmas

CodeCode Available

Unsupervised Visual Representation Learning by Synchronous Momentum Grouping

CodeCode Available

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision