SOTAVerified

Self-Supervised Image Classification

This is the task of image classification using representations learnt with self-supervised learning. Self-supervised methods generally involve a pretext task that is solved to learn a good representation, together with a loss function to optimize. One example is an autoencoder-based loss, where the goal is to reconstruct an image pixel by pixel. A more popular recent example is a contrastive loss, which measures the similarity of sample pairs in a representation space, and where the target can vary instead of being a fixed target to reconstruct (as in the case of autoencoders).
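As a rough illustration of the contrastive idea, the sketch below computes an InfoNCE-style loss for a single anchor embedding, one positive, and a list of negatives. It is a minimal, assumption-laden example and not the formulation of any specific paper listed here: the function names (`cosine_similarity`, `info_nce_loss`), the use of cosine similarity, and the temperature value of 0.5 are all choices made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchor, positive, negatives, temperature=0.5):
    """InfoNCE-style loss for one anchor (illustrative, hypothetical helper).

    The loss is the negative log-probability of picking the positive
    among {positive} + negatives, with similarities scaled by temperature.
    """
    # Index 0 holds the positive pair; the rest are negative pairs.
    logits = [cosine_similarity(anchor, positive) / temperature]
    logits += [cosine_similarity(anchor, n) / temperature for n in negatives]

    # Numerically stable log-sum-exp over all candidates.
    max_logit = max(logits)
    log_sum_exp = max_logit + math.log(
        sum(math.exp(l - max_logit) for l in logits)
    )
    return log_sum_exp - logits[0]


if __name__ == "__main__":
    anchor = [1.0, 0.0]
    # A positive that is close to the anchor yields a low loss.
    print(info_nce_loss(anchor, [0.9, 0.1], [[0.0, 1.0], [-1.0, 0.0]]))
    # A "positive" that is far from the anchor yields a higher loss.
    print(info_nce_loss(anchor, [0.0, 1.0], [[0.9, 0.1], [-1.0, 0.0]]))
```

The loss falls as the anchor moves closer to its positive and further from the negatives, which is the sense in which the target varies with the sampled pairs rather than being a fixed reconstruction.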

A common evaluation protocol is to train a linear classifier on top of (frozen) representations learnt by self-supervised methods. The leaderboards for the linear evaluation protocol can be found below. In practice, it is more common to fine-tune features on a downstream task. An alternative evaluation protocol therefore uses semi-supervised learning and fine-tunes on a small percentage of the labels. The leaderboards for the fine-tuning protocol can be accessed here.
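The linear evaluation protocol can be sketched in a few lines: the encoder stays fixed and only a linear classifier on its outputs is trained. The example below is a toy sketch under stated assumptions, not a benchmark implementation: `frozen_encoder` is a hypothetical stand-in for a pretrained self-supervised backbone, the data is synthetic 2-D points rather than images, and the classifier is a plain logistic regression trained by batch gradient descent.

```python
import math
import random


def frozen_encoder(x):
    """Hypothetical stand-in for a pretrained backbone; never updated here."""
    return [x[0] + x[1], x[0] - x[1]]


def train_linear_probe(features, labels, epochs=300, lr=0.5):
    """Fit a binary logistic-regression classifier on fixed features."""
    dim = len(features[0])
    w, b, n = [0.0] * dim, 0.0, len(features)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for f, y in zip(features, labels):
            z = sum(wi * fi for wi, fi in zip(w, f)) + b
            err = 1.0 / (1.0 + math.exp(-z)) - y  # predicted prob minus label
            for i in range(dim):
                grad_w[i] += err * f[i]
            grad_b += err
        # Only the linear head's parameters (w, b) are updated.
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, f):
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) + b > 0 else 0


def make_dataset(rng, size):
    """Synthetic data (assumption): label is 1 when x0 + x1 > 0."""
    xs = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(size)]
    ys = [1 if x[0] + x[1] > 0 else 0 for x in xs]
    return xs, ys


def run_linear_evaluation(seed=0):
    rng = random.Random(seed)
    train_x, train_y = make_dataset(rng, 200)
    test_x, test_y = make_dataset(rng, 100)
    # Features are computed once; the encoder receives no gradient.
    w, b = train_linear_probe([frozen_encoder(x) for x in train_x], train_y)
    correct = sum(
        predict(w, b, frozen_encoder(x)) == y for x, y in zip(test_x, test_y)
    )
    return correct / len(test_y)


if __name__ == "__main__":
    print("linear-probe accuracy:", run_linear_evaluation())
```

Under the fine-tuning protocol, by contrast, the encoder's parameters would also be updated, typically using only a labelled subset of the training data.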

You may want to read some blog posts before reading the papers and checking the leaderboards:

Contrastive Self-Supervised Learning - Ankesh Anand
The Illustrated Self-Supervised Learning - Amit Chaudhary
Self-supervised learning and computer vision - Jeremy Howard
Self-Supervised Representation Learning - Lilian Weng

There is also Yann LeCun's talk at AAAI-20 which you can watch here (35:00+).

(Image credit: A Simple Framework for Contrastive Learning of Visual Representations)

Title	Date	Tasks	Status	Score
Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?	Jan 13, 2022	image-classification, Image Classification	Code Available	5
Unsupervised Representation Learning by Balanced Self Attention Matching	Aug 4, 2024	Representation Learning, Self-Supervised Image Classification	Code Available	5
Efficient Self-supervised Vision Transformers for Representation Learning	Jun 17, 2021	Representation Learning, Self-Supervised Image Classification	Code Available	5
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping	Jul 13, 2022	Clustering, Contrastive Learning	Code Available	5
Unsupervised Pre-Training of Image Features on Non-Curated Data	May 3, 2019	Clustering, Self-Supervised Image Classification	Code Available	5
Local Aggregation for Unsupervised Learning of Visual Embeddings	Mar 29, 2019	Clustering, Contrastive Learning	Code Available	5
Revisiting Self-Supervised Visual Representation Learning	Jan 25, 2019	Representation Learning, Self-Supervised Image Classification	Code Available	5
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision	Feb 16, 2022	Action Classification, Action Recognition	Code Available	5
Masked Image Residual Learning for Scaling Deeper Vision Transformers	Sep 25, 2023	Image Classification, object-detection	Code Available	5
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers	Aug 12, 2022	image-classification, Image Classification	Code Available	5
