Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1251–1300 of 5044 papers

Title	Date	Tasks	Status	Hype	Score
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation	Dec 14, 2020	Depth EstimationMonocular Depth Estimation	CodeCode Available	1	5
eProduct: A Million-Scale Visual Search Benchmark to Address Product Recognition Challenges	Jul 13, 2021	DiversitySelf-Supervised Learning	CodeCode Available	1	5
Human-machine Interactive Tissue Prototype Learning for Label-efficient Histopathology Image Segmentation	Nov 26, 2022	Contrastive LearningImage Segmentation	CodeCode Available	1	5
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation	Mar 18, 2023	Contrastive LearningImage Segmentation	CodeCode Available	1	5
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling	Dec 18, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text	May 26, 2024	Arrhythmia DetectionRAG	CodeCode Available	1	5
Benchmarking Omni-Vision Representation through the Lens of Visual Realms	Jul 14, 2022	BenchmarkingContrastive Learning	CodeCode Available	1	5
Hyper-Representations for Pre-Training and Transfer Learning	Jul 22, 2022	Knowledge DistillationNeural Architecture Search	CodeCode Available	1	5
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation	Mar 11, 2021	Representation LearningSelf-Supervised Learning	CodeCode Available	1	5
EchoFM: Foundation Model for Generalizable Echocardiogram Analysis	Oct 30, 2024	Contrastive Learningmodel	CodeCode Available	1	5
Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation	Apr 18, 2022	Self-Supervised Learning	CodeCode Available	1	5
Echo-SyncNet: Self-supervised Cardiac View Synchronization in Echocardiography	Feb 3, 2021	One-Shot LearningSelf-Supervised Learning	CodeCode Available	1	5
Deep learning powered real-time identification of insects using citizen science data	Jun 4, 2023	ManagementSelf-Supervised Learning	CodeCode Available	1	5
APSNet: Attention Based Point Cloud Sampling	Oct 11, 2022	3D Point Cloud ClassificationKnowledge Distillation	CodeCode Available	1	5
Enhanced Masked Image Modeling to Avoid Model Collapse on Multi-modal MRI Datasets	Jul 15, 2024	MRI segmentationSelf-Supervised Learning	CodeCode Available	1	5
Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning	Jan 3, 2022	3D geometry3D Point Cloud Classification	CodeCode Available	1	5
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification	Mar 8, 2022	image-classificationImage Classification	CodeCode Available	1	5
Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs	Jul 16, 2020	Auxiliary LearningLink Prediction	CodeCode Available	1	5
Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised training	Feb 16, 2022	Data AugmentationSelf-Supervised Learning	CodeCode Available	1	5
Effective Self-supervised Pre-training on Low-compute Networks without Distillation	Oct 6, 2022	AttributeInstance Segmentation	CodeCode Available	1	5
EnCodecMAE: Leveraging neural codecs for universal audio representation learning	Sep 14, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1	5
Efficiency for Free: Ideal Data Are Transportable Representations	May 23, 2024	Dataset DistillationRepresentation Learning	CodeCode Available	1	5
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition	Feb 7, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1	5
A Random CNN Sees Objects: One Inductive Bias of CNN and Its Applications	Jun 17, 2021	Inductive BiasObject	CodeCode Available	1	5
End-to-end Multi-modal Video Temporal Grounding	Jul 12, 2021	Optical Flow EstimationSelf-Supervised Learning	CodeCode Available	1	5
Overcoming Data Limitations: A Few-Shot Specific Emitter Identification Method Using Self-Supervised Learning and Adversarial Augmentation	Sep 7, 2023	Self-Supervised LearningTransfer Learning	CodeCode Available	1	5
Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning	Dec 11, 2024	Representation LearningSelf-Supervised Learning	CodeCode Available	1	5
Improving Adaptive Conformal Prediction Using Self-Supervised Learning	Feb 23, 2023	Conformal PredictionPrediction	CodeCode Available	1	5
Improving Knowledge-aware Recommendation with Multi-level Interactive Contrastive Learning	Aug 22, 2022	Contrastive LearningKnowledge-Aware Recommendation	CodeCode Available	1	5
Improving Medical Image Classification in Noisy Labels Using Only Self-supervised Pretraining	Aug 8, 2023	Classificationimage-classification	CodeCode Available	1	5
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective	Jul 10, 2024	BenchmarkingDiagnostic	CodeCode Available	1	5
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining	Oct 4, 2022	Keyword SpottingSelf-Supervised Learning	CodeCode Available	1	5
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment	Mar 29, 2022	Phoneme RecognitionPseudo Label	CodeCode Available	1	5
Efficient Representation Learning for Healthcare with Cross-Architectural Self-Supervision	Aug 19, 2023	Representation LearningSelf-Supervised Learning	CodeCode Available	1	5
Improving Self-Supervised Learning by Characterizing Idealized Representations	Sep 13, 2022	Contrastive LearningSelf-Supervised Learning	CodeCode Available	1	5
Improving Representation Learning for Histopathologic Images with Cluster Constraints	Oct 18, 2023	ClusteringRepresentation Learning	CodeCode Available	1	5
End-to-end Multiple Instance Learning with Gradient Accumulation	Mar 8, 2022	GPUMultiple Instance Learning	CodeCode Available	1	5
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language	Dec 14, 2022	Decoderimage-classification	CodeCode Available	1	5
Efficient Self-Supervised Video Hashing with Selective State Spaces	Dec 19, 2024	DecoderMamba	CodeCode Available	1	5
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction	Jun 1, 2022	image-classificationImage Classification	CodeCode Available	1	5
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision	Jun 21, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1	5
Improving Self-supervised Pre-training using Accent-Specific Codebooks	Jul 4, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1	5
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering	Dec 17, 2020	Question AnsweringSelf-Supervised Learning	CodeCode Available	1	5
Energy-Based Contrastive Learning of Visual Representations	Feb 10, 2022	Contrastive LearningSelf-Supervised Learning	CodeCode Available	1	5
Adversarial Graph Augmentation to Improve Graph Contrastive Learning	Jun 10, 2021	Contrastive LearningSelf-Supervised Learning	CodeCode Available	1	5
IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction	May 28, 2025	Missing ValuesSelf-Supervised Learning	CodeCode Available	1	5
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens	Nov 19, 2022	Action RecognitionObject State Change Classification	CodeCode Available	1	5
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning	Oct 17, 2024	Representation LearningSelf-Supervised Learning	CodeCode Available	1	5
A Regularization-Guided Equivariant Approach for Image Restoration	May 26, 2025	Data AugmentationImage Restoration	CodeCode Available	1	5
Optimal Representations for Covariate Shift	Dec 31, 2021	Domain GeneralizationImage Classification	CodeCode Available	1	5

Show:10 25 50

← PrevPage 26 of 101Next →

All datasets DABS STL-10 CIFAR10 cifar100 ImageNet-100 (TEMI Split)TinyImageNet CIFAR-10 CIFAR-100 CREMA-D Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified