Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 5044 papers

Title	Date	Tasks	Status	Hype	Score
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	DiagnosticECG Classification	CodeCode Available	2	5
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset	Jun 29, 2023	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
GraphGPT: Graph Instruction Tuning for Large Language Models	Oct 19, 2023	Data AugmentationGraph Learning	CodeCode Available	2	5
AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning	Apr 12, 2022	Activity RecognitionDomain Adaptation	CodeCode Available	2	5
Multiview Compressive Coding for 3D Reconstruction	Jan 19, 2023	3D ReconstructionDecoder	CodeCode Available	2	5
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner	Apr 10, 2023	Self-Supervised Learning	CodeCode Available	2	5
OmniSat: Self-Supervised Modality Fusion for Earth Observation	Apr 12, 2024	DiversityEarth Observation	CodeCode Available	2	5
PaPaGei: Open Foundation Models for Optical Physiological Signals	Oct 27, 2024	Contrastive LearningDomain Generalization	CodeCode Available	2	5
PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders	Aug 16, 2024	3D Object Classification3D Point Cloud Classification	CodeCode Available	2	5
Pengi: An Audio Language Model for Audio Tasks	May 19, 2023	Audio captioningAudio Question Answering	CodeCode Available	2	5
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection	Feb 5, 2024	Objectobject-detection	CodeCode Available	2	5
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection	Sep 26, 2024	Event DetectionRepresentation Learning	CodeCode Available	2	5
InfMAE: A Foundation Model in the Infrared Modality	Feb 1, 2024	DecoderSelf-Supervised Learning	CodeCode Available	2	5
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond	Dec 31, 2023	Representation LearningSelf-Supervised Learning	CodeCode Available	2	5
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders	Aug 19, 2023	Inductive BiasMotion Forecasting	CodeCode Available	2	5
A Simple Framework for Contrastive Learning of Visual Representations	Feb 13, 2020	Contrastive LearningImage Classification	CodeCode Available	2	5
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing	Mar 13, 2025	Computational EfficiencyMamba	CodeCode Available	2	5
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning	Dec 16, 2024	DeepFake Detectiondiffusion-generated faces detection	CodeCode Available	2	5
Equivariant Multi-Modality Image Fusion	May 19, 2023	Self-Supervised Learning	CodeCode Available	2	5
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision	Jul 25, 2024	DiversityMedical Image Analysis	CodeCode Available	2	5
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation	Apr 2, 2024	3D Pose EstimationPose Estimation	CodeCode Available	2	5
Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification	Aug 13, 2022	Contrastive LearningData Augmentation	CodeCode Available	2	5
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects	Jun 16, 2023	Anomaly DetectionSelf-Supervised Learning	CodeCode Available	2	5
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture	Jan 19, 2023	Depth EstimationDepth Prediction	CodeCode Available	2	5
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask	May 9, 2024	Anomaly DetectionImputation	CodeCode Available	2	5
Self-Supervised Log Parsing	Mar 17, 2020	Anomaly DetectionFault Detection	CodeCode Available	2	5
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Nov 19, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2	5
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition	Feb 20, 2024	Emotion RecognitionSelf-Supervised Learning	CodeCode Available	2	5
A generalizable 3D framework and model for self-supervised learning in medical imaging	Jan 20, 2025	Medical Image SegmentationSelf-Supervised Learning	CodeCode Available	2	5
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders	Mar 26, 2024	ObjectSelf-Supervised Learning	CodeCode Available	2	5
A Foundation Model for Music Informatics	Nov 6, 2023	Information Retrievalmodel	CodeCode Available	2	5
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection	Jun 12, 2024	Computational EfficiencySelf-Supervised Learning	CodeCode Available	2	5
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones	Nov 17, 2022	Data AugmentationSelf-Supervised Learning	CodeCode Available	2	5
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch	Apr 8, 2023	QuantizationSelf-Supervised Learning	CodeCode Available	2	5
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment	Jan 16, 2024	DisentanglementSelf-Supervised Learning	CodeCode Available	2	5
DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation	Dec 30, 2022	Font GenerationImage-to-Image Translation	CodeCode Available	2	5
DiffMM: Multi-Modal Diffusion Model for Recommendation	Jun 17, 2024	Contrastive Learningmodel	CodeCode Available	2	5
Dynamic 3D Point Cloud Sequences as 2D Videos	Mar 2, 2024	Action RecognitionSelf-Supervised Learning	CodeCode Available	2	5
Deconstructing Denoising Diffusion Models for Self-Supervised Learning	Jan 25, 2024	DenoisingImage Generation	CodeCode Available	2	5
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing	Jan 29, 2024	GPURepresentation Learning	CodeCode Available	2	5
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2	5
DM-Codec: Distilling Multimodal Representations for Speech Tokenization	Oct 19, 2024	Self-Supervised LearningSpeech Tokenization	CodeCode Available	2	5
Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel Fusion	May 7, 2022	Hyperspectral Image Super-ResolutionImage Super-Resolution	CodeCode Available	2	5
Attention Mechanisms in Computer Vision: A Survey	Nov 15, 2021	image-classificationImage Classification	CodeCode Available	2	5
A Multimodal Vision Foundation Model for Clinical Dermatology	Oct 19, 2024	DiagnosticLesion Segmentation	CodeCode Available	2	5
A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications	Sep 26, 2024	EEGSelf-Supervised Learning	CodeCode Available	2	5
Multistain Pretraining for Slide Representation Learning in Pathology	Aug 5, 2024	Representation LearningSelf-Supervised Learning	CodeCode Available	2	5
A Survey on Mixup Augmentations and Beyond	Sep 8, 2024	Image ClassificationSelf-Supervised Learning	CodeCode Available	2	5
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks	Sep 10, 2024	Contrastive LearningImage Reconstruction	CodeCode Available	2	5
Contrastive Audio-Visual Masked Autoencoder	Oct 2, 2022	Audio ClassificationAudio Tagging	CodeCode Available	2	5

Show:10 25 50

← PrevPage 3 of 101Next →

All datasets DABS STL-10 CIFAR10 cifar100 ImageNet-100 (TEMI Split)TinyImageNet CIFAR-10 CIFAR-100 CREMA-D Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified