Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 5044 papers

Title	Date	Tasks	Status	Hype
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations	Jun 20, 2020	QuantizationSelf-Supervised Learning	CodeCode Available	3
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes	Jun 13, 2025	Linear evaluationSelf-Supervised Learning	CodeCode Available	2
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20^th century Urban Landscapes with Satellite Imageries	Jun 11, 2025	SegmentationSelf-Supervised Learning	CodeCode Available	2
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	May 29, 2025	Self-Supervised LearningVideo Generation	CodeCode Available	2
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing	Mar 13, 2025	Computational EfficiencyMamba	CodeCode Available	2
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection	Jan 23, 2025	object-detectionObject Detection	CodeCode Available	2
A generalizable 3D framework and model for self-supervised learning in medical imaging	Jan 20, 2025	Medical Image SegmentationSelf-Supervised Learning	CodeCode Available	2
Scaling up self-supervised learning for improved surgical foundation models	Jan 16, 2025	Self-Supervised LearningSemantic Segmentation	CodeCode Available	2
An OpenMind for 3D medical vision self-supervised learning	Dec 22, 2024	BenchmarkingSelf-Supervised Learning	CodeCode Available	2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning	Dec 16, 2024	DeepFake Detectiondiffusion-generated faces detection	CodeCode Available	2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Nov 19, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Oct 30, 2024	General Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	2
PaPaGei: Open Foundation Models for Optical Physiological Signals	Oct 27, 2024	Contrastive LearningDomain Generalization	CodeCode Available	2
TabDPT: Scaling Tabular Foundation Models	Oct 23, 2024	In-Context LearningSelf-Supervised Learning	CodeCode Available	2
TIPS: Text-Image Pretraining with Spatial Awareness	Oct 21, 2024	Depth EstimationImage Captioning	CodeCode Available	2
DM-Codec: Distilling Multimodal Representations for Speech Tokenization	Oct 19, 2024	Self-Supervised LearningSpeech Tokenization	CodeCode Available	2
A Multimodal Vision Foundation Model for Clinical Dermatology	Oct 19, 2024	DiagnosticLesion Segmentation	CodeCode Available	2
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective	Oct 16, 2024	Conditional Image GenerationImage Generation	CodeCode Available	2
Sylber: Syllabic Embedding Representation of Speech from Raw Audio	Oct 9, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications	Sep 26, 2024	EEGSelf-Supervised Learning	CodeCode Available	2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection	Sep 26, 2024	Event DetectionRepresentation Learning	CodeCode Available	2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks	Sep 10, 2024	Contrastive LearningImage Reconstruction	CodeCode Available	2
A Survey on Mixup Augmentations and Beyond	Sep 8, 2024	Image ClassificationSelf-Supervised Learning	CodeCode Available	2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings	Aug 25, 2024	Language ModellingLink Prediction	CodeCode Available	2
PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders	Aug 16, 2024	3D Object Classification3D Point Cloud Classification	CodeCode Available	2
Snuffy: Efficient Whole Slide Image Classifier	Aug 15, 2024	Breast Cancer DetectionLung Cancer Diagnosis	CodeCode Available	2
Multistain Pretraining for Slide Representation Learning in Pathology	Aug 5, 2024	Representation LearningSelf-Supervised Learning	CodeCode Available	2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation	Aug 5, 2024	RhythmSelf-Supervised Learning	CodeCode Available	2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision	Jul 25, 2024	DiversityMedical Image Analysis	CodeCode Available	2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jul 19, 2024	Data AugmentationDepth Estimation	CodeCode Available	2
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data	Jul 10, 2024	Contrastive Learningmultimodal interaction	CodeCode Available	2
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2
DiffMM: Multi-Modal Diffusion Model for Recommendation	Jun 17, 2024	Contrastive Learningmodel	CodeCode Available	2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios	Jun 13, 2024	Language IdentificationSelf-Supervised Learning	CodeCode Available	2
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection	Jun 12, 2024	Computational EfficiencySelf-Supervised Learning	CodeCode Available	2
XRec: Large Language Models for Explainable Recommendation	Jun 4, 2024	Collaborative FilteringDecision Making	CodeCode Available	2
SelfGNN: Self-Supervised Graph Neural Networks for Sequential Recommendation	May 31, 2024	Graph Neural NetworkRecommendation Systems	CodeCode Available	2
Transcriptomics-guided Slide Representation Learning in Computational Pathology	May 19, 2024	Contrastive LearningRepresentation Learning	CodeCode Available	2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask	May 9, 2024	Anomaly DetectionImputation	CodeCode Available	2
The Entropy Enigma: Success and Failure of Entropy Minimization	May 8, 2024	Self-Supervised Learning	CodeCode Available	2
Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations	May 3, 2024	Optical Flow EstimationReference-based Super-Resolution	CodeCode Available	2
TFPred: Learning Discriminative Representations from Unlabeled Data for Few-Label Rotating Machinery Fault Diagnosis	May 1, 2024	Fault DetectionFault Diagnosis	CodeCode Available	2
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images	Apr 20, 2024	DiagnosticMamba	CodeCode Available	2
Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology	Apr 16, 2024	Drug DiscoverySelf-Supervised Learning	CodeCode Available	2
MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild	Apr 13, 2024	cross-modal alignmentDynamic Facial Expression Recognition	CodeCode Available	2
OmniSat: Self-Supervised Modality Fusion for Earth Observation	Apr 12, 2024	DiversityEarth Observation	CodeCode Available	2
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG	Apr 10, 2024	Contrastive LearningEEG	CodeCode Available	2
Test-Time Zero-Shot Temporal Action Localization	Apr 8, 2024	Action LocalizationLanguage Modelling	CodeCode Available	2
MedIAnomaly: A comparative study of anomaly detection in medical images	Apr 6, 2024	Anomaly ClassificationAnomaly Detection	CodeCode Available	2
A Comprehensive Survey on Self-Supervised Learning for Recommendation	Apr 4, 2024	Contrastive LearningRecommendation Systems	CodeCode Available	2

Show:10 25 50

← PrevPage 2 of 101Next →

All datasets DABS STL-10 CIFAR10 cifar100 ImageNet-100 (TEMI Split)TinyImageNet CIFAR-10 CIFAR-100 CREMA-D Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified