Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3951–4000 of 5044 papers

Title	Date	Tasks	Status
Unsupervised learning based object detection using Contrastive Learning	Feb 21, 2024	Contrastive LearningObject	—Unverified
Unsupervised Learning of Dense Visual Representations	Nov 11, 2020	Contrastive LearningData Augmentation	—Unverified
Unsupervised Learning on a DIET: Datum IndEx as Target Free of Self-Supervision, Reconstruction, Projector Head	Feb 20, 2023	DecoderSelf-Supervised Learning	—Unverified
Unsupervised Machine Learning for Osteoporosis Diagnosis Using Singh Index Clustering on Hip Radiographs	Nov 22, 2024	DiagnosticSelf-Supervised Learning	—Unverified
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning	Apr 14, 2020	DecoderSelf-Supervised Learning	—Unverified
Self-Supervised Learning of Context-Aware Pitch Prosody Representations	Jul 17, 2020	General ClassificationInformation Retrieval	—Unverified
Unsupervised Skill-Discovery and Skill-Learning in Minecraft	Jul 18, 2021	MinecraftSelf-Supervised Learning	—Unverified
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding	Oct 5, 2021	Boundary DetectionRepresentation Learning	—Unverified
Unsupervised Time-Series Signal Analysis with Autoencoders and Vision Transformers: A Review of Architectures and Applications	Apr 23, 2025	Anomaly DetectionDomain Generalization	—Unverified
Unsupervised Transfer Learning via Adversarial Contrastive Training	Aug 16, 2024	Data AugmentationSelf-Supervised Learning	—Unverified
Unsupervised Transfer Learning with Self-Supervised Remedy	Jun 8, 2020	ClusteringDomain Adaptation	—Unverified
Unsupervised View-Invariant Human Posture Representation	Sep 17, 2021	3D Action Recognition3D Pose Estimation	—Unverified
Unsupervised Waste Classification By Dual-Encoder Contrastive Learning and Multi-Clustering Voting (DECMCV)	Mar 4, 2025	ClassificationContrastive Learning	—Unverified
Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning	Oct 27, 2023	Out-of-Distribution DetectionSelf-Supervised Learning	—Unverified
UPRec: User-Aware Pre-training for Recommender Systems	Feb 22, 2021	Recommendation SystemsSelf-Supervised Learning	—Unverified
USAD: Universal Speech and Audio Representation via Distillation	Jun 23, 2025	Audio TaggingRepresentation Learning	—Unverified
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models	Oct 2, 2023	DenoisingSelf-Supervised Learning	—Unverified
User-LLM: Efficient LLM Contextualization with User Embeddings	Feb 21, 2024	Self-Supervised Learning	—Unverified
Using Deep Learning with Large Aggregated Datasets for COVID-19 Classification from Cough	Jan 5, 2022	COVID-19 DiagnosisSelf-Supervised Learning	—Unverified
Using Navigational Information to Learn Visual Representations	Feb 10, 2022	Contrastive LearningRepresentation Learning	—Unverified
Using Self-Supervised Auxiliary Tasks to Improve Fine-Grained Facial Representation	May 13, 2021	Emotion RecognitionFacial Emotion Recognition	—Unverified
Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video	May 4, 2023	Contrastive LearningSelf-Supervised Learning	—Unverified
Utilizing Cross-Modal Contrastive Learning to Improve Item Categorization BERT Model	May 1, 2022	Contrastive LearningSelf-Supervised Learning	—Unverified
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata	Sep 18, 2023	Multi-Task LearningPrediction	—Unverified
UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation	Jun 16, 2023	Domain AdaptationMultiple Object Tracking	—Unverified
Variance Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations	Sep 29, 2022	Domain GeneralizationSelf-Supervised Learning	—Unverified
Variance-Covariance Regularization Improves Representation Learning	Jun 23, 2023	Long-tail LearningRepresentation Learning	—Unverified
Variance-reduced Language Pretraining via a Mask Proposal Network	Aug 12, 2020	Self-Supervised LearningSentence	—Unverified
Variational Monocular Depth Estimation for Reliability Prediction	Nov 24, 2020	Autonomous VehiclesDepth Estimation	—Unverified
Variational Self-Supervised Contrastive Learning Using Beta Divergence	Sep 5, 2023	Face RecognitionLinear evaluation	—Unverified
Variational Self-Supervised Learning	Apr 6, 2025	DecoderDenoising	—Unverified
Vector-Symbolic Architecture for Event-Based Optical Flow	May 14, 2024	Event-based Optical FlowOptical Flow Estimation	—Unverified
Very High-Resolution Forest Mapping with TanDEM-X InSAR Data and Self-Supervised Learning	May 6, 2025	Self-Supervised Learning	—Unverified
VET-DINO: Learning Anatomical Understanding Through Multi-View Distillation in Veterinary Imaging	May 21, 2025	Self-Supervised Learning	—Unverified
Computer Vision Self-supervised Learning Methods on Time Series	Sep 2, 2021	Self-Supervised LearningTime Series	—Unverified
Video as the New Language for Real-World Decision Making	Feb 27, 2024	Decision MakingIn-Context Learning	—Unverified
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound	Aug 21, 2024	Audio GenerationAudio Synthesis	—Unverified
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition	Aug 22, 2018	Action RecognitionActivity Recognition	—Unverified
Video Representation Learning by Recognizing Temporal Transformations	Jul 21, 2020	Action RecognitionRepresentation Learning	—Unverified
Video Transformers: A Survey	Jan 16, 2022	Action ClassificationSelf-Supervised Learning	—Unverified
Video Understanding as Machine Translation	Jun 12, 2020	Machine TranslationMetric Learning	—Unverified
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding	Mar 24, 2025	8kGPU	—Unverified
VieSum: How Robust Are Transformer-based Models on Vietnamese Summarization?	Oct 8, 2021	Abstractive Text SummarizationDecoder	—Unverified
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining	May 23, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
ViewMix: Augmentation for Robust Representation in Self-Supervised Learning	Sep 6, 2023	Representation LearningSelf-Supervised Learning	—Unverified
ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation	Dec 1, 2022	Image ReconstructionSelf-Supervised Learning	—Unverified
Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation	May 28, 2024	Representation LearningSelf-Supervised Learning	—Unverified
VIGraph: Generative Self-supervised Learning for Class-Imbalanced Node Classification	Nov 2, 2023	Contrastive LearningNode Classification	—Unverified
Vi-MIX FOR SELF-SUPERVISED VIDEO REPRESENTATION	Sep 29, 2021	Action RecognitionRepresentation Learning	—Unverified
Virtual Node Generation for Node Classification in Sparsely-Labeled Graphs	Sep 12, 2024	Graph LearningMeta-Learning	—Unverified

Show:10 25 50

← PrevPage 80 of 101Next →

All datasets DABS STL-10 CIFAR10 cifar100 ImageNet-100 (TEMI Split)TinyImageNet CIFAR-10 CIFAR-100 CREMA-D Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified