Self-Supervised Learning

Self-Supervised Learning was proposed to bring the success of supervised learning to unlabeled data. Producing a dataset with good labels is expensive, while unlabeled data is generated all the time. The motivation of Self-Supervised Learning is to make use of this large amount of unlabeled data. The main idea is to generate labels from the unlabeled data itself, according to its structure or characteristics, and then train on this data in a supervised manner. Self-Supervised Learning is widely used in representation learning to make a model learn the latent features of the data. The technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun
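
The label-generation idea described above can be illustrated with a small sketch. It assumes a rotation-prediction pretext task, which is one common choice and not something the description prescribes: each unlabeled image is rotated by a random multiple of 90 degrees, and the rotation index becomes the label. The function name `make_rotation_task` and the random arrays standing in for images are hypothetical and used only for illustration.

```python
import numpy as np


def make_rotation_task(images, rng):
    """Turn unlabeled square images into (input, label) pairs.

    Each image is rotated by k * 90 degrees; k (0..3) is used as the label.
    The labels come from the transformation itself, not from annotation.
    """
    ks = rng.integers(0, 4, size=len(images))
    rotated = np.stack([np.rot90(img, k=int(k)) for img, k in zip(images, ks)])
    return rotated, ks


rng = np.random.default_rng(0)
unlabeled = rng.random((8, 32, 32))  # stand-in for real unlabeled images
inputs, labels = make_rotation_task(unlabeled, rng)
```

The resulting `inputs` and `labels` can then be fed to any ordinary supervised classifier; the representation it learns while predicting rotations is what is typically reused for downstream tasks.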

Papers


Showing 101–150 of 5044 papers

Title	Date	Tasks	Status	Hype
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation	Apr 2, 2024	3D Pose Estimation, Pose Estimation	Code Available	2
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders	Mar 26, 2024	Object, Self-Supervised Learning	Code Available	2
Towards Large-Scale Training of Pathology Foundation Models	Mar 24, 2024	Nuclear Segmentation, Self-Supervised Learning	Code Available	2
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs	Mar 19, 2024	Few-Shot Learning, Self-Supervised Learning	Code Available	2
A Versatile Framework for Multi-scene Person Re-identification	Mar 17, 2024	Data Augmentation, Person Re-Identification	Code Available	2
BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics	Mar 15, 2024	Audio Classification, Classification	Code Available	2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement	Mar 11, 2024	Clinical Knowledge, Descriptive	Code Available	2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV	Mar 3, 2024	Depth Estimation, Monocular Depth Estimation	Code Available	2
Dynamic 3D Point Cloud Sequences as 2D Videos	Mar 2, 2024	Action Recognition, Self-Supervised Learning	Code Available	2
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition	Feb 20, 2024	Emotion Recognition, Self-Supervised Learning	Code Available	2
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection	Feb 5, 2024	Object, object-detection	Code Available	2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	Diagnostic, ECG Classification	Code Available	2
InfMAE: A Foundation Model in the Infrared Modality	Feb 1, 2024	Decoder, Self-Supervised Learning	Code Available	2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing	Jan 29, 2024	GPU, Representation Learning	Code Available	2
Deconstructing Denoising Diffusion Models for Self-Supervised Learning	Jan 25, 2024	Denoising, Image Generation	Code Available	2
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration	Jan 23, 2024	3D Semantic Segmentation, Autonomous Driving	Code Available	2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment	Jan 16, 2024	Disentanglement, Self-Supervised Learning	Code Available	2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition	Jan 11, 2024	Contrastive Learning, Dynamic Facial Expression Recognition	Code Available	2
Singer Identity Representation Learning using Self-Supervised Techniques	Jan 10, 2024	Domain Generalization, Representation Learning	Code Available	2
Low-resource finetuning of foundation models beats state-of-the-art in histopathology	Jan 9, 2024	GPU, Self-Supervised Learning	Code Available	2
PhilEO Bench: Evaluating Geo-Spatial Foundation Models	Jan 9, 2024	Density Estimation, Earth Observation	Code Available	2
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation	Jan 1, 2024	General Knowledge, Navigate	Code Available	2
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond	Dec 31, 2023	Representation Learning, Self-Supervised Learning	Code Available	2
PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains	Dec 15, 2023	Self-Supervised Learning	Code Available	2
High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Nov 9, 2023	Decoder, Representation Learning	Code Available	2
A Foundation Model for Music Informatics	Nov 6, 2023	Information Retrieval, model	Code Available	2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks	Oct 30, 2023	Benchmarking, object-detection	Code Available	2
GraphGPT: Graph Instruction Tuning for Large Language Models	Oct 19, 2023	Data Augmentation, Graph Learning	Code Available	2
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving	Oct 12, 2023	3D Object Detection, 3D Semantic Segmentation	Code Available	2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders	Aug 19, 2023	Inductive Bias, Motion Forecasting	Code Available	2
SSLRec: A Self-Supervised Learning Framework for Recommendation	Aug 10, 2023	Collaborative Filtering, Data Augmentation	Code Available	2
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset	Jun 29, 2023	Image Segmentation, Medical Image Segmentation	Code Available	2
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing	Jun 19, 2023	Classification, Cross-Modal Retrieval	Code Available	2
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects	Jun 16, 2023	Anomaly Detection, Self-Supervised Learning	Code Available	2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting	Jun 14, 2023	Multivariate Time Series Forecasting, Representation Learning	Code Available	2
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training	May 31, 2023	Language Modelling, Quantization	Code Available	2
Pengi: An Audio Language Model for Audio Tasks	May 19, 2023	Audio captioning, Audio Question Answering	Code Available	2
Equivariant Multi-Modality Image Fusion	May 19, 2023	Self-Supervised Learning	Code Available	2
Lightweight, Pre-trained Transformers for Remote Sensing Timeseries	Apr 27, 2023	Crop Classification, Self-Supervised Learning	Code Available	2
Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar	Apr 14, 2023	Decoder, Self-Supervised Learning	Code Available	2
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner	Apr 10, 2023	Self-Supervised Learning	Code Available	2
Slideflow: Deep Learning for Digital Histopathology with Real-Time Whole-Slide Visualization	Apr 9, 2023	Deep Learning, Histopathological Image Classification	Code Available	2
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch	Apr 8, 2023	Quantization, Self-Supervised Learning	Code Available	2
Self-Supervised Multimodal Learning: A Survey	Mar 31, 2023	Machine Translation, Self-Supervised Learning	Code Available	2
Automated Self-Supervised Learning for Recommendation	Mar 14, 2023	Collaborative Filtering, Contrastive Learning	Code Available	2
Stabilizing Transformer Training by Preventing Attention Entropy Collapse	Mar 11, 2023	Automatic Speech Recognition, image-classification	Code Available	2
Towards Democratizing Joint-Embedding Self-Supervised Learning	Mar 3, 2023	Data Augmentation, Misconceptions	Code Available	2
Multi-Modal Self-Supervised Learning for Recommendation	Feb 21, 2023	Contrastive Learning, Data Augmentation	Code Available	2
ClimaX: A foundation model for weather and climate	Jan 24, 2023	model, Self-Supervised Learning	Code Available	2
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture	Jan 19, 2023	Depth Estimation, Depth Prediction	Code Available	2



Datasets: DABS, STL-10, CIFAR10, cifar100, ImageNet-100 (TEMI Split), TinyImageNet, CIFAR-10, CIFAR-100, CREMA-D, Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified