Self-Supervised Learning

Self-Supervised Learning was proposed to bring the success of supervised learning to unlabeled data. Producing a dataset with good labels is expensive, while unlabeled data is generated all the time. The motivation of Self-Supervised Learning is to make use of this large amount of unlabeled data. The main idea is to generate labels from the unlabeled data itself, according to its structure or characteristics, and then train on this data in a supervised manner. Self-Supervised Learning is widely used in representation learning to make a model learn the latent features of the data. The technique is often employed in computer vision, video processing, and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun
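
To make the idea of generating labels from the data itself concrete, here is a minimal sketch of one well-known pretext task, rotation prediction. The description above does not name a specific pretext task, so this choice is an assumption, as are the function name `make_rotation_pretext` and the random arrays standing in for unlabeled images.

```python
import numpy as np

def make_rotation_pretext(images, rng):
    """Turn unlabeled square images into (input, label) pairs.

    Each label is the number of 90-degree turns applied to the image,
    so the supervision signal comes from the data alone.
    """
    labels = rng.integers(0, 4, size=len(images))  # 0, 1, 2 or 3 quarter turns
    rotated = np.stack(
        [np.rot90(img, k=int(k)) for img, k in zip(images, labels)]
    )
    return rotated, labels

rng = np.random.default_rng(0)
# Hypothetical stand-in for 8 unlabeled 32x32 grayscale images.
unlabeled = rng.random((8, 32, 32))
inputs, targets = make_rotation_pretext(unlabeled, rng)
```

The resulting `inputs` and `targets` could then be passed to any ordinary 4-way classifier, and the features that classifier learns would be reused for a downstream task.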

Papers


Showing 51–100 of 5044 papers

Title	Date	Tasks	Status	Hype
Pushing the limits of raw waveform speaker recognition	Mar 16, 2022	Self-Supervised Learning, Speaker Recognition	Code Available	3
Masked Siamese Networks for Label-Efficient Learning	Apr 14, 2022	image-classification, Image Classification	Code Available	2
A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications	Sep 26, 2024	EEG, Self-Supervised Learning	Code Available	2
Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology	Apr 16, 2024	Drug Discovery, Self-Supervised Learning	Code Available	2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model	Jun 14, 2022	Decision Making, News Classification	Code Available	2
Low-resource finetuning of foundation models beats state-of-the-art in histopathology	Jan 9, 2024	GPU, Self-Supervised Learning	Code Available	2
Lightweight, Pre-trained Transformers for Remote Sensing Timeseries	Apr 27, 2023	Crop Classification, Self-Supervised Learning	Code Available	2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings	Aug 25, 2024	Language Modelling, Link Prediction	Code Available	2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Oct 30, 2024	General Reinforcement Learning, Reinforcement Learning (RL)	Code Available	2
A Survey on Mixup Augmentations and Beyond	Sep 8, 2024	Image Classification, Self-Supervised Learning	Code Available	2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV	Mar 3, 2024	Depth Estimation, Monocular Depth Estimation	Code Available	2
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond	Dec 31, 2023	Representation Learning, Self-Supervised Learning	Code Available	2
InfMAE: A Foundation Model in the Infrared Modality	Feb 1, 2024	Decoder, Self-Supervised Learning	Code Available	2
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers	Apr 20, 2022	Disentanglement, Self-Supervised Learning	Code Available	2
Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions	Apr 1, 2022	Self-Supervised Learning	Code Available	2
Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting	Jan 2, 2023	3D Object Detection, Motion Forecasting	Code Available	2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning	Oct 24, 2022	GPU, Self-Supervised Learning	Code Available	2
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation	Jan 1, 2024	General Knowledge, Navigate	Code Available	2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition	Jan 11, 2024	Contrastive Learning, Dynamic Facial Expression Recognition	Code Available	2
A Simple Framework for Contrastive Learning of Visual Representations	Feb 13, 2020	Contrastive Learning, Image Classification	Code Available	2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	Diagnostic, ECG Classification	Code Available	2
High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Nov 9, 2023	Decoder, Representation Learning	Code Available	2
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow	Nov 18, 2022	Optical Flow Estimation, Position	Code Available	2
MedIAnomaly: A comparative study of anomaly detection in medical images	Apr 6, 2024	Anomaly Classification, Anomaly Detection	Code Available	2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Nov 19, 2024	3D Object Detection, Autonomous Driving	Code Available	2
An OpenMind for 3D medical vision self-supervised learning	Dec 22, 2024	Benchmarking, Self-Supervised Learning	Code Available	2
A Comprehensive Survey on Self-Supervised Learning for Recommendation	Apr 4, 2024	Contrastive Learning, Recommendation Systems	Code Available	2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios	Jun 13, 2024	Language Identification, Self-Supervised Learning	Code Available	2
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch	Apr 8, 2023	Quantization, Self-Supervised Learning	Code Available	2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision	Jul 25, 2024	Diversity, Medical Image Analysis	Code Available	2
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders	Mar 26, 2024	Object, Self-Supervised Learning	Code Available	2
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations	Sep 26, 2019	Common Sense Reasoning, GPU	Code Available	2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment	Jan 16, 2024	Disentanglement, Self-Supervised Learning	Code Available	2
DM-Codec: Distilling Multimodal Representations for Speech Tokenization	Oct 19, 2024	Self-Supervised Learning, Speech Tokenization	Code Available	2
Dynamic 3D Point Cloud Sequences as 2D Videos	Mar 2, 2024	Action Recognition, Self-Supervised Learning	Code Available	2
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones	Nov 17, 2022	Data Augmentation, Self-Supervised Learning	Code Available	2
DiffMM: Multi-Modal Diffusion Model for Recommendation	Jun 17, 2024	Contrastive Learning, model	Code Available	2
DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation	Dec 30, 2022	Font Generation, Image-to-Image Translation	Code Available	2
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	Denoising, Representation Learning	Code Available	2
Equivariant Multi-Modality Image Fusion	May 19, 2023	Self-Supervised Learning	Code Available	2
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition	Feb 20, 2024	Emotion Recognition, Self-Supervised Learning	Code Available	2
GraphGPT: Graph Instruction Tuning for Large Language Models	Oct 19, 2023	Data Augmentation, Graph Learning	Code Available	2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders	Aug 19, 2023	Inductive Bias, Motion Forecasting	Code Available	2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning	Dec 16, 2024	DeepFake Detection, diffusion-generated faces detection	Code Available	2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding	Mar 1, 2022	3D Object Classification, 3D Point Cloud Linear Classification	Code Available	2
GraphMAE: Self-Supervised Masked Graph Autoencoders	May 22, 2022	Contrastive Learning, Graph Classification	Code Available	2
A Foundation Model for Music Informatics	Nov 6, 2023	Information Retrieval, model	Code Available	2
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection	Feb 5, 2024	Object, object-detection	Code Available	2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing	Jan 29, 2024	GPU, Representation Learning	Code Available	2
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation	May 27, 2022	Contrastive Learning, image-classification	Code Available	2


Datasets: DABS, STL-10, CIFAR10, cifar100, ImageNet-100 (TEMI Split), TinyImageNet, CIFAR-10, CIFAR-100, CREMA-D, Tiny ImageNet

Benchmark Results

Dataset: DABS
#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

Dataset: STL-10
#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

Dataset: CIFAR10
#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

Dataset: cifar100
#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

Dataset: ImageNet-100 (TEMI Split)
#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

Dataset: TinyImageNet
#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

Dataset: CIFAR-10
#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

Dataset: CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

Dataset: CREMA-D
#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

Dataset: Tiny ImageNet
#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified