Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 5044 papers

Title	Date	Tasks	Status	Hype
IMPA-HGAE:Intra-Meta-Path Augmented Heterogeneous Graph Autoencoder	Jun 7, 2025	Representation LearningSelf-Supervised Learning	—Unverified	0
Graph Neural Networks in Modern AI-aided Drug Discovery	Jun 7, 2025	Drug Discoverygraph construction	—Unverified	0
EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras	Jun 7, 2025	DeblurringMotion Segmentation	—Unverified	0
When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning	Jun 6, 2025	Contrastive LearningInference Attack	CodeCode Available	0
TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes	Jun 6, 2025	DeepFake DetectionFace Swapping	CodeCode Available	0
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning	Jun 6, 2025	Robot NavigationSelf-Supervised Learning	—Unverified	0
Rethinking Contrastive Learning in Session-based Recommendation	Jun 5, 2025	Contrastive LearningSelf-Supervised Learning	CodeCode Available	0
LSM-2: Learning from Incomplete Wearable Sensor Data	Jun 5, 2025	DiagnosticImputation	—Unverified	0
Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models	Jun 4, 2025	Self-Supervised Learning	—Unverified	0
Prosodic Structure Beyond Lexical Content: A Study of Self-Supervised Learning	Jun 3, 2025	Emotion RecognitionReading Comprehension	—Unverified	0
HGOT: Self-supervised Heterogeneous Graph Neural Network with Optimal Transport	Jun 3, 2025	Graph Neural NetworkNode Classification	—Unverified	0
Synthetic Speech Source Tracing using Metric Learning	Jun 3, 2025	Metric LearningSelf-Supervised Learning	—Unverified	0
Sounding Like a Winner? Prosodic Differences in Post-Match Interviews	Jun 2, 2025	Self-Supervised Learning	—Unverified	0
MoCA: Multi-modal Cross-masked Autoencoder for Digital Health Measurements	Jun 2, 2025	Self-Supervised Learning	—Unverified	0
Self-Supervised-ISAR-Net Enables Fast Sparse ISAR Imaging	Jun 1, 2025	Image ReconstructionSelf-Supervised Learning	—Unverified	0
GigaAM: Efficient Self-Supervised Learner for Speech Recognition	Jun 1, 2025	Automatic Speech RecognitionLanguage Modeling	CodeCode Available	4
HASRD: Hierarchical Acoustic and Semantic Representation Disentanglement	Jun 1, 2025	DisentanglementSelf-Supervised Learning	—Unverified	0
PARROT: Synergizing Mamba and Attention-based SSL Pre-Trained Models via Parallel Branch Hadamard Optimal Transport for Speech Emotion Recognition	Jun 1, 2025	Emotion RecognitionMamba	—Unverified	0
Getting More from Less: Transfer Learning Improves Sleep Stage Decoding Accuracy in Peripheral Wearable Devices	May 31, 2025	EEGSelf-Supervised Learning	—Unverified	0
Towards Unified Neural Decoding with Brain Functional Network Modeling	May 30, 2025	Data IntegrationSelf-Supervised Learning	—Unverified	0
Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization	May 30, 2025	GPUKnowledge Distillation	—Unverified	0
A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning	May 30, 2025	Contrastive LearningSelf-Supervised Learning	—Unverified	0
MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR	May 30, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge	May 30, 2025	Emotion RecognitionSelf-Supervised Learning	CodeCode Available	0
Sparsity-Driven Parallel Imaging Consistency for Improved Self-Supervised MRI Reconstruction	May 30, 2025	MRI ReconstructionSelf-Supervised Learning	—Unverified	0
A Survey of Generative Categories and Techniques in Multimodal Large Language Models	May 29, 2025	Mixture-of-ExpertsSelf-Supervised Learning	—Unverified	0
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	May 29, 2025	Self-Supervised LearningVideo Generation	CodeCode Available	2
Graph Positional Autoencoders as Self-supervised Learners	May 29, 2025	Graph Property PredictionMissing Elements	—Unverified	0
Subgraph Gaussian Embedding Contrast for Self-Supervised Graph Representation Learning	May 29, 2025	Contrastive LearningGraph Representation Learning	CodeCode Available	0
Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM	May 29, 2025	Action DetectionActivity Detection	—Unverified	0
Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition	May 29, 2025	Action ClassificationAction Recognition	CodeCode Available	0
IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction	May 28, 2025	Missing ValuesSelf-Supervised Learning	CodeCode Available	1
ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding	May 27, 2025	Computational EfficiencyMamba	—Unverified	0
Pretraining Language Models to Ponder in Continuous Space	May 27, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
CellCLAT: Preserving Topology and Trimming Redundancy in Self-Supervised Cellular Contrastive Learning	May 27, 2025	Contrastive LearningGraph Learning	CodeCode Available	0
Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis	May 27, 2025	Accented Speech RecognitionSelf-Supervised Learning	—Unverified	0
Supervised and self-supervised land-cover segmentation & classification of the Biesbosch wetlands	May 27, 2025	Land Cover ClassificationSelf-Supervised Learning	—Unverified	0
Training Articulatory Inversion Models for Interspeaker Consistency	May 26, 2025	Self-Supervised Learning	—Unverified	0
Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning	May 26, 2025	image-classificationImage Classification	—Unverified	0
Causality and "In-the-Wild" Video-Based Person Re-ID: A Survey	May 26, 2025	counterfactualCounterfactual Reasoning	—Unverified	0
A Contrastive Learning Foundation Model Based on Perfectly Aligned Sample Pairs for Remote Sensing Images	May 26, 2025	Contrastive LearningSelf-Supervised Learning	—Unverified	0
A Regularization-Guided Equivariant Approach for Image Restoration	May 26, 2025	Data AugmentationImage Restoration	CodeCode Available	1
Automated data curation for self-supervised learning in underwater acoustic analysis	May 26, 2025	Self-Supervised Learning	—Unverified	0
Advancing Video Self-Supervised Learning via Image Foundation Models	May 25, 2025	GPURepresentation Learning	CodeCode Available	0
Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation	May 25, 2025	Computed Tomography (CT)Contrastive Learning	CodeCode Available	0
WeedNet: A Foundation Model-Based Global-to-Local AI Approach for Real-Time Weed Species Identification and Classification	May 25, 2025	Self-Supervised Learning	—Unverified	0
Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation	May 25, 2025	DisentanglementSelf-Supervised Learning	—Unverified	0
Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction	May 24, 2025	intent-classificationIntent Classification	—Unverified	0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining	May 23, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Task-Optimized Convolutional Recurrent Networks Align with Tactile Processing in the Rodent Brain	May 23, 2025	Self-Supervised Learning	—Unverified	0

Show:10 25 50

← PrevPage 2 of 101Next →

All datasets DABS STL-10 CIFAR10 cifar100 ImageNet-100 (TEMI Split)TinyImageNet CIFAR-10 CIFAR-100 CREMA-D Tiny ImageNet

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Pretraining: None	Images & Text	57.5	—	Unverified
2	Pretraining: ShED	Images & Text	54.3	—	Unverified
3	Pretraining: e-Mix	Images & Text	48.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	Accuracy	91.7	—	Unverified
2	ResNet18	Accuracy	91.02	—	Unverified
3	MV-MR	Accuracy	89.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	93.89	—	Unverified
2	ResNet18	average top-1 classification accuracy	92.58	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	72.51	—	Unverified
2	ResNet18	average top-1 classification accuracy	69.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	82.64	—	Unverified
2	CorInfomax (ResNet18)	Top-1 Accuracy	80.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet50	average top-1 classification accuracy	51.84	—	Unverified
2	ResNet18	average top-1 classification accuracy	51.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	93.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet18)	Top-1 Accuracy	71.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hybrid BYOL-S/CvT	Accuracy	67.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CorInfomax (ResNet50)	Top-1 Accuracy	54.86	—	Unverified