SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 101150 of 5044 papers

TitleStatusHype
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose EstimationCode2
Efficient Image Pre-Training with Siamese Cropped Masked AutoencodersCode2
Towards Large-Scale Training of Pathology Foundation ModelsCode2
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEsCode2
A Versatile Framework for Multi-scene Person Re-identificationCode2
BirdSet: A Large-Scale Dataset for Audio Classification in Avian BioacousticsCode2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge EnhancementCode2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTVCode2
Dynamic 3D Point Cloud Sequences as 2D VideosCode2
EMO-SUPERB: An In-depth Look at Speech Emotion RecognitionCode2
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionCode2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of ElectrocardiogramCode2
InfMAE: A Foundation Model in the Infrared ModalityCode2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural CalibrationCode2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentCode2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion RecognitionCode2
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
Low-resource finetuning of foundation models beats state-of-the-art in histopathologyCode2
PhilEO Bench: Evaluating Geo-Spatial Foundation ModelsCode2
Imagine Before Go: Self-Supervised Generative Map for Object Goal NavigationCode2
Masked Modeling for Self-supervised Representation Learning on Vision and BeyondCode2
PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC StainsCode2
High-Performance Transformers for Table Structure Recognition Need Early ConvolutionsCode2
A Foundation Model for Music InformaticsCode2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
UniPAD: A Universal Pre-training Paradigm for Autonomous DrivingCode2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked AutoencodersCode2
SSLRec: A Self-Supervised Learning Framework for RecommendationCode2
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated DatasetCode2
RemoteCLIP: A Vision Language Foundation Model for Remote SensingCode2
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and ProspectsCode2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series ForecastingCode2
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised TrainingCode2
Pengi: An Audio Language Model for Audio TasksCode2
Equivariant Multi-Modality Image FusionCode2
Lightweight, Pre-trained Transformers for Remote Sensing TimeseriesCode2
Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial LidarCode2
GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph LearnerCode2
Slideflow: Deep Learning for Digital Histopathology with Real-Time Whole-Slide VisualizationCode2
EMP-SSL: Towards Self-Supervised Learning in One Training EpochCode2
Self-Supervised Multimodal Learning: A SurveyCode2
Automated Self-Supervised Learning for RecommendationCode2
Stabilizing Transformer Training by Preventing Attention Entropy CollapseCode2
Towards Democratizing Joint-Embedding Self-Supervised LearningCode2
Multi-Modal Self-Supervised Learning for RecommendationCode2
ClimaX: A foundation model for weather and climateCode2
Self-Supervised Learning from Images with a Joint-Embedding Predictive ArchitectureCode2
Show:102550
← PrevPage 3 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified