SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 51100 of 5044 papers

TitleStatusHype
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech RepresentationsCode3
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic SoundscapesCode2
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20^th century Urban Landscapes with Satellite ImageriesCode2
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation ModelsCode2
RoMA: Scaling up Mamba-based Foundation Models for Remote SensingCode2
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object DetectionCode2
A generalizable 3D framework and model for self-supervised learning in medical imagingCode2
Scaling up self-supervised learning for improved surgical foundation modelsCode2
An OpenMind for 3D medical vision self-supervised learningCode2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
PaPaGei: Open Foundation Models for Optical Physiological SignalsCode2
TabDPT: Scaling Tabular Foundation ModelsCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
DM-Codec: Distilling Multimodal Representations for Speech TokenizationCode2
A Multimodal Vision Foundation Model for Clinical DermatologyCode2
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified PerspectiveCode2
Sylber: Syllabic Embedding Representation of Speech from Raw AudioCode2
A Survey of Spatio-Temporal EEG data Analysis: from Models to ApplicationsCode2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
A Survey on Mixup Augmentations and BeyondCode2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token EmbeddingsCode2
PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersCode2
Snuffy: Efficient Whole Slide Image ClassifierCode2
Multistain Pretraining for Slide Representation Learning in PathologyCode2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility EstimationCode2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer VisionCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete DataCode2
Diffusion Models and Representation Learning: A SurveyCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource ScenariosCode2
Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing DetectionCode2
XRec: Large Language Models for Explainable RecommendationCode2
SelfGNN: Self-Supervised Graph Neural Networks for Sequential RecommendationCode2
Transcriptomics-guided Slide Representation Learning in Computational PathologyCode2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting MaskCode2
The Entropy Enigma: Success and Failure of Entropy MinimizationCode2
Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed ObservationsCode2
TFPred: Learning Discriminative Representations from Unlabeled Data for Few-Label Rotating Machinery Fault DiagnosisCode2
Vim4Path: Self-Supervised Vision Mamba for Histopathology ImagesCode2
Masked Autoencoders for Microscopy are Scalable Learners of Cellular BiologyCode2
MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wildCode2
OmniSat: Self-Supervised Modality Fusion for Earth ObservationCode2
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEGCode2
Test-Time Zero-Shot Temporal Action LocalizationCode2
MedIAnomaly: A comparative study of anomaly detection in medical imagesCode2
A Comprehensive Survey on Self-Supervised Learning for RecommendationCode2
Show:102550
← PrevPage 2 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified