SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 12511300 of 5044 papers

TitleStatusHype
HR-Depth: High Resolution Self-Supervised Monocular Depth EstimationCode1
eProduct: A Million-Scale Visual Search Benchmark to Address Product Recognition ChallengesCode1
Human-machine Interactive Tissue Prototype Learning for Label-efficient Histopathology Image SegmentationCode1
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image SegmentationCode1
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language ModelingCode1
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological TextCode1
Benchmarking Omni-Vision Representation through the Lens of Visual RealmsCode1
Hyper-Representations for Pre-Training and Transfer LearningCode1
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio RepresentationCode1
EchoFM: Foundation Model for Generalizable Echocardiogram AnalysisCode1
Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural RepresentationCode1
Echo-SyncNet: Self-supervised Cardiac View Synchronization in EchocardiographyCode1
Deep learning powered real-time identification of insects using citizen science dataCode1
APSNet: Attention Based Point Cloud SamplingCode1
Enhanced Masked Image Modeling to Avoid Model Collapse on Multi-modal MRI DatasetsCode1
Implicit Autoencoder for Point-Cloud Self-Supervised Representation LearningCode1
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-IdentificationCode1
Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous GraphsCode1
Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised trainingCode1
Effective Self-supervised Pre-training on Low-compute Networks without DistillationCode1
EnCodecMAE: Leveraging neural codecs for universal audio representation learningCode1
Efficiency for Free: Ideal Data Are Transportable RepresentationsCode1
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech RecognitionCode1
A Random CNN Sees Objects: One Inductive Bias of CNN and Its ApplicationsCode1
End-to-end Multi-modal Video Temporal GroundingCode1
Overcoming Data Limitations: A Few-Shot Specific Emitter Identification Method Using Self-Supervised Learning and Adversarial AugmentationCode1
Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill LearningCode1
Improving Adaptive Conformal Prediction Using Self-Supervised LearningCode1
Improving Knowledge-aware Recommendation with Multi-level Interactive Contrastive LearningCode1
Improving Medical Image Classification in Noisy Labels Using Only Self-supervised PretrainingCode1
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data PerspectiveCode1
Improving Label-Deficient Keyword Spotting Through Self-Supervised PretrainingCode1
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility AssessmentCode1
Efficient Representation Learning for Healthcare with Cross-Architectural Self-SupervisionCode1
Improving Self-Supervised Learning by Characterizing Idealized RepresentationsCode1
Improving Representation Learning for Histopathologic Images with Cluster ConstraintsCode1
End-to-end Multiple Instance Learning with Gradient AccumulationCode1
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and LanguageCode1
Efficient Self-Supervised Video Hashing with Selective State SpacesCode1
Efficient Self-supervised Vision Pretraining with Local Masked ReconstructionCode1
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-SupervisionCode1
Improving Self-supervised Pre-training using Accent-Specific CodebooksCode1
Overcoming Language Priors with Self-supervised Learning for Visual Question AnsweringCode1
Energy-Based Contrastive Learning of Visual RepresentationsCode1
Adversarial Graph Augmentation to Improve Graph Contrastive LearningCode1
IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series PredictionCode1
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal TokensCode1
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation LearningCode1
A Regularization-Guided Equivariant Approach for Image RestorationCode1
Optimal Representations for Covariate ShiftCode1
Show:102550
← PrevPage 26 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified