SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 40014050 of 5044 papers

TitleStatusHype
Visible and infrared self-supervised fusion trained on a single example0
Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft0
Vision Learners Meet Web Image-Text Pairs0
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision0
Vision Transformers: State of the Art and Research Challenges0
Visual Lexicon: Rich Image Features in Language Space0
Visually Guided Self Supervised Learning of Speech Representations0
Visual Representation Learning with Stochastic Frame Prediction0
Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations0
ViTAR: Vision Transformer with Any Resolution0
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer0
VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning0
VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment0
VRMM: A Volumetric Relightable Morphable Head Model0
Watching Too Much Television is Good: Self-Supervised Audio-Visual Representation Learning from Movies and TV Shows0
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR0
Wav2Vec-Aug: Improved self-supervised training with limited data0
Wav2vec-C: A Self-supervised Model for Speech Representation Learning0
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition0
Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation0
WavFT: Acoustic model finetuning with labelled and unlabelled data0
Weakly Augmented Variational Autoencoder in Time Series Anomaly Detection0
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows0
Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving0
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition0
Weakly-Supervised Surgical Phase Recognition0
WeakSTIL: Weak whole-slide image level stromal tumor infiltrating lymphocyte scores are all you need0
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation0
Wearable-Based Real-time Freezing of Gait Detection in Parkinson's Disease Using Self-Supervised Learning0
Wearable data from subjects playing Super Mario, sitting university exams, or performing physical exercise help detect acute mood episodes via self-supervised learning0
WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets0
WeedNet: A Foundation Model-Based Global-to-Local AI Approach for Real-Time Weed Species Identification and Classification0
Weighted Ensemble Self-Supervised Learning0
WeLM: A Well-Read Pre-trained Language Model for Chinese0
WERank: Towards Rank Degradation Prevention for Self-Supervised Learning Using Weight Regularization0
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model0
What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis0
What Do We Maximize in Self-Supervised Learning?0
What I See Is What You See: Joint Attention Learning for First and Third Person Video Co-analysis0
What shapes the loss landscape of self-supervised learning?0
What to align in multimodal contrastive learning?0
When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery0
When Does Contrastive Visual Representation Learning Work?0
When do neural networks learn world models?0
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?0
Wild Visual Navigation: Fast Traversability Learning via Pre-Trained Models and Online Self-Supervision0
With Great Backbones Comes Great Adversarial Transferability0
XAI for Self-supervised Clustering of Wireless Spectrum Activity0
XLA: A Robust Unsupervised Data Augmentation Framework for Cross-Lingual NLP0
XLVIN: eXecuted Latent Value Iteration Nets0
Show:102550
← PrevPage 81 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified