SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 51100 of 5044 papers

TitleStatusHype
The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic SpeechCode3
Masked Siamese Networks for Label-Efficient LearningCode2
A Survey of Spatio-Temporal EEG data Analysis: from Models to ApplicationsCode2
Masked Autoencoders for Microscopy are Scalable Learners of Cellular BiologyCode2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing ModelCode2
Low-resource finetuning of foundation models beats state-of-the-art in histopathologyCode2
Lightweight, Pre-trained Transformers for Remote Sensing TimeseriesCode2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token EmbeddingsCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
A Survey on Mixup Augmentations and BeyondCode2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTVCode2
Masked Modeling for Self-supervised Representation Learning on Vision and BeyondCode2
InfMAE: A Foundation Model in the Infrared ModalityCode2
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling SpeakersCode2
Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function PredictionsCode2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised LearningCode2
Imagine Before Go: Self-Supervised Generative Map for Object Goal NavigationCode2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion RecognitionCode2
A Simple Framework for Contrastive Learning of Visual RepresentationsCode2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of ElectrocardiogramCode2
High-Performance Transformers for Table Structure Recognition Need Early ConvolutionsCode2
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical FlowCode2
MedIAnomaly: A comparative study of anomaly detection in medical imagesCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
An OpenMind for 3D medical vision self-supervised learningCode2
A Comprehensive Survey on Self-Supervised Learning for RecommendationCode2
EMP-SSL: Towards Self-Supervised Learning in One Training EpochCode2
Equivariant Multi-Modality Image FusionCode2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer VisionCode2
Efficient Image Pre-Training with Siamese Cropped Masked AutoencodersCode2
ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsCode2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentCode2
DM-Codec: Distilling Multimodal Representations for Speech TokenizationCode2
Dynamic 3D Point Cloud Sequences as 2D VideosCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual BackbonesCode2
DGFont++: Robust Deformable Generative Networks for Unsupervised Font GenerationCode2
Diffusion Models and Representation Learning: A SurveyCode2
EMO-SUPERB: An In-depth Look at Speech Emotion RecognitionCode2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource ScenariosCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud UnderstandingCode2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked AutoencodersCode2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
A Foundation Model for Music InformaticsCode2
GraphMAE: Self-Supervised Masked Graph AutoencodersCode2
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionCode2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
Argoverse 2: Next Generation Datasets for Self-Driving Perception and ForecastingCode2
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature DistillationCode2
Show:102550
← PrevPage 2 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified