SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 151200 of 5044 papers

TitleStatusHype
Diffusion Models and Representation Learning: A SurveyCode2
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionCode2
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentCode2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised LearningCode2
Multistain Pretraining for Slide Representation Learning in PathologyCode2
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical FlowCode2
InfMAE: A Foundation Model in the Infrared ModalityCode2
Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function PredictionsCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token EmbeddingsCode2
Dynamic 3D Point Cloud Sequences as 2D VideosCode2
EMO-SUPERB: An In-depth Look at Speech Emotion RecognitionCode2
An OpenMind for 3D medical vision self-supervised learningCode2
BirdSet: A Large-Scale Dataset for Audio Classification in Avian BioacousticsCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel FusionCode2
ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsCode2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud UnderstandingCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
Multi-Modal Self-Supervised Learning for RecommendationCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
Contrastive Audio-Visual Masked AutoencoderCode2
Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motionCode2
Context Autoencoder for Self-Supervised Representation LearningCode2
Pengi: An Audio Language Model for Audio TasksCode2
Argoverse 2: Next Generation Datasets for Self-Driving Perception and ForecastingCode2
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-trainingCode2
A Comprehensive Survey on Self-Supervised Learning for RecommendationCode2
A Foundation Model for Music InformaticsCode2
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature DistillationCode2
A Versatile Framework for Multi-scene Person Re-identificationCode2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility EstimationCode2
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language ModelCode2
BYOL for Audio: Exploring Pre-trained General-purpose Audio RepresentationsCode2
A Multimodal Vision Foundation Model for Clinical DermatologyCode2
DGFont++: Robust Deformable Generative Networks for Unsupervised Font GenerationCode2
Scaling up self-supervised learning for improved surgical foundation modelsCode2
Self-Supervised Learning for Recommender Systems: A SurveyCode2
SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance FieldsCode2
A Simple Framework for Contrastive Learning of Visual RepresentationsCode2
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing ModelCode2
Self-Supervised Learning from Images with a Joint-Embedding Predictive ArchitectureCode2
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural CalibrationCode2
A Survey of Spatio-Temporal EEG data Analysis: from Models to ApplicationsCode2
CLARA: Multilingual Contrastive Learning for Audio Representation AcquisitionCode1
Adaptive Graph Contrastive Learning for RecommendationCode1
Automatically Discovering and Learning New Visual Categories with Ranking StatisticsCode1
Automatic identification of segmentation errors for radiotherapy using geometric learningCode1
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIPCode1
Show:102550
← PrevPage 4 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified