SOTAVerified

Self-Supervised Learning

Self-Supervised Learning is proposed for utilizing unlabeled data with the success of supervised learning. Producing a dataset with good labels is expensive, while unlabeled data is being generated all the time. The motivation of Self-Supervised Learning is to make use of the large amount of unlabeled data. The main idea of Self-Supervised Learning is to generate the labels from unlabeled data, according to the structure or characteristics of the data itself, and then train on this unsupervised data in a supervised manner. Self-Supervised Learning is wildly used in representation learning to make a model learn the latent features of the data. This technique is often employed in computer vision, video processing and robot control.

Source: Self-supervised Point Set Local Descriptors for Point Cloud Registration

Image source: LeCun

Papers

Showing 201250 of 5044 papers

TitleStatusHype
Heart Failure Prediction using Modal Decomposition and Masked Autoencoders for Scarce Echocardiography DatabasesCode1
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene UnderstandingCode1
Evolutionary algorithms meet self-supervised learning: a comprehensive survey0
Dual Deep Learning Approach for Non-invasive Renal Tumour Subtyping with VERDICT-MRI0
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory PerceptionCode1
Leveraging Auto-Distillation and Generative Self-Supervised Learning in Residual Graph Transformers for Enhanced Recommender Systems0
A Self-Supervised Framework for Space Object Behaviour Characterisation0
Graph Neural Network-Based Distributed Optimal Control for Linear Networked Systems: An Online Distributed Training Approach0
Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos0
Bridging the Gap between Continuous and Informative Discrete Representations by Random Product Quantization0
Variational Self-Supervised Learning0
Multi-encoder nnU-Net outperforms Transformer models with self-supervised pretraining0
RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation0
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders0
MIMRS: A Survey on Masked Image Modeling in Remote Sensing0
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning0
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionCode1
Towards Computation- and Communication-efficient Computational Pathology0
SMILE: Infusing Spatial and Motion Semantics in Masked Video LearningCode1
Training Frozen Feature Pyramid DINOv2 for Eyelid Measurements with Infinite Encoding and Orthogonal RegularizationCode0
A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing0
Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction0
PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks0
AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection0
Federated Self-Supervised Learning for One-Shot Cross-Modal and Cross-Imaging Technique Segmentation0
M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAPCode0
Learning Library Cell Representations in Vector Space0
A Self-Supervised Learning of a Foundation Model for Analog Layout Design Automation0
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets0
Efficient Building Roof Type Classification: A Domain-Specific Self-Supervised Approach0
Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch BoostingCode0
Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications0
Learnable Sequence Augmenter for Triplet Contrastive Learning in Sequential Recommendation0
D2SA: Dual-Stage Distribution and Slice Adaptation for Efficient Test-Time Adaptation in MRI Reconstruction0
Social Network User Profiling for Anomaly Detection Based on Graph Neural Networks0
An Overview of Low-Rank Structures in the Training and Adaptation of Large Models0
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals0
Noisier2Inverse: Self-Supervised Learning for Image Reconstruction with Correlated NoiseCode0
Foundation Model for Whole-Heart Segmentation: Leveraging Student-Teacher Learning in Multi-Modal Medical Imaging0
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text RecognitionCode1
Do Your Best and Get Enough Rest for Continual LearningCode1
Self-Supervised Learning based on Transformed Image Reconstruction for Equivariance-Coherent Feature Representation0
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding0
HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications0
Interpretable Feature Interaction via Statistical Self-supervised Learning on Tabular Data0
PIM: Physics-Informed Multi-task Pre-training for Improving Inertial Sensor-Based Human Activity Recognition0
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language PretrainingCode3
Real-World Remote Sensing Image Dehazing: Benchmark and BaselineCode1
Meta-Representational Predictive Coding: Biomimetic Self-Supervised Learning0
Should we pre-train a decoder in contrastive learning for dense prediction tasks?0
Show:102550
← PrevPage 5 of 101Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pretraining: NoneImages & Text57.5Unverified
2Pretraining: ShEDImages & Text54.3Unverified
3Pretraining: e-MixImages & Text48.9Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50Accuracy91.7Unverified
2ResNet18Accuracy91.02Unverified
3MV-MRAccuracy89.67Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy93.89Unverified
2ResNet18average top-1 classification accuracy92.58Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy72.51Unverified
2ResNet18average top-1 classification accuracy69.31Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy82.64Unverified
2CorInfomax (ResNet18)Top-1 Accuracy80.48Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet50average top-1 classification accuracy51.84Unverified
2ResNet18average top-1 classification accuracy51.67Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy93.18Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet18)Top-1 Accuracy71.61Unverified
#ModelMetricClaimedVerifiedStatus
1Hybrid BYOL-S/CvTAccuracy67.2Unverified
#ModelMetricClaimedVerifiedStatus
1CorInfomax (ResNet50)Top-1 Accuracy54.86Unverified