Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram Feb 2, 2024 Diagnostic ECG Classification
Code Code Available 25 MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset Jun 29, 2023 Image Segmentation Medical Image Segmentation
Code Code Available 25 GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Code Code Available 25 AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning Apr 12, 2022 Activity Recognition Domain Adaptation
Code Code Available 25 Multiview Compressive Coding for 3D Reconstruction Jan 19, 2023 3D Reconstruction Decoder
Code Code Available 25 GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner Apr 10, 2023 Self-Supervised Learning
Code Code Available 25 OmniSat: Self-Supervised Modality Fusion for Earth Observation Apr 12, 2024 Diversity Earth Observation
Code Code Available 25 PaPaGei: Open Foundation Models for Optical Physiological Signals Oct 27, 2024 Contrastive Learning Domain Generalization
Code Code Available 25 PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders Aug 16, 2024 3D Object Classification 3D Point Cloud Classification
Code Code Available 25 Pengi: An Audio Language Model for Audio Tasks May 19, 2023 Audio captioning Audio Question Answering
Code Code Available 25 HASSOD: Hierarchical Adaptive Self-Supervised Object Detection Feb 5, 2024 Object object-detection
Code Code Available 25 Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection Sep 26, 2024 Event Detection Representation Learning
Code Code Available 25 InfMAE: A Foundation Model in the Infrared Modality Feb 1, 2024 Decoder Self-Supervised Learning
Code Code Available 25 Masked Modeling for Self-supervised Representation Learning on Vision and Beyond Dec 31, 2023 Representation Learning Self-Supervised Learning
Code Code Available 25 Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders Aug 19, 2023 Inductive Bias Motion Forecasting
Code Code Available 25 A Simple Framework for Contrastive Learning of Visual Representations Feb 13, 2020 Contrastive Learning Image Classification
Code Code Available 25 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Mar 13, 2025 Computational Efficiency Mamba
Code Code Available 25 FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning Dec 16, 2024 DeepFake Detection diffusion-generated faces detection
Code Code Available 25 Equivariant Multi-Modality Image Fusion May 19, 2023 Self-Supervised Learning
Code Code Available 25 Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision Jul 25, 2024 Diversity Medical Image Analysis
Code Code Available 25 SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation Apr 2, 2024 3D Pose Estimation Pose Estimation
Code Code Available 25 Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification Aug 13, 2022 Contrastive Learning Data Augmentation
Code Code Available 25 Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects Jun 16, 2023 Anomaly Detection Self-Supervised Learning
Code Code Available 25 Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Jan 19, 2023 Depth Estimation Depth Prediction
Code Code Available 25 Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask May 9, 2024 Anomaly Detection Imputation
Code Code Available 25 Self-Supervised Log Parsing Mar 17, 2020 Anomaly Detection Fault Detection
Code Code Available 25 GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Nov 19, 2024 3D Object Detection Autonomous Driving
Code Code Available 25 EMO-SUPERB: An In-depth Look at Speech Emotion Recognition Feb 20, 2024 Emotion Recognition Self-Supervised Learning
Code Code Available 25 A generalizable 3D framework and model for self-supervised learning in medical imaging Jan 20, 2025 Medical Image Segmentation Self-Supervised Learning
Code Code Available 25 Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders Mar 26, 2024 Object Self-Supervised Learning
Code Code Available 25 A Foundation Model for Music Informatics Nov 6, 2023 Information Retrieval model
Code Code Available 25 Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection Jun 12, 2024 Computational Efficiency Self-Supervised Learning
Code Code Available 25 EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones Nov 17, 2022 Data Augmentation Self-Supervised Learning
Code Code Available 25 EMP-SSL: Towards Self-Supervised Learning in One Training Epoch Apr 8, 2023 Quantization Self-Supervised Learning
Code Code Available 25 DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment Jan 16, 2024 Disentanglement Self-Supervised Learning
Code Code Available 25 DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation Dec 30, 2022 Font Generation Image-to-Image Translation
Code Code Available 25 DiffMM: Multi-Modal Diffusion Model for Recommendation Jun 17, 2024 Contrastive Learning model
Code Code Available 25 Dynamic 3D Point Cloud Sequences as 2D Videos Mar 2, 2024 Action Recognition Self-Supervised Learning
Code Code Available 25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Jan 25, 2024 Denoising Image Generation
Code Code Available 25 Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing Jan 29, 2024 GPU Representation Learning
Code Code Available 25 Diffusion Models and Representation Learning: A Survey Jun 30, 2024 Denoising Representation Learning
Code Code Available 25 DM-Codec: Distilling Multimodal Representations for Speech Tokenization Oct 19, 2024 Self-Supervised Learning Speech Tokenization
Code Code Available 25 Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel Fusion May 7, 2022 Hyperspectral Image Super-Resolution Image Super-Resolution
Code Code Available 25 Attention Mechanisms in Computer Vision: A Survey Nov 15, 2021 image-classification Image Classification
Code Code Available 25 A Multimodal Vision Foundation Model for Clinical Dermatology Oct 19, 2024 Diagnostic Lesion Segmentation
Code Code Available 25 A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications Sep 26, 2024 EEG Self-Supervised Learning
Code Code Available 25 Multistain Pretraining for Slide Representation Learning in Pathology Aug 5, 2024 Representation Learning Self-Supervised Learning
Code Code Available 25 A Survey on Mixup Augmentations and Beyond Sep 8, 2024 Image Classification Self-Supervised Learning
Code Code Available 25 DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks Sep 10, 2024 Contrastive Learning Image Reconstruction
Code Code Available 25 Contrastive Audio-Visual Masked Autoencoder Oct 2, 2022 Audio Classification Audio Tagging
Code Code Available 25