Diffusion Models and Representation Learning: A Survey Jun 30, 2024 Denoising Representation Learning
Code Code Available 25 HASSOD: Hierarchical Adaptive Self-Supervised Object Detection Feb 5, 2024 Object object-detection
Code Code Available 25 DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment Jan 16, 2024 Disentanglement Self-Supervised Learning
Code Code Available 25 Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning Oct 24, 2022 GPU Self-Supervised Learning
Code Code Available 25 Multistain Pretraining for Slide Representation Learning in Pathology Aug 5, 2024 Representation Learning Self-Supervised Learning
Code Code Available 25 CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow Nov 18, 2022 Optical Flow Estimation Position
Code Code Available 25 InfMAE: A Foundation Model in the Infrared Modality Feb 1, 2024 Decoder Self-Supervised Learning
Code Code Available 25 Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions Apr 1, 2022 Self-Supervised Learning
Code Code Available 25 DiffMM: Multi-Modal Diffusion Model for Recommendation Jun 17, 2024 Contrastive Learning model
Code Code Available 25 LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
Code Code Available 25 Dynamic 3D Point Cloud Sequences as 2D Videos Mar 2, 2024 Action Recognition Self-Supervised Learning
Code Code Available 25 EMO-SUPERB: An In-depth Look at Speech Emotion Recognition Feb 20, 2024 Emotion Recognition Self-Supervised Learning
Code Code Available 25 An OpenMind for 3D medical vision self-supervised learning Dec 22, 2024 Benchmarking Self-Supervised Learning
Code Code Available 25 BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics Mar 15, 2024 Audio Classification Classification
Code Code Available 25 Deconstructing Denoising Diffusion Models for Self-Supervised Learning Jan 25, 2024 Denoising Image Generation
Code Code Available 25 Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing Jan 29, 2024 GPU Representation Learning
Code Code Available 25 Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image Super-Resolution with Subpixel Fusion May 7, 2022 Hyperspectral Image Super-Resolution Image Super-Resolution
Code Code Available 25 ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Sep 26, 2019 Common Sense Reasoning GPU
Code Code Available 25 CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding Mar 1, 2022 3D Object Classification 3D Point Cloud Linear Classification
Code Code Available 25 Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation Jul 19, 2024 Data Augmentation Depth Estimation
Code Code Available 25 Multi-Modal Self-Supervised Learning for Recommendation Feb 21, 2023 Contrastive Learning Data Augmentation
Code Code Available 25 DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks Sep 10, 2024 Contrastive Learning Image Reconstruction
Code Code Available 25 Contrastive Audio-Visual Masked Autoencoder Oct 2, 2022 Audio Classification Audio Tagging
Code Code Available 25 Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion Aug 15, 2020 Depth Estimation Motion Estimation
Code Code Available 25 Context Autoencoder for Self-Supervised Representation Learning Feb 7, 2022 Decoder Instance Segmentation
Code Code Available 25 Pengi: An Audio Language Model for Audio Tasks May 19, 2023 Audio captioning Audio Question Answering
Code Code Available 25 Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting Jan 2, 2023 3D Object Detection Motion Forecasting
Code Code Available 25 Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training May 28, 2022 3D Object Detection 3D Point Cloud Classification
Code Code Available 25 A Comprehensive Survey on Self-Supervised Learning for Recommendation Apr 4, 2024 Contrastive Learning Recommendation Systems
Code Code Available 25 A Foundation Model for Music Informatics Nov 6, 2023 Information Retrieval model
Code Code Available 25 Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation May 27, 2022 Contrastive Learning image-classification
Code Code Available 25 A Versatile Framework for Multi-scene Person Re-identification Mar 17, 2024 Data Augmentation Person Re-Identification
Code Code Available 25 Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation Aug 5, 2024 Rhythm Self-Supervised Learning
Code Code Available 25 CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model Mar 3, 2020 8k Language Modeling
Code Code Available 25 BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations Apr 15, 2022 Self-Supervised Learning
Code Code Available 25 A Multimodal Vision Foundation Model for Clinical Dermatology Oct 19, 2024 Diagnostic Lesion Segmentation
Code Code Available 25 DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation Dec 30, 2022 Font Generation Image-to-Image Translation
Code Code Available 25 Scaling up self-supervised learning for improved surgical foundation models Jan 16, 2025 Self-Supervised Learning Semantic Segmentation
Code Code Available 25 Self-Supervised Learning for Recommender Systems: A Survey Mar 29, 2022 Recommendation Systems Self-Supervised Learning
Code Code Available 25 SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields Dec 5, 2022 3D Reconstruction 3D Scene Reconstruction
Code Code Available 25 A Simple Framework for Contrastive Learning of Visual Representations Feb 13, 2020 Contrastive Learning Image Classification
Code Code Available 25 Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model Jun 14, 2022 Decision Making News Classification
Code Code Available 25 Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Jan 19, 2023 Depth Estimation Depth Prediction
Code Code Available 25 Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration Jan 23, 2024 3D Semantic Segmentation Autonomous Driving
Code Code Available 25 A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications Sep 26, 2024 EEG Self-Supervised Learning
Code Code Available 25 CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition Oct 18, 2023 Audio Classification Contrastive Learning
Code Code Available 15 Adaptive Graph Contrastive Learning for Recommendation May 18, 2023 Collaborative Filtering Contrastive Learning
Code Code Available 15 Automatically Discovering and Learning New Visual Categories with Ranking Statistics Feb 13, 2020 Clustering General Classification
Code Code Available 15 Automatic identification of segmentation errors for radiotherapy using geometric learning Jun 27, 2022 Graph Neural Network Self-Supervised Learning
Code Code Available 15 CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP Jan 12, 2023 3D Semantic Segmentation Contrastive Learning
Code Code Available 15