Pushing the limits of raw waveform speaker recognition Mar 16, 2022 Self-Supervised Learning Speaker Recognition
Masked Siamese Networks for Label-Efficient Learning Apr 14, 2022 Image Classification
A Survey on Mixup Augmentations and Beyond Sep 8, 2024 Image Classification Self-Supervised Learning
Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology Apr 16, 2024 Drug Discovery Self-Supervised Learning
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications Sep 26, 2024 EEG Self-Supervised Learning
Lightweight, Pre-trained Transformers for Remote Sensing Timeseries Apr 27, 2023 Crop Classification Self-Supervised Learning
Low-resource finetuning of foundation models beats state-of-the-art in histopathology Jan 9, 2024 GPU Self-Supervised Learning
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV Mar 3, 2024 Depth Estimation Monocular Depth Estimation
A Simple Framework for Contrastive Learning of Visual Representations Feb 13, 2020 Contrastive Learning Image Classification
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Oct 30, 2024 General Reinforcement Learning Reinforcement Learning (RL)
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond Dec 31, 2023 Representation Learning Self-Supervised Learning
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Sep 26, 2019 Common Sense Reasoning GPU
InfMAE: A Foundation Model in the Infrared Modality Feb 1, 2024 Decoder Self-Supervised Learning
Interpretable RNA Foundation Model from Unannotated Data for Highly Accurate RNA Structure and Function Predictions Apr 1, 2022 Self-Supervised Learning
Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation Jan 1, 2024 General Knowledge Navigate
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning Oct 24, 2022 GPU Self-Supervised Learning
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow Nov 18, 2022 Optical Flow Estimation Position
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition Jan 11, 2024 Contrastive Learning Dynamic Facial Expression Recognition
Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model Jun 14, 2022 Decision Making News Classification
GraphMAE: Self-Supervised Masked Graph Autoencoders May 22, 2022 Contrastive Learning Graph Classification
High-Performance Transformers for Table Structure Recognition Need Early Convolutions Nov 9, 2023 Decoder Representation Learning
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers Apr 20, 2022 Disentanglement Self-Supervised Learning
MedIAnomaly: A comparative study of anomaly detection in medical images Apr 6, 2024 Anomaly Classification Anomaly Detection
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Nov 19, 2024 3D Object Detection Autonomous Driving
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders Aug 19, 2023 Inductive Bias Motion Forecasting
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning Dec 16, 2024 DeepFake Detection diffusion-generated faces detection
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision Jul 25, 2024 Diversity Medical Image Analysis
EMP-SSL: Towards Self-Supervised Learning in One Training Epoch Apr 8, 2023 Quantization Self-Supervised Learning
Equivariant Multi-Modality Image Fusion May 19, 2023 Self-Supervised Learning
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones Nov 17, 2022 Data Augmentation Self-Supervised Learning
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders Mar 26, 2024 Object Self-Supervised Learning
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment Jan 16, 2024 Disentanglement Self-Supervised Learning
DM-Codec: Distilling Multimodal Representations for Speech Tokenization Oct 19, 2024 Self-Supervised Learning Speech Tokenization
Dynamic 3D Point Cloud Sequences as 2D Videos Mar 2, 2024 Action Recognition Self-Supervised Learning
DiffMM: Multi-Modal Diffusion Model for Recommendation Jun 17, 2024 Contrastive Learning model
An OpenMind for 3D medical vision self-supervised learning Dec 22, 2024 Benchmarking Self-Supervised Learning
Diffusion Models and Representation Learning: A Survey Jun 30, 2024 Denoising Representation Learning
A Comprehensive Survey on Self-Supervised Learning for Recommendation Apr 4, 2024 Contrastive Learning Recommendation Systems
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition Feb 20, 2024 Emotion Recognition Self-Supervised Learning
GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing Jan 29, 2024 GPU Representation Learning
A Foundation Model for Music Informatics Nov 6, 2023 Information Retrieval model
Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting Jan 2, 2023 3D Object Detection Motion Forecasting
Deconstructing Denoising Diffusion Models for Self-Supervised Learning Jan 25, 2024 Denoising Image Generation
A Multimodal Vision Foundation Model for Clinical Dermatology Oct 19, 2024 Diagnostic Lesion Segmentation
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram Feb 2, 2024 Diagnostic ECG Classification
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection Feb 5, 2024 Object object-detection
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation May 27, 2022 Contrastive Learning image-classification
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding Mar 1, 2022 3D Object Classification 3D Point Cloud Linear Classification