SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation Apr 2, 2024 3D Pose Estimation Pose Estimation
Code Code Available 2Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders Mar 26, 2024 Object Self-Supervised Learning
Code Code Available 2Towards Large-Scale Training of Pathology Foundation Models Mar 24, 2024 Nuclear Segmentation Self-Supervised Learning
Code Code Available 2Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs Mar 19, 2024 Few-Shot Learning Self-Supervised Learning
Code Code Available 2A Versatile Framework for Multi-scene Person Re-identification Mar 17, 2024 Data Augmentation Person Re-Identification
Code Code Available 2BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics Mar 15, 2024 Audio Classification Classification
Code Code Available 2Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement Mar 11, 2024 Clinical Knowledge Descriptive
Code Code Available 2Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV Mar 3, 2024 Depth Estimation Monocular Depth Estimation
Code Code Available 2Dynamic 3D Point Cloud Sequences as 2D Videos Mar 2, 2024 Action Recognition Self-Supervised Learning
Code Code Available 2EMO-SUPERB: An In-depth Look at Speech Emotion Recognition Feb 20, 2024 Emotion Recognition Self-Supervised Learning
Code Code Available 2HASSOD: Hierarchical Adaptive Self-Supervised Object Detection Feb 5, 2024 Object object-detection
Code Code Available 2Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram Feb 2, 2024 Diagnostic ECG Classification
Code Code Available 2InfMAE: A Foundation Model in the Infrared Modality Feb 1, 2024 Decoder Self-Supervised Learning
Code Code Available 2Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing Jan 29, 2024 GPU Representation Learning
Code Code Available 2Deconstructing Denoising Diffusion Models for Self-Supervised Learning Jan 25, 2024 Denoising Image Generation
Code Code Available 2Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration Jan 23, 2024 3D Semantic Segmentation Autonomous Driving
Code Code Available 2DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment Jan 16, 2024 Disentanglement Self-Supervised Learning
Code Code Available 2HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition Jan 11, 2024 Contrastive Learning Dynamic Facial Expression Recognition
Code Code Available 2Singer Identity Representation Learning using Self-Supervised Techniques Jan 10, 2024 Domain Generalization Representation Learning
Code Code Available 2Low-resource finetuning of foundation models beats state-of-the-art in histopathology Jan 9, 2024 GPU Self-Supervised Learning
Code Code Available 2PhilEO Bench: Evaluating Geo-Spatial Foundation Models Jan 9, 2024 Density Estimation Earth Observation
Code Code Available 2Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation Jan 1, 2024 General Knowledge Navigate
Code Code Available 2Masked Modeling for Self-supervised Representation Learning on Vision and Beyond Dec 31, 2023 Representation Learning Self-Supervised Learning
Code Code Available 2PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains Dec 15, 2023 Self-Supervised Learning
Code Code Available 2High-Performance Transformers for Table Structure Recognition Need Early Convolutions Nov 9, 2023 Decoder Representation Learning
Code Code Available 2A Foundation Model for Music Informatics Nov 6, 2023 Information Retrieval model
Code Code Available 2Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Oct 30, 2023 Benchmarking object-detection
Code Code Available 2GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Code Code Available 2UniPAD: A Universal Pre-training Paradigm for Autonomous Driving Oct 12, 2023 3D Object Detection 3D Semantic Segmentation
Code Code Available 2Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders Aug 19, 2023 Inductive Bias Motion Forecasting
Code Code Available 2SSLRec: A Self-Supervised Learning Framework for Recommendation Aug 10, 2023 Collaborative Filtering Data Augmentation
Code Code Available 2MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated Dataset Jun 29, 2023 Image Segmentation Medical Image Segmentation
Code Code Available 2RemoteCLIP: A Vision Language Foundation Model for Remote Sensing Jun 19, 2023 Classification Cross-Modal Retrieval
Code Code Available 2Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects Jun 16, 2023 Anomaly Detection Self-Supervised Learning
Code Code Available 2TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting Jun 14, 2023 Multivariate Time Series Forecasting Representation Learning
Code Code Available 2MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training May 31, 2023 Language Modelling Quantization
Code Code Available 2Pengi: An Audio Language Model for Audio Tasks May 19, 2023 Audio captioning Audio Question Answering
Code Code Available 2Equivariant Multi-Modality Image Fusion May 19, 2023 Self-Supervised Learning
Code Code Available 2Lightweight, Pre-trained Transformers for Remote Sensing Timeseries Apr 27, 2023 Crop Classification Self-Supervised Learning
Code Code Available 2Very high resolution canopy height maps from RGB imagery using self-supervised vision transformer and convolutional decoder trained on Aerial Lidar Apr 14, 2023 Decoder Self-Supervised Learning
Code Code Available 2GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner Apr 10, 2023 Self-Supervised Learning
Code Code Available 2Slideflow: Deep Learning for Digital Histopathology with Real-Time Whole-Slide Visualization Apr 9, 2023 Deep Learning Histopathological Image Classification
Code Code Available 2EMP-SSL: Towards Self-Supervised Learning in One Training Epoch Apr 8, 2023 Quantization Self-Supervised Learning
Code Code Available 2Self-Supervised Multimodal Learning: A Survey Mar 31, 2023 Machine Translation Self-Supervised Learning
Code Code Available 2Automated Self-Supervised Learning for Recommendation Mar 14, 2023 Collaborative Filtering Contrastive Learning
Code Code Available 2Stabilizing Transformer Training by Preventing Attention Entropy Collapse Mar 11, 2023 Automatic Speech Recognition image-classification
Code Code Available 2Towards Democratizing Joint-Embedding Self-Supervised Learning Mar 3, 2023 Data Augmentation Misconceptions
Code Code Available 2Multi-Modal Self-Supervised Learning for Recommendation Feb 21, 2023 Contrastive Learning Data Augmentation
Code Code Available 2ClimaX: A foundation model for weather and climate Jan 24, 2023 model Self-Supervised Learning
Code Code Available 2Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture Jan 19, 2023 Depth Estimation Depth Prediction
Code Code Available 2