Structurally Different Neural Network Blocks for the Segmentation of Atrial and Aortic Perivascular Adipose Tissue in Multi-centre CT Angiography Scans | Jun 6, 2023 | Diagnostic Image Segmentation | Unverified | 0
DenseDINO: Boosting Dense Self-Supervised Learning with Token-Based Point-Level Consistency | Jun 6, 2023 | Position Segmentation | Unverified | 0
Subgraph Networks Based Contrastive Learning | Jun 6, 2023 | Attribute Contrastive Learning | Unverified | 0
Green Steganalyzer: A Green Learning Approach to Image Steganalysis | Jun 6, 2023 | Self-Supervised Learning Steganalysis | Unverified | 0
OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition | Jun 5, 2023 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | Unverified | 0
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System | Jun 5, 2023 | Multi-Task Learning Representation Learning | Unverified | 0
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work | Jun 2, 2023 | Fine-Grained Visual Recognition Person Re-Identification | Unverified | 0
HomE: Homography-Equivariant Video Representation Learning | Jun 2, 2023 | Action Classification Action Recognition | Code Available | 0
Masked Autoencoder for Unsupervised Video Summarization | Jun 2, 2023 | Decoder Self-Supervised Learning | Unverified | 0
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right? | Jun 1, 2023 | Benchmarking Decoder | Code Available | 0
On the Robustness of Arabic Speech Dialect Identification | Jun 1, 2023 | Dialect Identification Self-Supervised Learning | Unverified | 0
Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations | Jun 1, 2023 | Data Augmentation Domain Adaptation | Unverified | 0
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling | Jun 1, 2023 | Data Augmentation Self-Supervised Learning | Code Available | 0
Some voices are too common: Building fair speech recognition systems using the Common Voice dataset | Jun 1, 2023 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | Unverified | 0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation | Jun 1, 2023 | automatic-speech-translation Cross-Lingual Transfer | Unverified | 0
Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Jun 1, 2023 | Contrastive Learning Data Augmentation | Unverified | 0
Feature Learning in Image Hierarchies using Functional Maximal Correlation | May 31, 2023 | Self-Supervised Learning | Unverified | 0
Augmentation-aware Self-supervised Learning with Conditioned Projector | May 31, 2023 | Self-Supervised Learning Sensitivity | Code Available | 0
There is more to graphs than meets the eye: Learning universal features with self-supervision | May 31, 2023 | Node Classification Representation Learning | Unverified | 0
Spectal Harmonics: Bridging Spectral Embedding and Matrix Completion in Self-Supervised Learning | May 31, 2023 | Inductive Bias Low-Rank Matrix Completion | Unverified | 0
Additional Positive Enables Better Representation Learning for Medical Images | May 31, 2023 | Representation Learning Self-Supervised Learning | Unverified | 0
Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | May 31, 2023 | Retrieval Self-Supervised Learning | Unverified | 0
SSL-CPCD: Self-supervised learning with composite pretext-class discrimination for improved generalisability in endoscopic image analysis | May 31, 2023 | Medical Image Analysis Self-Supervised Learning | Unverified | 0
Quantifying Representation Reliability in Self-Supervised Learning Models | May 31, 2023 | Self-Supervised Learning Uncertainty Quantification | Code Available | 0
MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models | May 30, 2023 | Self-Supervised Learning | Code Available | 0
Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models | May 30, 2023 | Emotion Recognition Self-Supervised Learning | Unverified | 0
Learning Off-Road Terrain Traversability with Self-Supervisions Only | May 30, 2023 | Autonomous Driving One-Class Classification | Unverified | 0
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) Representations | May 29, 2023 | Few-Shot Audio Classification Inductive Bias | Code Available | 0
Self-Supervised Learning of Action Affordances as Interaction Modes | May 27, 2023 | Object Self-Supervised Learning | Unverified | 0
Unsupervised Embedding Quality Evaluation | May 26, 2023 | Self-Supervised Learning | Unverified | 0
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance | May 25, 2023 | Decoder image-classification | Unverified | 0
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition | May 25, 2023 | Denoising Self-Supervised Learning | Unverified | 0
Generalizable Low-Resource Activity Recognition with Diverse and Discriminative Representation Learning | May 25, 2023 | Activity Recognition Contrastive Learning | Unverified | 0
MPE4G: Multimodal Pretrained Encoder for Co-Speech Gesture Generation | May 25, 2023 | Gesture Generation Self-Supervised Learning | Unverified | 0
Reverse Engineering Self-Supervised Learning | May 24, 2023 | Clustering Representation Learning | Unverified | 0
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss | May 24, 2023 | Self-Supervised Learning Speech Enhancement | Unverified | 0
Delving Deeper into Data Scaling in Masked Image Modeling | May 24, 2023 | Self-Supervised Learning | Unverified | 0
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model | May 24, 2023 | Self-Supervised Learning | Unverified | 0
Collaborative Auto-encoding for Blind Image Quality Assessment | May 24, 2023 | Decoder Descriptive | Code Available | 0
Difference-Masking: Choosing What to Mask in Continued Pretraining | May 23, 2023 | Self-Supervised Learning | Code Available | 0
Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation | May 23, 2023 | Denoising Knowledge Distillation | Unverified | 0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | May 23, 2023 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | Unverified | 0
An Autoencoder-based Snow Drought Index | May 23, 2023 | Self-Supervised Learning | Unverified | 0
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers? | May 23, 2023 | Caller Detection Self-Supervised Learning | Code Available | 0
Atomic and Subgraph-aware Bilateral Aggregation for Molecular Representation Learning | May 22, 2023 | Molecular Property Prediction molecular representation | Unverified | 0
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer | May 22, 2023 | Decoder Denoising | Unverified | 0
EnSiam: Self-Supervised Learning With Ensemble Representations | May 22, 2023 | Contrastive Learning Knowledge Distillation | Unverified | 0
Contrastive Predictive Autoencoders for Dynamic Point Cloud Self-Supervised Learning | May 22, 2023 | Action Recognition Colorization | Unverified | 0
Self-supervised representations in speech-based depression detection | May 20, 2023 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | Unverified | 0
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis | May 19, 2023 | Self-Supervised Learning | Unverified | 0