A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding Jun 17, 2024 Self-Supervised Learning Spoken Language Understanding
— Unverified 0DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Jun 17, 2024 3D geometry 3D Semantic Occupancy Prediction
— Unverified 0DiffMM: Multi-Modal Diffusion Model for Recommendation Jun 17, 2024 Contrastive Learning model
Code Code Available 2Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations? Jun 15, 2024 Self-Supervised Learning
— Unverified 0A Comprehensive Survey of Foundation Models in Medicine Jun 15, 2024 Graph Learning Medical Image Analysis
— Unverified 0How Should We Extract Discrete Audio Tokens from Self-Supervised Models? Jun 15, 2024 Quantization Self-Supervised Learning
— Unverified 0Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition Jun 15, 2024 Contrastive Learning Language Modeling
Code Code Available 1Self-Supervised and Few-Shot Learning for Robust Bioaerosol Monitoring Jun 14, 2024 Few-Shot Learning Self-Supervised Learning
— Unverified 0POWN: Prototypical Open-World Node Classification Jun 14, 2024 Classification Data Augmentation
Code Code Available 0Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection Jun 14, 2024 3D Object Detection Contrastive Learning
Code Code Available 0SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp segmentation Jun 14, 2024 Representation Learning Self-Supervised Learning
— Unverified 0T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity Computation Jun 13, 2024 Contrastive Learning Data Augmentation
— Unverified 0LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios Jun 13, 2024 Language Identification Self-Supervised Learning
Code Code Available 2You Don't Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning Jun 13, 2024 Data Augmentation Self-Supervised Learning
— Unverified 0Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations Jun 13, 2024 Self-Supervised Learning
— Unverified 0An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Jun 13, 2024 Image Generation Inductive Bias
— Unverified 0Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection Jun 12, 2024 Computational Efficiency Self-Supervised Learning
Code Code Available 2GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model Jun 12, 2024 Knowledge Distillation Self-Supervised Learning
— Unverified 0SCDNet: Self-supervised Learning Feature-based Speaker Change Detection Jun 12, 2024 Change Detection Contrastive Learning
— Unverified 0ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation Jun 12, 2024 Face Generation Self-Supervised Learning
— Unverified 0GraphFM: A Comprehensive Benchmark for Graph Foundation Model Jun 12, 2024 GPU Graph Neural Network
Code Code Available 0Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Jun 12, 2024 Self-Supervised Learning Visual Localization
— Unverified 0It is Never Too Late to Mend: Separate Learning for Multimedia Recommendation Jun 12, 2024 cross-modal alignment Multimedia recommendation
Code Code Available 0From Chaos to Clarity: 3DGS in the Dark Jun 12, 2024 3DGS Novel View Synthesis
— Unverified 0SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Jun 12, 2024 Image Segmentation Segmentation
Code Code Available 0Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations Jun 12, 2024 Contrastive Learning Emotion Recognition
— Unverified 0A deep cut into Split Federated Self-supervised Learning Jun 12, 2024 Federated Learning Self-Supervised Learning
Code Code Available 0Sustainable self-supervised learning for speech representations Jun 11, 2024 GPU Self-Supervised Learning
— Unverified 0Object-level Scene Deocclusion Jun 11, 2024 3D Scene Reconstruction Object
— Unverified 0Revolutionizing Wireless Networks with Self-Supervised Learning: A Pathway to Intelligent Communications Jun 11, 2024 Self-Supervised Learning Semantic Communication
— Unverified 0Visual Representation Learning with Stochastic Frame Prediction Jun 11, 2024 Decoder Pose Tracking
— Unverified 0FaceGPT: Self-supervised Learning to Chat about 3D Human Faces Jun 11, 2024 3D Face Reconstruction Face Model
— Unverified 0Higher-Order Spatial Information for Self-Supervised Place Cell Learning Jun 10, 2024 Navigate Self-Supervised Learning
— Unverified 0Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge Jun 10, 2024 Representation Learning Self-Supervised Learning
— Unverified 0Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data Jun 10, 2024 Drug Discovery Self-Supervised Learning
— Unverified 0NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networks Jun 10, 2024 Contrastive Learning Self-Supervised Learning
— Unverified 0Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation Jun 9, 2024 Heart Segmentation Segmentation
— Unverified 0ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations Jun 9, 2024 Self-Supervised Learning
— Unverified 0Provable Optimization for Adversarial Fair Self-supervised Contrastive Learning Jun 9, 2024 Attribute Contrastive Learning
— Unverified 0Weakly Supervised Set-Consistency Learning Improves Morphological Profiling of Single-Cell Images Jun 8, 2024 Self-Supervised Learning
Code Code Available 0Unlocking Telemetry Potential: Self-Supervised Learning for Continuous Clinical Electrocardiogram Monitoring Jun 7, 2024 Self-Supervised Learning
— Unverified 0Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition Jun 7, 2024 Emotion Recognition Self-Supervised Learning
— Unverified 0Time-Series JEPA for Predictive Remote Control under Capacity-Limited Networks Jun 7, 2024 Self-Supervised Learning Time Series
— Unverified 0On the social bias of speech self-supervised models Jun 7, 2024 Model Compression Self-Supervised Learning
— Unverified 0Denoising-Aware Contrastive Learning for Noisy Time Series Jun 7, 2024 Contrastive Learning Denoising
Code Code Available 1Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement Jun 7, 2024 Contrastive Learning Self-Supervised Learning
— Unverified 0The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning Jun 6, 2024 Anatomy Representation Learning
— Unverified 0