Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval May 26, 2025 Contrastive Learning cross-modal alignment
Code Code Available 1LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation May 26, 2025 Data Augmentation Domain Generalization
Code Code Available 1UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset May 21, 2025 Instance Segmentation Knowledge Distillation
Code Code Available 1PyTDC: A multimodal machine learning training, evaluation, and inference platform for biomedical foundation models May 8, 2025 Benchmarking Graph Representation Learning
Code Code Available 1fastabx: A library for efficient computation of ABX discriminability May 5, 2025 Representation Learning
Code Code Available 1SpectrumFM: A Foundation Model for Intelligent Spectrum Management May 2, 2025 Anomaly Detection Few-Shot Learning
Code Code Available 1Recursive KL Divergence Optimization: A Dynamic Framework for Representation Learning Apr 30, 2025 Contrastive Learning Dimensionality Reduction
Code Code Available 1TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation Apr 26, 2025 Imputation Multivariate Time Series Forecasting
Code Code Available 1Quadratic Interest Network for Multimodal Click-Through Rate Prediction Apr 24, 2025 Click-Through Rate Prediction Multimodal Recommendation
Code Code Available 1PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning Apr 22, 2025 parameter-efficient fine-tuning Representation Learning
Code Code Available 1Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification Apr 21, 2025 Exemplar-Free Knowledge Distillation
Code Code Available 1Mitigating Degree Bias in Graph Representation Learning with Learnable Structural Augmentation and Structural Self-Attention Apr 21, 2025 Fairness Graph Representation Learning
Code Code Available 1CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Apr 18, 2025 Common Sense Reasoning image-classification
Code Code Available 1NetTAG: A Multimodal RTL-and-Layout-Aligned Netlist Foundation Model via Text-Attributed Graph Apr 12, 2025 Graph Learning Representation Learning
Code Code Available 1Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging Apr 11, 2025 Attribute Computational Efficiency
Code Code Available 1Robo-taxi Fleet Coordination at Scale via Reinforcement Learning Apr 8, 2025 Computational Efficiency Graph Representation Learning
Code Code Available 1COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal Recommendation Apr 6, 2025 Multimodal Recommendation Representation Learning
Code Code Available 1Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry Apr 1, 2025 Representation Learning
Code Code Available 1SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning Apr 1, 2025 Representation Learning Self-Supervised Learning
Code Code Available 1MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Apr 1, 2025 Image Generation Image Reconstruction
Code Code Available 1Pluggable Style Representation Learning for Multi-Style Transfer Mar 26, 2025 Representation Learning Style Transfer
Code Code Available 1EditCLIP: Representation Learning for Image Editing Mar 26, 2025 Representation Learning
Code Code Available 1CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Mar 25, 2025 Hallucination Language Modeling
Code Code Available 1MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning Mar 24, 2025 parameter-efficient fine-tuning Representation Learning
Code Code Available 1HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving Mar 22, 2025 Autonomous Driving Representation Learning
Code Code Available 1When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning Mar 19, 2025 Representation Learning Self-Supervised Learning
Code Code Available 1Advancing Medical Representation Learning Through High-Quality Data Mar 18, 2025 Representation Learning zero-shot-classification
Code Code Available 1Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement Mar 12, 2025 Graph Representation Learning Node Classification
Code Code Available 1REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Mar 10, 2025 Instruction Following Keypoint Detection
Code Code Available 1Dynamic Dictionary Learning for Remote Sensing Image Segmentation Mar 9, 2025 Dictionary Learning Image Segmentation
Code Code Available 1Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning Mar 8, 2025 Deep Reinforcement Learning Representation Learning
Code Code Available 1Improve Representation for Imbalanced Regression through Geometric Constraints Mar 2, 2025 Operator learning regression
Code Code Available 1Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning Mar 2, 2025 Large Language Model Multi-Instance Retrieval
Code Code Available 1MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention Mar 1, 2025 Clustering Representation Learning
Code Code Available 1Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud Denoising Feb 27, 2025 Denoising Representation Learning
Code Code Available 1EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training Feb 26, 2025 Mamba Representation Learning
Code Code Available 1Escaping The Big Data Paradigm in Self-Supervised Representation Learning Feb 25, 2025 Representation Learning
Code Code Available 1Understanding the Emergence of Multimodal Representation Alignment Feb 22, 2025 Representation Learning
Code Code Available 1RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm Feb 18, 2025 Representation Learning Retrieval
Code Code Available 1Myna: Masking-Based Contrastive Learning of Musical Representations Feb 18, 2025 Contrastive Learning Data Augmentation
Code Code Available 1Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning Feb 17, 2025 Audio Classification Audio Tagging
Code Code Available 1Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model Feb 15, 2025 Language Modeling Language Modelling
Code Code Available 1AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting Feb 14, 2025 Multivariate Time Series Forecasting Representation Learning
Code Code Available 1JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata Feb 11, 2025 Language Modeling Language Modelling
Code Code Available 1RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning Feb 10, 2025 Language Modeling Language Modelling
Code Code Available 1From Pixels to Components: Eigenvector Masking for Visual Representation Learning Feb 10, 2025 image-classification Image Classification
Code Code Available 1Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning Feb 8, 2025 Graph Attention Representation Learning
Code Code Available 1Intent Representation Learning with Large Language Model for Recommendation Feb 5, 2025 Language Modeling Language Modelling
Code Code Available 1Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification Feb 4, 2025 Cell Segmentation Decoder
Code Code Available 1