Cross-view Masked Diffusion Transformers for Person Image Synthesis Feb 2, 2024 Denoising Image Generation
Code Code Available 2Graph Domain Adaptation: Challenges, Progress and Prospects Feb 1, 2024 Domain Adaptation GRAPH DOMAIN ADAPTATION
Code Code Available 2Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing Jan 29, 2024 GPU Representation Learning
Code Code Available 2Deconstructing Denoising Diffusion Models for Self-Supervised Learning Jan 25, 2024 Denoising Image Generation
Code Code Available 2Rethinking Patch Dependence for Masked Autoencoders Jan 25, 2024 Decoder Instance Segmentation
Code Code Available 2Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Jan 17, 2024 GPU Image Classification
Code Code Available 2HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition Jan 11, 2024 Contrastive Learning Dynamic Facial Expression Recognition
Code Code Available 2End-to-end Learnable Clustering for Intent Learning in Recommendation Jan 11, 2024 Clustering Contrastive Learning
Code Code Available 2Singer Identity Representation Learning using Self-Supervised Techniques Jan 10, 2024 Domain Generalization Representation Learning
Code Code Available 2Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry Jan 7, 2024 Data Augmentation Drug Discovery
Code Code Available 2Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions Jan 4, 2024 Representation Learning Survey
Code Code Available 2ChangeCLIP: Remote sensing change detection with multimodal vision-language representation learning Jan 4, 2024 Change Detection Decoder
Code Code Available 2Masked Modeling for Self-supervised Representation Learning on Vision and Beyond Dec 31, 2023 Representation Learning Self-Supervised Learning
Code Code Available 2Learning Vision from Models Rivals Learning Vision from Data Dec 28, 2023 Contrastive Learning Image Captioning
Code Code Available 2One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts Dec 28, 2023 All Anatomy
Code Code Available 2DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision Dec 26, 2023 Deep Learning NeRF
Code Code Available 2FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning Dec 19, 2023 Contrastive Learning Denoising
Code Code Available 2BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics Dec 12, 2023 Information Retrieval Representation Learning
Code Code Available 2Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding Nov 15, 2023 Highlight Detection Moment Retrieval
Code Code Available 2SpectralGPT: Spectral Remote Sensing Foundation Model Nov 13, 2023 Change Detection model
Code Code Available 2High-Performance Transformers for Table Structure Recognition Need Early Convolutions Nov 9, 2023 Decoder Representation Learning
Code Code Available 2Representation Learning with Large Language Models for Recommendation Oct 24, 2023 Recommendation Systems Representation Learning
Code Code Available 2Pre-training Music Classification Models via Music Source Separation Oct 24, 2023 Classification Genre classification
Code Code Available 2DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation Sep 18, 2023 3D geometry Decoder
Code Code Available 2UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation Aug 15, 2023 3D Object Detection Autonomous Driving
Code Code Available 2Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks Aug 13, 2023 Graph Representation Learning Link Prediction
Code Code Available 2YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection Aug 10, 2023 Object object-detection
Code Code Available 2PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning Aug 8, 2023 Representation Learning
Code Code Available 2Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training Jul 31, 2023 Organ Segmentation Representation Learning
Code Code Available 2Hierarchical Open-vocabulary Universal Image Segmentation Jul 3, 2023 Image Comprehension Image Segmentation
Code Code Available 2DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection Jun 17, 2023 Anomaly Detection Contrastive Learning
Code Code Available 2Segment Any Point Cloud Sequences by Distilling Vision Foundation Models Jun 15, 2023 Representation Learning Transfer Learning
Code Code Available 2Fast Training of Diffusion Models with Masked Transformers Jun 15, 2023 Decoder Denoising
Code Code Available 2TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting Jun 14, 2023 Multivariate Time Series Forecasting Representation Learning
Code Code Available 2FasterViT: Fast Vision Transformers with Hierarchical Attention Jun 9, 2023 Image Classification object-detection
Code Code Available 2MolFM: A Multimodal Molecular Foundation Model Jun 6, 2023 Cross-Modal Retrieval Knowledge Graphs
Code Code Available 2A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics Jun 1, 2023 Diagnostic Representation Learning
Code Code Available 2Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning May 31, 2023 Decision Making General Knowledge
Code Code Available 2Dink-Net: Neural Clustering on Large Graphs May 28, 2023 Clustering Graph Clustering
Code Code Available 2A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence May 24, 2023 Dense Pixel Correspondence Estimation Representation Learning
Code Code Available 2Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks May 17, 2023 Graph Classification Graph Representation Learning
Code Code Available 2ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding May 14, 2023 3D Classification 3D Point Cloud Classification
Code Code Available 2TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding May 1, 2023 3D Object Detection Monocular Depth Estimation
Code Code Available 2NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation Learning Apr 28, 2023 Graph Representation Learning Knowledge Graphs
Code Code Available 2Unicom: Universal and Compact Representation Learning for Image Retrieval Apr 12, 2023 Image Classification Image Retrieval
Code Code Available 2Counterfactual Learning on Graphs: A Survey Apr 3, 2023 counterfactual Fairness
Code Code Available 2Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation Apr 3, 2023 3D Object Detection Autonomous Driving
Code Code Available 2Hierarchical Fine-Grained Image Forgery Detection and Localization Mar 30, 2023 Attribute Classification
Code Code Available 2A Systematic Study of Joint Representation Learning on Protein Sequences and Structures Mar 11, 2023 Contrastive Learning Protein Function Prediction
Code Code Available 2Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners Mar 3, 2023 Few-Shot Learning Representation Learning
Code Code Available 2