QAEncoder: Towards Aligned Representation Learning in Question Answering System Sep 30, 2024 Document Embedding Question Answering
Code Code Available 2Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection Sep 26, 2024 Event Detection Representation Learning
Code Code Available 2Progressive Representation Learning for Real-Time UAV Tracking Sep 25, 2024 Object Object Tracking
Code Code Available 2SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Aug 15, 2024 Continual Learning image-classification
Code Code Available 2Multistain Pretraining for Slide Representation Learning in Pathology Aug 5, 2024 Representation Learning Self-Supervised Learning
Code Code Available 2NAVIX: Scaling MiniGrid Environments with JAX Jul 28, 2024 CPU Deep Reinforcement Learning
Code Code Available 2Contrastive Learning of Asset Embeddings from Financial Time Series Jul 26, 2024 Contrastive Learning Management
Code Code Available 2Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation Jul 26, 2024 Knowledge Distillation Question Answering
Code Code Available 2Representation Learning and Identity Adversarial Training for Facial Behavior Understanding Jul 15, 2024 Facial Action Unit Detection Facial Expression Recognition (FER)
Code Code Available 2Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation Jul 11, 2024 object-detection Object Detection
Code Code Available 2PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer Jul 10, 2024 Decoder Handwritten Mathmatical Expression Recognition
Code Code Available 2TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data Jul 10, 2024 Contrastive Learning multimodal interaction
Code Code Available 24D Contrastive Superflows are Dense 3D Representation Learners Jul 8, 2024 Autonomous Driving Contrastive Learning
Code Code Available 2HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient Tuning Jul 7, 2024 Continual Learning Representation Learning
Code Code Available 2Diffusion Models and Representation Learning: A Survey Jun 30, 2024 Denoising Representation Learning
Code Code Available 2TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning Jun 21, 2024 Fairness Geographic Question Answering
Code Code Available 2Duoduo CLIP: Efficient 3D Understanding with Multi-View Images Jun 17, 2024 GPU Object
Code Code Available 2DiffMM: Multi-Modal Diffusion Model for Recommendation Jun 17, 2024 Contrastive Learning model
Code Code Available 2DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer Jun 13, 2024 Face Image Quality Face Image Quality Assessment
Code Code Available 2DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer Jun 12, 2024 Image Dehazing Nonhomogeneous Image Dehazing
Code Code Available 2RWKV-CLIP: A Robust Vision-Language Representation Learner Jun 11, 2024 Image-text Retrieval Representation Learning
Code Code Available 2Audio Mamba: Bidirectional State Space Model for Audio Representation Learning Jun 5, 2024 Audio Classification Classification
Code Code Available 2Learning Manipulation by Predicting Interaction Jun 1, 2024 Representation Learning
Code Code Available 2Matryoshka Query Transformer for Large Vision-Language Models May 29, 2024 Language Modelling Representation Learning
Code Code Available 2ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention May 28, 2024 GPU Representation Learning
Code Code Available 2SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals May 28, 2024 Contrastive Learning Representation Learning
Code Code Available 2SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model May 20, 2024 Audio Classification GPU
Code Code Available 2Transcriptomics-guided Slide Representation Learning in Computational Pathology May 19, 2024 Contrastive Learning Representation Learning
Code Code Available 2Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask May 9, 2024 Anomaly Detection Imputation
Code Code Available 2MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning May 4, 2024 Earth Observation image-classification
Code Code Available 2Benchmarking Representations for Speech, Music, and Acoustic Events May 2, 2024 Audio Classification Benchmarking
Code Code Available 2Vim4Path: Self-Supervised Vision Mamba for Histopathology Images Apr 20, 2024 Diagnostic Mamba
Code Code Available 2ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer Apr 18, 2024 Image Shadow Removal object-detection
Code Code Available 2VideoSAGE: Video Summarization with Graph Representation Learning Apr 14, 2024 Graph Representation Learning Node Classification
Code Code Available 2MindBridge: A Cross-Subject Brain Decoding Framework Apr 11, 2024 Brain Decoding Data Augmentation
Code Code Available 2Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study Apr 10, 2024 Representation Learning Time Series
Code Code Available 2NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields Apr 1, 2024 3D Object Detection NeRF
Code Code Available 2Omni-Kernel Network for Image Restoration Mar 24, 2024 Deblurring Image Defocus Deblurring
Code Code Available 2MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning Mar 13, 2024 3D Object Detection Autonomous Driving
Code Code Available 2Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image Analysis Mar 12, 2024 Graph Representation Learning Representation Learning
Code Code Available 2Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture Mar 12, 2024 Motion Magnification Representation Learning
Code Code Available 2Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement Mar 11, 2024 Clinical Knowledge Descriptive
Code Code Available 2EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation Mar 3, 2024 Object Representation Learning
Code Code Available 2Dual-domain strip attention for image restoration Mar 1, 2024 Deblurring Denoising
Code Code Available 2CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feb 29, 2024 Representation Learning Visual Place Recognition
Code Code Available 2DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Feb 28, 2024 Contrastive Learning Decision Making
Code Code Available 2CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision Feb 26, 2024 Representation Learning Transfer Learning
Code Code Available 2EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs Feb 17, 2024 EEG EEG Signal Classification
Code Code Available 2Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning Feb 7, 2024 Contrastive Learning Prediction
Code Code Available 2Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram Feb 2, 2024 Diagnostic ECG Classification
Code Code Available 2