Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Feb 6, 2025 image-classification Image Classification
Code Code Available 2LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Jan 17, 2025 Change Detection Image Classification
Code Code Available 2Practical Continual Forgetting for Pre-trained Vision Models Jan 16, 2025 Continual Forgetting Face Recognition
Code Code Available 2Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Jan 14, 2025 image-classification Image Classification
Code Code Available 2TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios Jan 10, 2025 Aerial Scene Classification CPU
Code Code Available 2MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification Jan 9, 2025 Classification Hyperspectral Image Classification
Code Code Available 2FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning Dec 16, 2024 DeepFake Detection diffusion-generated faces detection
Code Code Available 2Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation Dec 11, 2024 image-classification Image Classification
Code Code Available 22DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification Dec 1, 2024 Computational Efficiency image-classification
Code Code Available 2TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Nov 26, 2024 image-classification Image Classification
Code Code Available 2Task Singular Vectors: Reducing Task Interference in Model Merging Nov 26, 2024 Classification Image Classification
Code Code Available 2EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality Nov 22, 2024 Efficient Neural Network Image Classification
Code Code Available 2BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models Nov 21, 2024 image-classification Image Classification
Code Code Available 2ScaleKD: Strong Vision Transformers Could Be Excellent Teachers Nov 11, 2024 image-classification Image Classification
Code Code Available 2Frontiers in Intelligent Colonoscopy Oct 22, 2024 Image Captioning
Code Code Available 2Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion Oct 19, 2024 image-classification Image Classification
Code Code Available 2CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling Sep 28, 2024 image-classification Image Classification
Code Code Available 2One missing piece in Vision and Language: A Survey on Comics Understanding Sep 14, 2024 document understanding image-classification
Code Code Available 2A Survey on Mixup Augmentations and Beyond Sep 8, 2024 Image Classification Self-Supervised Learning
Code Code Available 2PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation Sep 6, 2024 Benchmarking image-classification
Code Code Available 2The AdEMAMix Optimizer: Better, Faster, Older Sep 5, 2024 image-classification Image Classification
Code Code Available 23D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification Aug 25, 2024 Computational Efficiency Hyperspectral Image Classification
Code Code Available 2HAIR: Hypernetworks-based All-in-One Image Restoration Aug 15, 2024 5-Degradation Blind All-in-One Image Restoration All
Code Code Available 2SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Aug 15, 2024 Continual Learning image-classification
Code Code Available 2CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Aug 7, 2024 image-classification Image Classification
Code Code Available 2VSSD: Vision Mamba with Non-Causal State Space Duality Jul 26, 2024 image-classification Image Classification
Code Code Available 2LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Jul 25, 2024 Code Generation Computational Efficiency
Code Code Available 2GroupMamba: Efficient Group-Based Visual State Space Model Jul 18, 2024 image-classification Image Classification
Code Code Available 2DataDream: Few-shot Guided Dataset Generation Jul 15, 2024 Classification Dataset Generation
Code Code Available 2AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Jul 5, 2024 Action Recognition Few-Shot Image Classification
Code Code Available 2DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification Jul 4, 2024 Descriptive Diversity
Code Code Available 2GalLoP: Learning Global and Local Prompts for Vision-Language Models Jul 1, 2024 Diversity Domain Generalization
Code Code Available 2PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration Jun 28, 2024 image-classification Image Classification
Code Code Available 2Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Jun 25, 2024 cross-modal alignment Image Classification
Code Code Available 2TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning Jun 21, 2024 Fairness Geographic Question Answering
Code Code Available 2WATT: Weight Average Test-Time Adaptation of CLIP Jun 19, 2024 image-classification Image Classification
Code Code Available 2AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification Jun 18, 2024 Diversity image-classification
Code Code Available 2Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Jun 17, 2024 image-classification Image Classification
Code Code Available 2Unveiling the Power of Wavelets: A Wavelet-based Kolmogorov-Arnold Network for Hyperspectral Image Classification Jun 12, 2024 Hyperspectral Image Classification image-classification
Code Code Available 2Parameter-Inverted Image Pyramid Networks Jun 6, 2024 Computational Efficiency image-classification
Code Code Available 2GrootVL: Tree Topology is All You Need in State Space Model Jun 4, 2024 All image-classification
Code Code Available 2Why are Visually-Grounded Language Models Bad at Image Classification? May 28, 2024 Classification image-classification
Code Code Available 2AdaFisher: Adaptive Second Order Optimization via Fisher Information May 26, 2024 Computational Efficiency image-classification
Code Code Available 2Accelerating Transformers with Spectrum-Preserving Token Merging May 25, 2024 image-classification Image Classification
Code Code Available 2Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators May 23, 2024 image-classification Image Classification
Code Code Available 2EMR-Merging: Tuning-Free High-Performance Model Merging May 23, 2024 Image Classification Image Retrieval
Code Code Available 2Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification May 20, 2024 Hyperspectral Image Classification image-classification
Code Code Available 2SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization May 19, 2024 image-classification Image Classification
Code Code Available 2Many-Shot In-Context Learning in Multimodal Foundation Models May 16, 2024 image-classification Image Classification
Code Code Available 2GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs May 10, 2024 graph construction image-classification
Code Code Available 2