Parameter-Efficient Fine-Tuning with Discrete Fourier Transform (May 5, 2024). Tags: Image Classification
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning (May 4, 2024). Tags: Earth Observation, Image Classification
S^2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification (Apr 28, 2024). Tags: Hyperspectral Image Classification, Image Classification
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training (Apr 18, 2024). Tags: Contrastive Learning, CPU
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models (Apr 16, 2024). Tags: Image Classification
HGRN2: Gated Linear RNNs with State Expansion (Apr 11, 2024). Tags: Image Classification, Language Modeling
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss (Apr 2, 2024). Tags: Image Classification
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs (Mar 28, 2024). Tags: Fine-Grained Image Classification, Image Classification
Continual Forgetting for Pre-trained Vision Models (Mar 18, 2024). Tags: Continual Forgetting, Face Recognition
Trainable Fractional Fourier Transform (Mar 4, 2024). Tags: Image Classification
SURE: SUrvey REcipes for building reliable and robust deep networks (Mar 1, 2024). Tags: Image Classification
DEYO: DETR with YOLO for End-to-End Object Detection (Feb 26, 2024). Tags: Decoder, GPU
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design (Jan 29, 2024). Tags: CPU, GPU
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model (Jan 17, 2024). Tags: GPU, Image Classification
UV-SAM: Adapting Segment Anything Model for Urban Village Identification (Jan 16, 2024). Tags: Image Classification
Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image Classification (Jan 11, 2024). Tags: Contrastive Learning, Image Classification
Learning Vision from Models Rivals Learning Vision from Data (Dec 28, 2023). Tags: Contrastive Learning, Image Captioning
State-of-the-Art in Nudity Classification: A Comparative Analysis (Dec 26, 2023). Tags: Classification, Image Classification
Agent Attention: On the Integration of Softmax and Linear Attention (Dec 14, 2023). Tags: Computational Efficiency, Image Classification
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models (Dec 1, 2023). Tags: Image Classification, Multi-Object Tracking
TransNeXt: Robust Foveal Visual Perception for Vision Transformers (Nov 28, 2023). Tags: Classification, Domain Generalization
Adapter is All You Need for Tuning Visual Tasks (Nov 25, 2023). Tags: All, Image Classification
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition (Oct 30, 2023). Tags: Image Classification, Object Detection
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture (Oct 18, 2023). Tags: 4k, Image Classification
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction (Oct 2, 2023). Tags: Image Classification
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention (Sep 4, 2023). Tags: Image Classification, Instance Segmentation
RevColV2: Exploring Disentangled Representations in Masked Image Modeling (Sep 2, 2023). Tags: Decoder, Image Classification
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing (Jun 19, 2023). Tags: Classification, Cross-Modal Retrieval
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification (Jun 16, 2023). Tags: Diabetic Retinopathy Grading, Image Classification
NodeFormer: A Scalable Graph Structure Learning Transformer for Node Classification (Jun 14, 2023). Tags: Graph structure learning, Image Classification
FasterViT: Fast Vision Transformers with Hierarchical Attention (Jun 9, 2023). Tags: Image Classification, Object Detection
Efficient Multi-Scale Attention Module with Cross-Spatial Learning (May 23, 2023). Tags: Dimensionality Reduction, Image Classification
Unicom: Universal and Compact Representation Learning for Image Retrieval (Apr 12, 2023). Tags: Image Classification, Image Retrieval
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition (Apr 10, 2023). Tags: Image Classification
Your Diffusion Model is Secretly a Zero-Shot Classifier (Mar 28, 2023). Tags: Domain Generalization, Fine-Grained Image Classification
BiFormer: Vision Transformer with Bi-Level Routing Attention (Mar 15, 2023). Tags: Computational Efficiency, GPU
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents (Mar 13, 2023). Tags: Image Classification
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention (Mar 13, 2023). Tags: Image Classification
Stabilizing Transformer Training by Preventing Attention Entropy Collapse (Mar 11, 2023). Tags: Automatic Speech Recognition, Image Classification
RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data (Feb 28, 2023). Tags: Density Estimation, Image Classification
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification (Feb 19, 2023). Tags: Image Classification
Stitchable Neural Networks (Feb 13, 2023). Tags: Image Classification
Simple Hardware-Efficient Long Convolutions for Sequence Modeling (Feb 13, 2023). Tags: GPU, Image Classification
Understanding Why ViT Trains Badly on Small Datasets: An Intuitive Perspective (Feb 7, 2023). Tags: Image Classification
Effective Data Augmentation With Diffusion Models (Feb 7, 2023). Tags: Data Augmentation, Diversity
Class-Incremental Learning: A Survey (Feb 7, 2023). Tags: Class Incremental Learning
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition (Jan 30, 2023). Tags: Feature Upsampling, Image Classification
Reversible Column Networks (Dec 22, 2022). Tags: Image Classification
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation (Dec 2, 2022). Tags: Domain Adaptation, Image Classification
CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels (Nov 25, 2022). Tags: Image Classification