5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks (Aug 15, 2024). Tags: Image Classification. Code available.
Diffusion Feedback Helps CLIP See Better (Jul 29, 2024). Tags: Image Classification. Code available.
TCFormer: Visual Recognition via Token Clustering Transformer (Jul 16, 2024). Tags: Clustering; Image Classification. Code available.
xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart (Jul 1, 2024). Tags: 3D Medical Imaging Segmentation; Image Classification. Code available.
FusionBench: A Comprehensive Benchmark of Deep Model Fusion (Jun 5, 2024). Tags: Image Classification. Code available.
Demystify Mamba in Vision: A Linear Attention Perspective (May 26, 2024). Tags: Image Classification. Code available.
MobileNetV4 -- Universal Models for the Mobile Ecosystem (Apr 16, 2024). Tags: Image Classification; Neural Architecture Search. Code available.
RSMamba: Remote Sensing Image Classification with State Space Model (Mar 28, 2024). Tags: Classification; Image Classification. Code available.
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition (Mar 26, 2024). Tags: Image Classification; Instance Segmentation. Code available.
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining (Mar 20, 2024). Tags: Aerial Scene Classification; Building change detection for remote sensing images. Code available.
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks (Mar 1, 2024). Tags: Image Classification; Image Generation. Code available.
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey (Feb 8, 2024). Tags: Articles; Entity Alignment. Code available.
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket (Jan 4, 2024). Tags: Image Classification. Code available.
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices (Dec 28, 2023). Tags: AutoML; CPU. Code available.
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery (Dec 15, 2023). Tags: Contrastive Learning; Earth Observation. Code available.
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition (Nov 27, 2023). Tags: Image Classification; Object Detection. Code available.
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities (May 18, 2023). Tags: 1 Image, 2*2 Stitchi; Action Classification. Code available.
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization (Mar 24, 2023). Tags: 3D Hand Pose Estimation; GPU. Code available.
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling (Jan 9, 2023). Tags: 2D Object Detection; Contrastive Learning. Code available.
MetaFormer Baselines for Vision (Oct 24, 2022). Tags: Domain Generalization; Image Classification. Code available.
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends (Oct 17, 2022). Tags: Few-Shot Learning; Image Captioning. Code available.
Vision Transformers: From Semantic Segmentation to Dense Prediction (Jul 19, 2022). Tags: Image Classification. Code available.
Separable Self-attention for Mobile Vision Transformers (Jun 6, 2022). Tags: Image Classification; Object Detection. Code available.
MiniViT: Compressing Vision Transformers with Weight Multiplexing (Apr 14, 2022). Tags: Diversity; Image Classification. Code available.
MaxViT: Multi-Axis Vision Transformer (Apr 4, 2022). Tags: Image Classification. Code available.
Visual Prompt Tuning (Mar 23, 2022). Tags: Image Classification; Long-tail Learning. Code available.
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning (Feb 26, 2022). Tags: Image Classification. Code available.
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (Feb 8, 2022). Tags: Diagnostic; Image Captioning. Code available.
Patches Are All You Need? (Jan 24, 2022). Tags: All; Image Classification. Code available.
Transformers in Medical Imaging: A Survey (Jan 24, 2022). Tags: Image Classification; Image Segmentation. Code available.
Detecting Twenty-thousand Classes using Image-level Supervision (Jan 7, 2022). Tags: Cross-Domain Few-Shot Object Detection; Image Classification. Code available.
Datasets: A Community Library for Natural Language Processing (Sep 7, 2021). Tags: Image Classification; Object Recognition. Code available.
XCiT: Cross-Covariance Image Transformers (Jun 17, 2021). Tags: Image Classification. Code available.
EfficientNetV2: Smaller Models and Faster Training (Apr 1, 2021). Tags: AutoML; Classification. Code available.
U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection (May 18, 2020). Tags: Dichotomous Image Segmentation; GPU. Code available.
ResNeSt: Split-Attention Networks (Apr 19, 2020). Tags: Image Classification. Code available.
Momentum Contrast for Unsupervised Visual Representation Learning (Nov 13, 2019). Tags: Contrastive Learning; Image Classification. Code available.
Ludwig: a type-based declarative deep learning toolbox (Sep 17, 2019). Tags: Decoder; Deep Learning. Code available.
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks (May 28, 2019). Tags: Action Recognition; Domain Generalization. Code available.
Bag of Freebies for Training Object Detection Neural Networks (Feb 11, 2019). Tags: General Classification; Image Classification. Code available.
AutoAugment: Learning Augmentation Policies from Data (May 24, 2018). Tags: Data Augmentation; Domain Generalization. Code available.
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models (May 30, 2025). Tags: Classification; Disaster Response. Code available.
Optimal Weighted Convolution for Classification and Denosing (May 30, 2025). Tags: Classification; Denoising. Code available.
Towards Practical Second-Order Optimizers in Deep Learning: Insights from Fisher Information Analysis (Apr 26, 2025). Tags: Computational Efficiency; Image Classification. Code available.
Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning (Mar 20, 2025). Tags: Classification; Few-Shot Learning. Code available.
UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly Detection (Feb 28, 2025). Tags: Anomaly Detection; Image Classification. Code available.
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment (Feb 24, 2025). Tags: Image Classification. Code available.
Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention (Feb 19, 2025). Tags: Image Classification. Code available.
DAMamba: Vision State Space Model with Dynamic Adaptive Scan (Feb 18, 2025). Tags: Image Classification. Code available.
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification (Feb 12, 2025). Tags: Decoder; Descriptive. Code available.