K-LITE: Learning Transferable Visual Models with External Knowledge Apr 20, 2022 Benchmarking Descriptive
Code Code Available 25 HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions Jul 28, 2022 Image Classification Object Detection
Code Code Available 25 HAIR: Hypernetworks-based All-in-One Image Restoration Aug 15, 2024 5-Degradation Blind All-in-One Image Restoration All
Code Code Available 25 HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification Sep 21, 2022 Classification image-classification
Code Code Available 25 Inception Transformer May 25, 2022 image-classification Image Classification
Code Code Available 25 LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Jul 25, 2024 Code Generation Computational Efficiency
Code Code Available 25 Global Context Vision Transformers Jun 20, 2022 image-classification Image Classification
Code Code Available 25 GIT: A Generative Image-to-text Transformer for Vision and Language May 27, 2022 Decoder Image Captioning
Code Code Available 25 GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism Nov 16, 2018 Fine-Grained Image Classification image-classification
Code Code Available 25 MobileOne: An Improved One millisecond Mobile Backbone Jun 8, 2022 Efficient Neural Network Gaze Estimation
Code Code Available 25 GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models May 30, 2025 Classification Disaster Response
Code Code Available 25 HGRN2: Gated Linear RNNs with State Expansion Apr 11, 2024 Image Classification Language Modeling
Code Code Available 25 GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs May 10, 2024 graph construction image-classification
Code Code Available 25 Adapter is All You Need for Tuning Visual Tasks Nov 25, 2023 All image-classification
Code Code Available 25 GalLoP: Learning Global and Local Prompts for Vision-Language Models Jul 1, 2024 Diversity Domain Generalization
Code Code Available 25 Generalized Parametric Contrastive Learning Sep 26, 2022 Contrastive Learning Domain Generalization
Code Code Available 25 FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning Dec 16, 2024 DeepFake Detection diffusion-generated faces detection
Code Code Available 25 Frontiers in Intelligent Colonoscopy Oct 22, 2024 Image Captioning
Code Code Available 25 Generative Pretraining from Pixels Jul 17, 2020 Image Classification Representation Learning
Code Code Available 25 A Simple Framework for Contrastive Learning of Visual Representations Feb 13, 2020 Contrastive Learning Image Classification
Code Code Available 25 LambdaNetworks: Modeling Long-Range Interactions Without Attention Feb 17, 2021 image-classification Image Classification
Code Code Available 25 ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks Feb 23, 2021 Image Classification
Code Code Available 25 A Simple Episodic Linear Probe Improves Visual Recognition in the Wild Jan 1, 2022 Fine-Grained Image Classification Image Classification
Code Code Available 25 Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image Classification Jan 11, 2024 Contrastive Learning image-classification
Code Code Available 25 SCAN: Learning to Classify Images without Labels May 25, 2020 Classification Clustering
Code Code Available 25 Learning Transferable Visual Models From Natural Language Supervision Feb 26, 2021 Action Recognition Benchmarking
Code Code Available 25 GrootVL: Tree Topology is All You Need in State Space Model Jun 4, 2024 All image-classification
Code Code Available 25 FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence Jan 21, 2020 Image Classification Pseudo Label
Code Code Available 25 Fixing the train-test resolution discrepancy: FixEfficientNet Mar 18, 2020 Data Augmentation Image Classification
Code Code Available 25 FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence Jan 21, 2020 Image Classification Pseudo Label
Code Code Available 25 AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients Oct 15, 2020 image-classification Image Classification
Code Code Available 25 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification Aug 25, 2024 Computational Efficiency Hyperspectral Image Classification
Code Code Available 25 Fixing the train-test resolution discrepancy Jun 14, 2019 Data Augmentation Fine-Grained Image Classification
Code Code Available 25 FasterViT: Fast Vision Transformers with Hierarchical Attention Jun 9, 2023 Image Classification object-detection
Code Code Available 25 Fast Vision Transformers with HiLo Attention May 26, 2022 Benchmarking Efficient ViTs
Code Code Available 25 An Overview of Deep Semi-Supervised Learning Jun 9, 2020 Deep Learning image-classification
Code Code Available 25 AdaFisher: Adaptive Second Order Optimization via Fisher Information May 26, 2024 Computational Efficiency image-classification
Code Code Available 25 ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification Jan 21, 2022 BIG-bench Machine Learning image-classification
Code Code Available 25 Focal Modulation Networks Mar 22, 2022 image-classification Image Classification
Code Code Available 25 GroupMamba: Efficient Group-Based Visual State Space Model Jul 18, 2024 image-classification Image Classification
Code Code Available 25 Efficient Multi-Scale Attention Module with Cross-Spatial Learning May 23, 2023 Dimensionality Reduction image-classification
Code Code Available 25 MogaNet: Multi-order Gated Aggregation Network Nov 7, 2022 3D Human Pose Estimation Image Classification
Code Code Available 25 EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality Nov 22, 2024 Efficient Neural Network Image Classification
Code Code Available 25 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification Dec 1, 2024 Computational Efficiency image-classification
Code Code Available 25 UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery Sep 18, 2021 Change Detection Decoder
Code Code Available 25 Effective Data Augmentation With Diffusion Models Feb 7, 2023 Data Augmentation Diversity
Code Code Available 25 EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications Jun 21, 2022 Image Classification Object Detection
Code Code Available 25 Dilated Neighborhood Attention Transformer Sep 29, 2022 Image Classification Instance Segmentation
Code Code Available 25 Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources Jan 4, 2022 Domain Adaptation domain classification
Code Code Available 25 DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification Jul 4, 2024 Descriptive Diversity
Code Code Available 25