SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 28512900 of 10419 papers

TitleStatusHype
Long-Range Feedback Spiking Network Captures Dynamic and Static Representations of the Visual Cortex under Movie StimuliCode0
Is Generative Modeling-based Stylization Necessary for Domain Adaptation in Regression Tasks?0
Break a Lag: Triple Exponential Moving Average for Enhanced Optimization0
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-TuningCode1
Vocabulary-free Image ClassificationCode1
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One DayCode4
A Robust Feature Downsampling Module for Remote Sensing Visual TasksCode1
LLMatic: Neural Architecture Search via Large Language Models and Quality Diversity OptimizationCode1
Addressing Discrepancies in Semantic and Visual Alignment in Neural Networks0
Pseudo Labels for Single Positive Multi-Label Learning0
Exploring the Versatility of Zero-Shot CLIP for Interstitial Lung Disease Classification0
Microstructure quality control of steels using deep learning0
Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shiftCode0
Adversarial-Aware Deep Learning System based on a Secondary Classical Machine Learning Verification Approach0
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training QuantizationCode0
Learning Across Decentralized Multi-Modal Remote Sensing Archives with Federated Learning0
GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?0
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesCode0
On the Limitations of Temperature Scaling for Distributions with OverlapsCode0
Training-free Neural Architecture Search for RNNs and TransformersCode1
Doubly Robust Self-TrainingCode0
Label-Retrieval-Augmented Diffusion Models for Learning from Noisy LabelsCode1
Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANNCode1
Bytes Are All You Need: Transformers Operating Directly On File Bytes0
Breast Cancer Detection and Diagnosis: A comparative study of state-of-the-arts deep learning architectures0
The Tunnel Effect: Building Data Representations in Deep Neural Networks0
Exploring Regions of Interest: Visualizing Histological Image Classification for Breast Cancer using Deep Learning0
Data Representations' Study of Latent Image ManifoldsCode0
A Computational Account Of Self-Supervised Visual Learning From Egocentric Object Play0
Machine learning with tree tensor networks, CP rank constraints, and tensor dropout0
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
Reduced Precision Floating-Point Optimization for Deep Neural Network On-Device Learning on MicroControllersCode1
A Rainbow in Deep Network Black BoxesCode1
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition0
A Transfer Learning and Explainable Solution to Detect mpox from Smartphones imagesCode0
GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-ray ClassificationCode1
Fourier Analysis on Robustness of Graph Convolutional Neural Networks for Skeleton-based Action RecognitionCode0
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language ModelsCode1
Deeply Coupled Cross-Modal Prompt LearningCode1
The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image ClassificationCode1
ASU-CNN: An Efficient Deep Architecture for Image Classification and Feature Visualizations0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual IllusionCode0
LowDINO -- A Low Parameter Self Supervised Learning ModelCode1
FoPro-KD: Fourier Prompted Effective Knowledge Distillation for Long-Tailed Medical Image RecognitionCode1
Learning from Children: Improving Image-Caption Pretraining via CurriculumCode0
Statistically Significant Concept-based Explanation of Image Classifiers via Model Knockoffs0
Image Quality Is Not All You Want: Task-Driven Lens Design for Image Classification0
Kernel Density Matrices for Probabilistic Deep LearningCode0
CNN Feature Map Augmentation for Single-Source Domain Generalization0
Sharpend Cosine Similarity based Neural Network for Hyperspectral Image Classification0
Show:102550
← PrevPage 58 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified