SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 751800 of 10419 papers

TitleStatusHype
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language ModelsCode1
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization0
Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification0
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives0
FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification0
Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models0
Undermining Image and Text Classification Algorithms Using Adversarial Attacks0
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis0
Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision0
Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification0
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty MeasurementCode0
MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN0
How many classifiers do we need?0
Retrieval-enriched zero-shot image classification in low-resource domains0
Class Incremental Learning with Task-Specific Batch Normalization and Out-of-Distribution Detection0
FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration DetectionCode0
Learning local discrete features in explainable-by-design convolutional neural networksCode0
Aerial Flood Scene Classification Using Fine-Tuned Attention-based Architecture for Flood-Prone Countries in South Asia0
Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image ClassificationCode0
Multilingual Vision-Language Pre-training for the Remote Sensing DomainCode0
Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks0
CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP0
Saliency-Based diversity and fairness Metric and FaceKeepOriginalAugment: A Novel Approach for Enhancing Fairness and Diversity0
Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm0
Breast Cancer Histopathology Classification using CBAM-EfficientNetV2 with Transfer Learning0
Bayesian Optimization for Hyperparameters Tuning in Neural Networks0
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image DatasetsCode0
Active Learning for Vision-Language Models0
FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake DetectionCode0
FewVS: A Vision-Semantics Integration Framework for Few-Shot Image ClassificationCode1
AiSciVision: A Framework for Specializing Large Multimodal Models in Scientific Image ClassificationCode0
Interpretable Image Classification with Adaptive Prototype-based Vision TransformersCode1
Historical Test-time Prompt Tuning for Vision Foundation Models0
Sequential Large Language Model-Based Hyper-parameter OptimizationCode0
Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits0
Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search0
A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERTCode0
OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery0
Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational ObjectiveCode0
Spatial-Temporal Search for Spiking Neural Networks0
Noise Adaption Network for Morse Code Image ClassificationCode0
A Combinatorial Approach to Neural Emergent Communication0
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks0
Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing0
Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive LearningCode0
New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture0
Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers0
Benchmarking Large Language Models for Image Classification of Marine MammalsCode0
Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification0
Show:102550
← PrevPage 16 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified