SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 20262042 of 2042 papers

TitleStatusHype
Visually Interpretable Subtask Reasoning for Visual Question AnsweringCode0
Robust Unsupervised Domain Adaptation for Neural Networks via Moment AlignmentCode0
The Cooperative Network Architecture: Learning Structured Networks as Representation of Sensory PatternsCode0
PANDA: Pose Aligned Networks for Deep Attribute ModelingCode0
Robust Visual Tracking via Hierarchical Convolutional FeaturesCode0
Looking Fast and Slow: Memory-Guided Mobile Video Object DetectionCode0
Domain Generalization by Solving Jigsaw PuzzlesCode0
Algorithms for Semantic Segmentation of Multispectral Remote Sensing Imagery using Deep LearningCode0
Look Twice: A Generalist Computational Model Predicts Return Fixations across Tasks and SpeciesCode0
Lost in Context: The Influence of Context on Feature Attribution Methods for Object RecognitionCode0
Spectral Illumination Correction: Achieving Relative Color Constancy Under the Spectral DomainCode0
Understanding and Visualizing Deep Visual Saliency ModelsCode0
Low-Shot Learning for the Semantic Segmentation of Remote Sensing ImageryCode0
Domain-aware Triplet loss in Domain GeneralizationCode0
AGA: Attribute-Guided AugmentationCode0
LVLM-COUNT: Enhancing the Counting Ability of Large Vision-Language ModelsCode0
Visual Probing and Correction of Object Recognition Models with Interactive user feedbackCode0
Show:102550
← PrevPage 82 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified