SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 151175 of 2042 papers

TitleStatusHype
FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing ImageryCode1
Computing the Testing Error without a Testing SetCode1
Comparison of semi-supervised deep learning algorithms for audio classificationCode1
Improving neural networks by preventing co-adaptation of feature detectorsCode1
F-SIOL-310: A Robotic Dataset and Benchmark for Few-Shot Incremental Object LearningCode1
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object DetectionCode1
Intriguing properties of generative classifiersCode1
A Study of Face Obfuscation in ImageNetCode1
Ev-TTA: Test-Time Adaptation for Event-Based Object RecognitionCode1
EventRPG: Event Data Augmentation with Relevance Propagation GuidanceCode1
Learning Efficient Coding of Natural Images with Maximum Manifold Capacity RepresentationsCode1
Learning Iterative Reasoning through Energy MinimizationCode1
Evolving Deep Neural NetworksCode1
Convolutional Neural Networks with Gated Recurrent ConnectionsCode1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal DynamicsCode1
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy PredictionCode1
Expanding Event Modality Applications through a Robust CLIP-Based EncoderCode1
Are Convolutional Neural Networks or Transformers more like human vision?Code1
DaWin: Training-free Dynamic Weight Interpolation for Robust AdaptationCode1
EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge DistillationCode1
Look-into-Object: Self-supervised Structure Modeling for Object RecognitionCode1
LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic AnnotationCode1
Event-based Asynchronous Sparse Convolutional NetworksCode1
Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-endsCode1
Enriching ImageNet with Human Similarity Judgments and Psychological EmbeddingsCode1
Show:102550
← PrevPage 7 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified