SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 101125 of 2042 papers

TitleStatusHype
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNsCode1
Computing the Testing Error Without a Testing SetCode1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual DependencyCode1
Convolutional Neural Networks with Gated Recurrent ConnectionsCode1
Contributions of Shape, Texture, and Color in Visual RecognitionCode1
Domain Generalization for Object Recognition with Multi-task AutoencodersCode1
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking ConsistencyCode1
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy PredictionCode1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal DynamicsCode1
CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessmentCode1
Attribution in Scale and SpaceCode1
Doubly Right Object Recognition: A Why Prompt for Visual RationalesCode1
DaWin: Training-free Dynamic Weight Interpolation for Robust AdaptationCode1
Debiased Self-Training for Semi-Supervised LearningCode1
Decoding Natural Images from EEG for Object RecognitionCode1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot LearningCode1
A Study of Face Obfuscation in ImageNetCode1
Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like?Code1
Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNetCode1
Deep Learning for Event-based Vision: A Comprehensive Survey and BenchmarksCode1
Adaptive Subspaces for Few-Shot LearningCode1
Look-into-Object: Self-supervised Structure Modeling for Object RecognitionCode1
DOCTOR: A Simple Method for Detecting Misclassification ErrorsCode1
Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-endsCode1
Dual-Hybrid Attention Network for Specular Highlight RemovalCode1
Show:102550
← PrevPage 5 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified