SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 14511475 of 2042 papers

TitleStatusHype
What's in my Room? Object Recognition on Indoor Panoramic Images0
What takes the brain so long: Object recognition at the level of minimal images develops for up to seconds of presentation time0
What you need to know about the state-of-the-art computational models of object-vision: A tour through the models0
When Regression Meets Manifold Learning for Object Recognition and Pose Estimation0
Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks0
Why The Brain Separates Face Recognition From Object Recognition0
Winograd Convolution for Deep Neural Networks: Efficient Point Selection0
X-model: Improving Data Efficiency in Deep Learning with A Minimax Model0
YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation0
YOLO and K-Means Based 3D Object Detection Method on Image and Point Cloud0
You Only Speak Once to See0
Your head is there to move you around: Goal-driven models of the primate dorsal pathway0
Zero-Aliasing Correlation Filters for Object Recognition0
Zero-Knowledge Zero-Shot Learning for Novel Visual Category Discovery0
Zero-shot counting with a dual-stream neural network model0
Zero Shot Hashing0
Zero-Shot Learning from Adversarial Feature Residual to Compact Visual Feature0
Zero-shot Learning with Deep Neural Networks for Object Recognition0
Zero-shot object prediction using semantic scene knowledge0
Zero-Shot Object Recognition by Semantic Manifold Distance0
Zero-Shot Object Recognition System based on Topic Model0
Zero-shot recognition with unreliable attributes0
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages?0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM0
ZSpeedL -- Evaluating the Performance of Zero-Shot Learning Methods using Low-Power Devices0
Show:102550
← PrevPage 59 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified