SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 18761900 of 2042 papers

TitleStatusHype
Generalized Adaptive Dictionary Learning via Domain Shift Minimization0
A hierarchical framework for object recognition0
Learning visual biases from human imagination0
Efficient Image Categorization with Sparse Fisher Vector0
Zero-Shot Object Recognition System based on Topic Model0
HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual RecognitionCode0
Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics0
1-HKUST: Object Detection in ILSVRC 20140
Domain Adaptive Neural Networks for Object Recognition0
Going Deeper with ConvolutionsCode1
Transferring Landmark Annotations for Cross-Dataset Face Alignment0
ImageNet Large Scale Visual Recognition ChallengeCode1
Learning Multi-Scale Representations for Material Classification0
Coloring Objects: Adjective-Noun Visual Semantic Compositionality0
A Poodle or a Dog? Evaluating Automatic Image Annotation Using Human Descriptions at Different Levels of Granularity0
See No Evil, Say No Evil: Description Generation from Densely Labeled Images0
Non-parametric Image Registration of Airborne LiDAR, Hyperspectral and Photographic Imagery of Forests0
Object Proposal Generation using Two-Stage Cascade SVMs0
Pixels to Voxels: Modeling Visual Representation in the Human Brain0
What you need to know about the state-of-the-art computational models of object-vision: A tour through the models0
Analyzing the Performance of Multilayer Neural Networks for Object Recognition0
Multiple Moving Object Recognitions in video based on Log Gabor-PCA Approach0
Face Identification with Second-Order Pooling0
3D ShapeNets: A Deep Representation for Volumetric ShapesCode1
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual RecognitionCode0
Show:102550
← PrevPage 76 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified