SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately under each component task (object detection and image classification).
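
To make the "detection plus classification" framing concrete, here is a minimal sketch that composes the two steps: find candidate boxes, then label the crop inside each one. Everything named here (`recognize`, `crop`, `toy_detect`, `toy_classify`, the `(x0, y0, x1, y1)` box convention) is a hypothetical stand-in for illustration, not an API from any listed paper or library, and many real systems predict boxes and labels jointly rather than in two separate passes.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Assumed conventions for this sketch only.
Box = Tuple[int, int, int, int]        # (x0, y0, x1, y1) in pixel coordinates
Image = Sequence[Sequence[float]]      # grayscale image as rows of pixel values


@dataclass
class Recognition:
    box: Box
    label: str


def crop(image: Image, box: Box) -> List[List[float]]:
    """Cut the region inside `box` out of the image."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[List[List[float]]], str],
) -> List[Recognition]:
    """Detect candidate boxes, then classify the crop inside each box."""
    return [Recognition(box, classify(crop(image, box))) for box in detect(image)]


# Toy stand-ins so the sketch runs; a real system would use trained models.
def toy_detect(image: Image) -> List[Box]:
    return [(0, 0, 2, 2), (2, 2, 4, 4)]


def toy_classify(patch: List[List[float]]) -> str:
    mean = sum(map(sum, patch)) / (len(patch) * len(patch[0]))
    return "bright" if mean > 0.5 else "dark"


image = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.2],
    [0.0, 0.0, 0.2, 0.1],
]
results = recognize(image, toy_detect, toy_classify)
for r in results:
    print(r.box, r.label)
```

Passing the detector and classifier in as plain callables keeps the two component tasks separable, which mirrors how their benchmark results are tracked independently.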


Papers

Showing 1651–1675 of 2042 papers

Title | Status | Hype
What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures? | - | 0
View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation | - | 0
On Recognizing Transparent Objects in Domestic Environments Using Fusion of Multiple Sensor Modalities | - | 0
How Deep is the Feature Analysis underlying Rapid Visual Categorization? | - | 0
Learning compact binary descriptors with unsupervised deep neural networks | Code | 0
Modality and Component Aware Feature Fusion For RGB-D Scene Classification | - | 0
Image Style Transfer Using Convolutional Neural Networks | Code | 0
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning | - | 0
Interactive Segmentation on RGBD Images via Cue Selection | - | 0
Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification | - | 0
BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition | - | 0
Occlusion Boundary Detection via Deep Exploration of Context | - | 0
Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks | - | 0
Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent | - | 0
Answer-Type Prediction for Visual Question Answering | - | 0
Consistency of Silhouettes and Their Duals | - | 0
Pairwise Linear Regression Classification for Image Set Retrieval | - | 0
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition | - | 0
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition | - | 0
Latent Bi-constraint SVM for Video-based Object Recognition | - | 0
Applications of Probabilistic Programming (Master's thesis, 2015) | - | 0
Towards ontology driven learning of visual concept detectors | - | 0
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval | - | 0
Parametric Exponential Linear Unit for Deep Convolutional Neural Networks | - | 0
Semi-supervised Zero-Shot Learning by a Clustering-based Approach | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Imagen | shape bias | 98.7 | - | Unverified
2 | Stable Diffusion | shape bias | 92.7 | - | Unverified
3 | Parti | shape bias | 91.7 | - | Unverified
4 | ViT-22B-384 | shape bias | 86.4 | - | Unverified
5 | ViT-22B-560 | shape bias | 83.8 | - | Unverified
6 | CLIP (ViT-B) | shape bias | 79.9 | - | Unverified
7 | ViT-22B-224 | shape bias | 78 | - | Unverified
8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | - | Unverified
9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | - | Unverified
10 | SWSL (ResNeXt-101) | shape bias | 49.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.55 | - | Unverified
2 | SSNN | Accuracy (%) | 78.57 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.62 | - | Unverified
2 | SSNN | Accuracy (%) | 79.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | - | Unverified
2 | yun | Top 5 Accuracy | 14.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | DY | Top 5 Accuracy | 0.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | AJ2021 | Top 5 Accuracy | 27.68 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SSNN | Accuracy (%) | 94.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Faster-RCNN | mAP | 30.39 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 96 | - | Unverified