SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 16011650 of 2042 papers

TitleStatusHype
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation0
Learning Detailed Face Reconstruction from a Single Image0
Gradients of Counterfactuals0
Crowdsourcing in Computer Vision0
STDP-based spiking deep convolutional neural networks for object recognitionCode0
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics0
Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks0
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial ModelingCode0
Template Matching Advances and Applications in Image Analysis0
Recurrent 3D Attentional Networks for End-to-End Active Object Recognition0
Are Accuracy and Robustness Correlated?0
Multiple Instance Learning Convolutional Neural Networks for Object Recognition0
Tangled Splines0
Zero Shot Hashing0
Egocentric Height Estimation0
ResearchDoom and CocoDoom: Learning Computer Vision with Games0
DeepGaze II: Reading fixations from deep features trained on object recognition0
Recognizing Open-Vocabulary Relations between Objects in Images0
Kernel Methods on Approximate Infinite-Dimensional Covariance Operators for Image Classification0
Optimistic and Pessimistic Neural Networks for Scene and Object Recognition0
Transfer Learning for Material Classification using Convolutional Networks0
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised LocalizationCode0
Combining Texture and Shape Cues for Object Recognition With Minimal Supervision0
Reliable Attribute-Based Object Recognition Using High Predictive Value Classifiers0
Using Spatial Pooler of Hierarchical Temporal Memory to classify noisy videos with predefined complexity0
Investigating Fluidity for Human-Robot Interaction with Real-time, Real-world Grounding Strategies0
Ambient Sound Provides Supervision for Visual LearningCode0
Densely Connected Convolutional NetworksCode1
Towards Bayesian Deep Learning: A Framework and Some Existing Methods0
Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch RecognitionCode0
Deep Convolutional Neural Networks for Microscopy-Based Point of Care Diagnostics0
Adapting Deep Network Features to Capture Psychological RepresentationsCode0
Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations0
Combining Lexical and Spatial Knowledge to Predict Spatial Relations between Objects in Images0
Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis0
SwiDeN : Convolutional Neural Networks For Depiction Invariant Object RecognitionCode0
Feature Descriptors for Tracking by Detection: a Benchmark0
Improved Deep Learning of Object Category using Pose Information0
Learning to Recognize Objects by Retaining other Factors of Variation0
Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods0
FusionNet: 3D Object Classification Using Multiple Data Representations0
Distributed Coding of Multiview Sparse Sources with Joint Recovery0
Do semantic parts emerge in Convolutional Neural Networks?0
From Dependence to Causation0
Deep Reconstruction-Classification Networks for Unsupervised Domain AdaptationCode0
Enlightening Deep Neural Networks with Knowledge of Confounding Factors0
Captioning Images with Diverse ObjectsCode0
Saliency Driven Object recognition in egocentric videos with deep CNN0
Mutual Exclusivity Loss for Semi-Supervised Deep Learning0
Selective Unsupervised Feature Learning with Convolutional Neural Network (S-CNN)0
Show:102550
← PrevPage 33 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified