SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 16511700 of 2042 papers

TitleStatusHype
Improved Deep Learning of Object Category using Pose Information0
FusionNet: 3D Object Classification Using Multiple Data Representations0
Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods0
Distributed Coding of Multiview Sparse Sources with Joint Recovery0
Do semantic parts emerge in Convolutional Neural Networks?0
From Dependence to Causation0
Deep Reconstruction-Classification Networks for Unsupervised Domain AdaptationCode0
Enlightening Deep Neural Networks with Knowledge of Confounding Factors0
Captioning Images with Diverse ObjectsCode0
Saliency Driven Object recognition in egocentric videos with deep CNN0
Mutual Exclusivity Loss for Semi-Supervised Deep Learning0
Selective Unsupervised Feature Learning with Convolutional Neural Network (S-CNN)0
View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation0
What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures?0
How Deep is the Feature Analysis underlying Rapid Visual Categorization?0
On Recognizing Transparent Objects in Domestic Environments Using Fusion of Multiple Sensor Modalities0
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning0
Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification0
Learning compact binary descriptors with unsupervised deep neural networksCode0
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition0
Interactive Segmentation on RGBD Images via Cue Selection0
Occlusion Boundary Detection via Deep Exploration of Context0
BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition0
Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks0
Modality and Component Aware Feature Fusion For RGB-D Scene Classification0
Consistency of Silhouettes and Their Duals0
Answer-Type Prediction for Visual Question Answering0
Pairwise Linear Regression Classification for Image Set Retrieval0
Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent0
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition0
Image Style Transfer Using Convolutional Neural NetworksCode0
Towards ontology driven learning of visual concept detectors0
Applications of Probabilistic Programming (Master's thesis, 2015)0
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval0
Latent Bi-constraint SVM for Video-based Object Recognition0
Parametric Exponential Linear Unit for Deep Convolutional Neural Networks0
Semi-supervised Zero-Shot Learning by a Clustering-based Approach0
Pairwise Decomposition of Image Sequences for Active Multi-View Recognition0
FPNN: Field Probing Neural Networks for 3D DataCode0
Hierarchical Piecewise-Constant Super-regions0
Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery0
Incremental Robot Learning of New Objects with Fixed Update TimeCode0
Going Deeper into Action Recognition: A Survey0
An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the WildCode0
A New Manifold Distance Measure for Visual Object Categorization0
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels0
Measurement Bounds for Sparse Signal Reconstruction with Multiple Side Information0
A Theoretical Analysis of Deep Neural Networks for Texture Classification0
Learning Attributes Equals Multi-Source Domain Generalization0
1 Million Captioned Dutch Newspaper Images0
Show:102550
← PrevPage 34 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified