SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately for each component task.

(Image credit: TensorFlow Object Detection API)

Papers

Showing 1651-1700 of 2042 papers

| Title | Status | Hype |
|---|---|---|
| What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures? | | 0 |
| View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation | | 0 |
| On Recognizing Transparent Objects in Domestic Environments Using Fusion of Multiple Sensor Modalities | | 0 |
| How Deep is the Feature Analysis underlying Rapid Visual Categorization? | | 0 |
| Learning compact binary descriptors with unsupervised deep neural networks | Code | 0 |
| Modality and Component Aware Feature Fusion For RGB-D Scene Classification | | 0 |
| Image Style Transfer Using Convolutional Neural Networks | Code | 0 |
| iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning | | 0 |
| Interactive Segmentation on RGBD Images via Cue Selection | | 0 |
| Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification | | 0 |
| BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition | | 0 |
| Occlusion Boundary Detection via Deep Exploration of Context | | 0 |
| Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks | | 0 |
| Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent | | 0 |
| Answer-Type Prediction for Visual Question Answering | | 0 |
| Consistency of Silhouettes and Their Duals | | 0 |
| Pairwise Linear Regression Classification for Image Set Retrieval | | 0 |
| Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition | | 0 |
| SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition | | 0 |
| Latent Bi-constraint SVM for Video-based Object Recognition | | 0 |
| Applications of Probabilistic Programming (Master's thesis, 2015) | | 0 |
| Towards ontology driven learning of visual concept detectors | | 0 |
| Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval | | 0 |
| Parametric Exponential Linear Unit for Deep Convolutional Neural Networks | | 0 |
| Semi-supervised Zero-Shot Learning by a Clustering-based Approach | | 0 |
| Pairwise Decomposition of Image Sequences for Active Multi-View Recognition | | 0 |
| Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning | Code | 1 |
| FPNN: Field Probing Neural Networks for 3D Data | Code | 0 |
| Hierarchical Piecewise-Constant Super-regions | | 0 |
| Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery | | 0 |
| Incremental Robot Learning of New Objects with Fixed Update Time | Code | 0 |
| Going Deeper into Action Recognition: A Survey | | 0 |
| An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild | Code | 0 |
| A New Manifold Distance Measure for Visual Object Categorization | | 0 |
| ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels | | 0 |
| Measurement Bounds for Sparse Signal Reconstruction with Multiple Side Information | | 0 |
| A Theoretical Analysis of Deep Neural Networks for Texture Classification | | 0 |
| Learning Attributes Equals Multi-Source Domain Generalization | | 0 |
| 1 Million Captioned Dutch Newspaper Images | | 0 |
| Zero-shot object prediction using semantic scene knowledge | | 0 |
| Are Face and Object Recognition Independent? A Neurocomputational Modeling Exploration | | 0 |
| Modeling the Contribution of Central Versus Peripheral Vision in Scene, Object, and Face Recognition | | 0 |
| Humans and deep networks largely agree on which kinds of variation make object recognition harder | | 0 |
| Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks | | 0 |
| Can Boosting with SVM as Week Learners Help? | | 0 |
| Invariant feature extraction from event based stimuli | | 0 |
| DTM: Deformable Template Matching | | 0 |
| Orientation-boosted Voxel Nets for 3D Object Recognition | | 0 |
| T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos | Code | 0 |
| Edge Detection Based Shape Identification | | 0 |

Benchmark Results

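| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Imagen | shape bias | 98.7 | | Unverified |
| 2 | Stable Diffusion | shape bias | 92.7 | | Unverified |
| 3 | Parti | shape bias | 91.7 | | Unverified |
| 4 | ViT-22B-384 | shape bias | 86.4 | | Unverified |
| 5 | ViT-22B-560 | shape bias | 83.8 | | Unverified |
| 6 | CLIP (ViT-B) | shape bias | 79.9 | | Unverified |
| 7 | ViT-22B-224 | shape bias | 78 | | Unverified |
| 8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | | Unverified |
| 9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | | Unverified |
| 10 | SWSL (ResNeXt-101) | shape bias | 49.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 85.55 | | Unverified |
| 2 | SSNN | Accuracy (%) | 78.57 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 85.62 | | Unverified |
| 2 | SSNN | Accuracy (%) | 79.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | | Unverified |
| 2 | yun | Top 5 Accuracy | 14.75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | DY | Top 5 Accuracy | 0.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | AJ2021 | Top 5 Accuracy | 27.68 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SSNN | Accuracy (%) | 94.91 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Faster-RCNN | mAP | 30.39 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 96 | | Unverified |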