SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 901925 of 2042 papers

TitleStatusHype
Hierarchical Novelty Detection for Visual Object Recognition0
Hierarchical Object Representation for Open-Ended Object Category Learning and Recognition0
Hierarchical Piecewise-Constant Super-regions0
Hierarchical Prototype Learning for Zero-Shot Recognition0
Data-driven Verification of DNNs for Object Recognition0
Higher-order Pooling of CNN Features via Kernel Linearization for Action Recognition0
Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis0
Histograms of Pattern Sets for Image Classification and Object Recognition0
An Efficient Semi-Automated Scheme for Infrastructure LiDAR Annotation0
How Can CNNs Use Image Position for Segmentation?0
Image Captioning with Unseen Objects0
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses0
ImageNet MPEG-7 Visual Descriptors - Technical Report0
How good are deep models in understanding the generated images?0
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time0
DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability0
How Small is Big Enough? Open Labeled Datasets and the Development of Deep Learning0
How to deal with glare for improved perception of Autonomous Vehicles0
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data0
How well do deep neural networks trained on object recognition characterize the mouse visual system?0
Improving neural networks with bunches of neurons modeled by Kumaraswamy units: Preliminary study0
HR-RCNN: Hierarchical Relational Reasoning for Object Detection0
Human Action Recognition in Still Images Using ConViT0
Fractional order graph neural network0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
Show:102550
← PrevPage 37 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified