SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 901950 of 2042 papers

TitleStatusHype
Hierarchical Novelty Detection for Visual Object Recognition0
Hierarchical Object Representation for Open-Ended Object Category Learning and Recognition0
Hierarchical Piecewise-Constant Super-regions0
Hierarchical Prototype Learning for Zero-Shot Recognition0
Data-driven Verification of DNNs for Object Recognition0
Higher-order Pooling of CNN Features via Kernel Linearization for Action Recognition0
Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis0
Histograms of Pattern Sets for Image Classification and Object Recognition0
Discriminate-and-Rectify Encoders: Learning from Image Transformation Sets0
How Can CNNs Use Image Position for Segmentation?0
How Deep is the Feature Analysis underlying Rapid Visual Categorization?0
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses0
Discrete Potts Model for Generating Superpixels on Noisy Images0
How good are deep models in understanding the generated images?0
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time0
DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability0
How Small is Big Enough? Open Labeled Datasets and the Development of Deep Learning0
How to deal with glare for improved perception of Autonomous Vehicles0
How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes0
How well do deep neural networks trained on object recognition characterize the mouse visual system?0
Brain Cancer Segmentation Using YOLOv5 Deep Neural Network0
HR-RCNN: Hierarchical Relational Reasoning for Object Detection0
Human Action Recognition in Still Images Using ConViT0
Fractional order graph neural network0
Human-Inspired Topological Representations for Visual Object Recognition in Unseen Environments0
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence0
An Efficient Semi-Automated Scheme for Infrastructure LiDAR Annotation0
Human peripheral blur is optimal for object recognition0
Deep Active Object Recognition by Joint Label and Action Prediction0
Humans and deep networks largely agree on which kinds of variation make object recognition harder0
Deep Affordance-grounded Sensorimotor Object Recognition0
Human vs. Computer in Scene and Object Recognition0
Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection0
Hybrid Predictive Coding: Inferring, Fast and Slow0
Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for Event-Based Vision0
Hyper-Class Augmented and Regularized Deep Learning for Fine-Grained Image Classification0
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data0
Hyper-parameter optimization of Deep Convolutional Networks for object recognition0
Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations0
Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
iCub World: Friendly Robots Help Building Good Vision Data-Sets0
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding0
Identifying Object States in Cooking-Related Images0
Identity documents recognition and detection using semantic segmentation with convolutional neural network0
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning0
Illumination-Invariant Image from 4-Channel Images: The Effect of Near-Infrared Data in Shadow Removal0
Deep convolutional neural networks for brain image analysis on magnetic resonance imaging: a review0
Image Captioning with Unseen Objects0
BPGrad: Towards Global Optimality in Deep Learning via Branch and Pruning0
Show:102550
← PrevPage 19 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified