SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 12511300 of 2042 papers

TitleStatusHype
HR-RCNN: Hierarchical Relational Reasoning for Object Detection0
Human Action Recognition in Still Images Using ConViT0
Human-Inspired Topological Representations for Visual Object Recognition in Unseen Environments0
Human-like general language processing0
Human peripheral blur is optimal for object recognition0
Humans and deep networks largely agree on which kinds of variation make object recognition harder0
Human vs. Computer in Scene and Object Recognition0
Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection0
Hybrid Predictive Coding: Inferring, Fast and Slow0
Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for Event-Based Vision0
Hyper-Class Augmented and Regularized Deep Learning for Fine-Grained Image Classification0
Hyper-parameter optimization of Deep Convolutional Networks for object recognition0
ICA with Reconstruction Cost for Efficient Overcomplete Feature Learning0
iCub World: Friendly Robots Help Building Good Vision Data-Sets0
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding0
Identifying Object States in Cooking-Related Images0
Identity documents recognition and detection using semantic segmentation with convolutional neural network0
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning0
Illumination-Invariant Image from 4-Channel Images: The Effect of Near-Infrared Data in Shadow Removal0
Image Captioning with Unseen Objects0
Image classification using local tensor singular value decompositions0
Image Description using Visual Dependency Representations0
Image Enhancement and Object Recognition for Night Vision Surveillance0
ImageNet MPEG-7 Visual Descriptors - Technical Report0
ImageNet-trained deep neural network exhibits illusion-like response to the Scintillating Grid0
Image Optimization and Prediction0
Image Retrieval based on Bag-of-Words model0
Imaging-free object recognition enabled by optical coherence0
Immersive Language Exploration with Object Recognition and Augmented Reality0
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data0
Improved Deep Learning of Object Category using Pose Information0
Improved Deep Metric Learning with Multi-class N-pair Loss Objective0
Improved Few-Shot Visual Classification0
Improved Inception-Residual Convolutional Neural Network for Object Recognition0
Improved training of binary networks for human pose estimation and image recognition0
Improved visible to IR image transformation using synthetic data augmentation with cycle-consistent adversarial networks0
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data0
Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic Information0
Enhancing Few-Shot Image Classification with Unlabelled Examples0
Improving Gibbs Sampler Scan Quality with DoGS0
Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations0
Improving neural networks with bunches of neurons modeled by Kumaraswamy units: Preliminary study0
Improving Performance of Object Detection using the Mechanisms of Visual Recognition in Humans0
Improving Robustness of Feature Representations to Image Deformations using Powered Convolution in CNNs0
Improving the Accuracy and Robustness of CNNs Using a Deep CCA Neural Data Regularizer0
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects0
Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition0
Incorporating Network Built-in Priors in Weakly-supervised Semantic Segmentation0
Incorporating Semantic Attention in Video Description Generation0
Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection0
Show:102550
← PrevPage 26 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified