SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately for each component task.
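As a rough illustration of the "detection plus classification" framing, the sketch below chains a detector and a classifier over a toy image. Everything in it is hypothetical: `recognize`, `toy_detect`, and `toy_classify` are placeholder names invented for this example, the image is a nested list of pixel intensities, and no real model or library API is implied.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive
Image = List[List[int]]          # toy grayscale image as nested lists


def crop(image: Image, box: Box) -> Image:
    """Return the sub-image covered by the box."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[Image], str],
) -> List[Tuple[Box, str]]:
    """Detect candidate boxes, then classify the crop inside each box."""
    return [(box, classify(crop(image, box))) for box in detect(image)]


# Hypothetical stand-ins for real models (assumptions, for illustration only).
def toy_detect(image: Image) -> List[Box]:
    # Pretend the detector always proposes these two fixed regions.
    return [(0, 0, 2, 2), (2, 2, 4, 4)]


def toy_classify(patch: Image) -> str:
    # Label a patch by its mean intensity instead of running a network.
    mean = sum(map(sum, patch)) / (len(patch) * len(patch[0]))
    return "bright" if mean > 127 else "dark"


image = [
    [200, 210, 0, 0],
    [220, 230, 0, 0],
    [0, 0, 10, 20],
    [0, 0, 30, 40],
]
results = recognize(image, toy_detect, toy_classify)
for box, label in results:
    print(box, label)
```

In practice the two stages are often fused in a single network; keeping them as separate callables here only mirrors the way the task description splits them.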

(Image credit: TensorFlow Object Detection API)

Papers

Showing 726-750 of 2042 papers

Title | Status | Hype
Opening Deep Neural Networks with Generative Models | Code | 0
Superpixel-based Knowledge Infusion in Deep Neural Networks for Image Classification | Code | 1
Are Convolutional Neural Networks or Transformers more like human vision? | Code | 1
Brain Inspired Face Recognition: A Computational Framework |  | 0
SizeNet: Object Recognition via Object Real Size-based Convolutional Networks |  | 0
ORCEA: Object Recognition by Continuous Evidence Assimilation |  | 0
Modelling of LIDAR sensor disturbances by solid airborne particles |  | 0
BIM Hyperreality: Data Synthesis Using BIM and Hyperrealistic Rendering for Deep Learning |  | 0
This Looks Like That... Does it? Shortcomings of Latent Space Prototype Interpretability in Deep Networks | Code | 1
Canonical Saliency Maps: Decoding Deep Face Models | Code | 0
Learning data association without data association: An EM approach to neural assignment prediction |  | 0
IPatch: A Remote Adversarial Patch |  | 0
ODDObjects: A Framework for Multiclass Unsupervised Anomaly Detection on Masked Objects | Code | 0
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition | Code | 1
Recurrent Feedback Improves Recognition of Partially Occluded Objects |  | 0
SPARK: SPAcecraft Recognition leveraging Knowledge of Space Environment |  | 0
Rock Hunting With Martian Machine Vision |  | 0
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition | Code | 1
Artificial and beneficial -- Exploiting artificial images for aerial vehicle detection |  | 0
Tuned Compositional Feature Replays for Efficient Stream Learning | Code | 0
Achieving Domain Generalization in Underwater Object Detection by Domain Mixup and Contrastive Learning |  | 0
Training Deep Neural Networks via Branch-and-Bound | Code | 0
A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis |  | 0
Domain-robust VQA with diverse datasets and methods but no target labels |  | 0
CNN-based search model underestimates attention guidance by simple visual features |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Imagen | shape bias | 98.7 |  | Unverified
2 | Stable Diffusion | shape bias | 92.7 |  | Unverified
3 | Parti | shape bias | 91.7 |  | Unverified
4 | ViT-22B-384 | shape bias | 86.4 |  | Unverified
5 | ViT-22B-560 | shape bias | 83.8 |  | Unverified
6 | CLIP (ViT-B) | shape bias | 79.9 |  | Unverified
7 | ViT-22B-224 | shape bias | 78 |  | Unverified
8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 |  | Unverified
9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 |  | Unverified
10 | SWSL (ResNeXt-101) | shape bias | 49.8 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.55 |  | Unverified
2 | SSNN | Accuracy (%) | 78.57 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.62 |  | Unverified
2 | SSNN | Accuracy (%) | 79.25 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 |  | Unverified
2 | yun | Top 5 Accuracy | 14.75 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 |  | Unverified
2 | DY | Top 5 Accuracy | 0.08 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 |  | Unverified
2 | AJ2021 | Top 5 Accuracy | 27.68 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SSNN | Accuracy (%) | 94.91 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Faster-RCNN | mAP | 30.39 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 96 |  | Unverified
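
For readers unfamiliar with the Top 5 Accuracy metric used in several of the tables, the following is a minimal sketch of how such a score is commonly computed: a prediction counts as correct when the true class is among the five highest-scoring classes. The function name `top_k_accuracy`, the toy scores, and the choice to return a percentage are assumptions made for illustration; the page does not state how the listed submissions were actually scored.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int = 5,
) -> float:
    """Percentage of samples whose true label is among the k top-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score, truncated to the best k.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        hits += label in top_k
    return 100.0 * hits / len(labels)


# Toy example with six classes and two samples (made-up numbers).
toy_scores = [
    [0.10, 0.20, 0.30, 0.05, 0.25, 0.10],  # true class 3 has the lowest score
    [0.60, 0.10, 0.10, 0.10, 0.05, 0.05],  # true class 0 has the highest score
]
toy_labels = [3, 0]
print(top_k_accuracy(toy_scores, toy_labels, k=5))
```

Setting `k=1` in the same function gives ordinary top-1 accuracy.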