SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 826850 of 2042 papers

TitleStatusHype
Canonical Saliency Maps: Decoding Deep Face ModelsCode0
Learning data association without data association: An EM approach to neural assignment prediction0
IPatch: A Remote Adversarial Patch0
ODDObjects: A Framework for Multiclass Unsupervised Anomaly Detection on Masked ObjectsCode0
Recurrent Feedback Improves Recognition of Partially Occluded Objects0
SPARK: SPAcecraft Recognition leveraging Knowledge of Space Environment0
Rock Hunting With Martian Machine Vision0
Artificial and beneficial -- Exploiting artificial images for aerial vehicle detection0
Tuned Compositional Feature Replays for Efficient Stream LearningCode0
Achieving Domain Generalization in Underwater Object Detection by Domain Mixup and Contrastive Learning0
Training Deep Neural Networks via Branch-and-BoundCode0
A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis0
CNN-based search model underestimates attention guidance by simple visual features0
Domain-robust VQA with diverse datasets and methods but no target labels0
VDM-DA: Virtual Domain Modeling for Source Data-free Domain Adaptation0
Projection: A Mechanism for Human-like Reasoning in Artificial Intelligence0
DNN Quantization with Attention0
Contrastive Reasoning in Neural Networks0
Neural Networks for Semantic Gaze Analysis in XR Settings0
Hebbian Semi-Supervised Learning in a Sample Efficiency Setting0
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation0
Structure-From-Motion and RGBD Depth Fusion0
Point Cloud Sampling via Graph Balancing and Gershgorin Disc Alignment0
A Discriminative Vectorial Framework for Multi-modal Feature Representation0
Offboard 3D Object Detection from Point Cloud Sequences0
Show:102550
← PrevPage 34 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified