SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 826850 of 2042 papers

TitleStatusHype
FusionNet: 3D Object Classification Using Multiple Data Representations0
Fusion of Inverse Synthetic Aperture Radar and Camera Images for Automotive Target Tracking0
Contrastive Reasoning in Neural Networks0
Hand Gestures Recognition in Videos Taken with Lensless Camera0
GBCNs: Genetic Binary Convolutional Networks for Enhancing the Performance of 1-bit DCNNs0
Context-Aware Zero-Shot Learning for Object Recognition0
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured0
Grounded Language Acquisition From Object and Action Imagery0
Generalization Boosted Adapter for Open-Vocabulary Segmentation0
Generalized Adaptive Dictionary Learning via Domain Shift Minimization0
Generalized K-fan Multimodal Deep Model with Shared Representations0
Generalized Lasso based Approximation of Sparse Coding for Visual Recognition0
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval0
Convolutional Networks with Dense Connectivity0
A ``Shape Aware'' Model for semi-supervised Learning of Objects and its Context0
Generating Clear Images From Images With Distortions Caused by Adverse Weather Using Generative Adversarial Networks0
Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines0
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition0
Discriminatively Trained Sparse Code Gradients for Contour Detection0
Brain-Like Object Recognition Neural Networks are more robustness to common corruptions0
Generic decoding of seen and imagined objects using hierarchical visual features0
Guided SAM: Label-Efficient Part Segmentation0
GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing0
Convolutional Prototype Learning for Zero-Shot Recognition0
Discriminative Ferns Ensemble for Hand Pose Recognition0
Show:102550
← PrevPage 34 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified