SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately for each component task.

(Image credit: TensorFlow Object Detection API)
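The framing above, detection followed by classification, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `recognize` helper, the toy detector and classifier, and the nested-list image format are hypothetical stand-ins, not part of any benchmark or library listed on this page. A real system would plug in a trained detector and classifier.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right); bottom/right exclusive
Image = List[List[int]]          # toy grayscale image as nested lists (assumption)


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of the image."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[Image], str],
) -> List[Tuple[Box, str]]:
    """Detect candidate boxes, then classify the crop inside each box."""
    return [(box, classify(crop(image, box))) for box in detect(image)]


# Hypothetical stand-ins for trained models, used only to make the sketch runnable.
def toy_detect(image: Image) -> List[Box]:
    """Return one box around all non-zero pixels, or no boxes if there are none."""
    rows = [r for r, row in enumerate(image) if any(row)]
    if not rows:
        return []
    cols = [c for c in range(len(image[0])) if any(row[c] for row in image)]
    return [(rows[0], cols[0], rows[-1] + 1, cols[-1] + 1)]


def toy_classify(patch: Image) -> str:
    """Label a patch by its mean intensity."""
    pixels = [p for row in patch for p in row]
    return "bright" if sum(pixels) / len(pixels) > 127 else "dark"


image = [
    [0, 0, 0, 0],
    [0, 200, 220, 0],
    [0, 210, 230, 0],
    [0, 0, 0, 0],
]
results = recognize(image, toy_detect, toy_classify)
print(results)
```

Passing the detector and classifier in as callables keeps the two stages independent, which mirrors how the component tasks are benchmarked separately.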

Papers

Showing 401–425 of 2042 papers

Title | Status | Hype
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization | Code | 0
ContextMix: A context-aware data augmentation method for industrial visual inspection systems | Code | 0
Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition | Code | 0
Continual egocentric object recognition | Code | 0
Facial Expression Recognition Research Based on Deep Learning | Code | 0
Captioning Images with Diverse Objects | Code | 0
EXOT: Exit-aware Object Tracker for Safe Robotic Manipulation of Moving Object | Code | 0
Canonical Saliency Maps: Decoding Deep Face Models | Code | 0
Continual Learning in Neural Networks | Code | 0
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated Interactions | Code | 0
Experiments with mmWave Automotive Radar Test-bed | Code | 0
Can Large Language Models Grasp Event Signals? Exploring Pure Zero-Shot Event-based Recognition | Code | 0
Ensemble learning in CNN augmented with fully connected subnetworks | Code | 0
Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch Recognition | Code | 0
End-to-End Learning of Representations for Asynchronous Event-Based Data | Code | 0
BViT: Broad Attention based Vision Transformer | Code | 0
3D_DEN: Open-ended 3D Object Recognition using Dynamically Expandable Networks | Code | 0
Multiple Object Recognition with Visual Attention | Code | 0
Multi-stage Deep Classifier Cascades for Open World Recognition | Code | 0
MVP-Bench: Can Large Vision-Language Models Conduct Multi-level Visual Perception Like Humans? | Code | 0
Enhancing Pollinator Conservation towards Agriculture 4.0: Monitoring of Bees through Object Recognition | Code | 0
Dynamic Rectification Knowledge Distillation | Code | 0
Do Pre-trained Vision-Language Models Encode Object States? | Code | 0
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators | Code | 0
Domain Generalization via Model-Agnostic Learning of Semantic Features | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Imagen | shape bias | 98.7 | - | Unverified
2 | Stable Diffusion | shape bias | 92.7 | - | Unverified
3 | Parti | shape bias | 91.7 | - | Unverified
4 | ViT-22B-384 | shape bias | 86.4 | - | Unverified
5 | ViT-22B-560 | shape bias | 83.8 | - | Unverified
6 | CLIP (ViT-B) | shape bias | 79.9 | - | Unverified
7 | ViT-22B-224 | shape bias | 78 | - | Unverified
8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | - | Unverified
9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | - | Unverified
10 | SWSL (ResNeXt-101) | shape bias | 49.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.55 | - | Unverified
2 | SSNN | Accuracy (%) | 78.57 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.62 | - | Unverified
2 | SSNN | Accuracy (%) | 79.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | - | Unverified
2 | yun | Top 5 Accuracy | 14.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | DY | Top 5 Accuracy | 0.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | AJ2021 | Top 5 Accuracy | 27.68 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SSNN | Accuracy (%) | 94.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Faster-RCNN | mAP | 30.39 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 96 | - | Unverified