SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 426450 of 2042 papers

TitleStatusHype
Visual Story Generation Based on Emotion and KeywordsCode0
TempSAL -- Uncovering Temporal Information for Deep Saliency PredictionCode1
GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition0
Deep Learning from Parametrically Generated Virtual Buildings for Real-World Object Recognition0
Shift from Texture-bias to Shape-bias: Edge Deformation-based Augmentation for Robust Object Recognition0
Autonomous Manipulation Learning for Similar Deformable Objects via Only One Demonstration0
MISC210K: A Large-Scale Dataset for Multi-Instance Semantic CorrespondenceCode0
TempSAL - Uncovering Temporal Information for Deep Saliency PredictionCode1
Parsing Objects at a Finer Granularity: A Survey0
Part-guided Relational Transformers for Fine-grained Visual RecognitionCode1
Brain Cancer Segmentation Using YOLOv5 Deep Neural Network0
Decision-making and control with diffractive optical networksCode0
Improving Pre-Trained Weights Through Meta-Heuristics Fine-TuningCode0
ColorSense: A Study on Color Vision in Machine Visual Recognition0
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
OAMixer: Object-aware Mixing Layer for Vision TransformersCode0
Doubly Right Object Recognition: A Why Prompt for Visual RationalesCode1
A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations0
Teaching What You Should Teach: A Data-Based Distillation Method0
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions0
Beyond Object Recognition: A New Benchmark towards Object Concept Learning0
Recognizing Object by Components with Human Prior Knowledge Enhances Adversarial Robustness of Deep Neural NetworksCode0
PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain GeneralizationCode1
Extreme Image Transformations Affect Humans and Machines Differently0
Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition0
Show:102550
← PrevPage 18 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified