SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 101125 of 2042 papers

TitleStatusHype
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNsCode1
CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessmentCode1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual DependencyCode1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal DynamicsCode1
Decoding Natural Images from EEG for Object RecognitionCode1
IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language ReasoningCode1
Contributions of Shape, Texture, and Color in Visual RecognitionCode1
Implicit Feature Refinement for Instance SegmentationCode1
Contemplating real-world object classificationCode1
Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022Code1
Convolutional Neural Networks with Gated Recurrent ConnectionsCode1
Label-Free Event-based Object Recognition via Joint Learning with Image Reconstruction from EventsCode1
Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNNCode1
LCD -- Line Clustering and Description for Place RecognitionCode1
Computing the Testing Error without a Testing SetCode1
Learning Iterative Reasoning through Energy MinimizationCode1
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object DetectionCode1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot LearningCode1
Computing the Testing Error Without a Testing SetCode1
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object RecognitionCode1
Adaptive Subspaces for Few-Shot LearningCode1
Look-into-Object: Self-supervised Structure Modeling for Object RecognitionCode1
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy PredictionCode1
Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-endsCode1
Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNetCode1
Show:102550
← PrevPage 5 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified