SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 301325 of 2042 papers

TitleStatusHype
Proactive Adversarial Defense: Harnessing Prompt Tuning in Vision-Language Models to Detect Unseen Backdoored Images0
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs0
Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis0
Can foundation models actively gather information in interactive environments to test hypotheses?0
Optimized CNNs for Rapid 3D Point Cloud Object Recognition0
LVLM-COUNT: Enhancing the Counting Ability of Large Vision-Language ModelsCode0
Textured As-Is BIM via GIS-informed Point Cloud Segmentation0
Verbalized Representation Learning for Interpretable Few-Shot GeneralizationCode0
Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agentsCode0
NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects?0
Comparing Photorealism in Game Engines for Synthetic Maritime Computer Vision Datasets0
Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation0
ViSTa Dataset: Do vision-language models understand sequential tasks?Code0
Value-Spectrum: Quantifying Preferences of Vision-Language Models via Value Decomposition in Social Media ContextsCode0
LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection0
Multiscale Dubuc: A New Similarity Measure for Time SeriesCode0
Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction0
DipMe: Haptic Recognition of Granular Media for Tangible Interactive Applications0
Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language ModelsCode0
Scaling Laws for Task-Optimized Models of the Primate Visual Ventral StreamCode0
Object Recognition in Human Computer Interaction:- A Comparative Analysis0
Lost in Context: The Influence of Context on Feature Attribution Methods for Object RecognitionCode0
Active Gaze Behavior Boosts Self-Supervised Object Learning0
Learning Where to Edit Vision TransformersCode0
Investigating the Gestalt Principle of Closure in Deep Convolutional Neural NetworksCode0
Show:102550
← PrevPage 13 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified