SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 376400 of 2042 papers

TitleStatusHype
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation0
Complete End-To-End Low Cost Solution To a 3D Scanning System with Integrated Turntable0
Complex-valued Iris Recognition Network0
Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?0
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion0
Compositional Embeddings for Multi-Label One-Shot Learning0
Compositional Hierarchical Tensor Factorization: Representing Hierarchical Intrinsic and Extrinsic Causal Factors0
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs0
Applications of Probabilistic Programming (Master's thesis, 2015)0
Compression of Deep Neural Networks on the Fly0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis0
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data0
Connecting metrics for shape-texture knowledge in computer vision0
Consistency of Silhouettes and Their Duals0
Instance Scale Normalization for image understanding0
Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models0
Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions0
Application of Faster R-CNN model on Human Running Pattern Recognition0
CONTEMPLATING REAL-WORLDOBJECT RECOGNITION0
Content Placement in Networks of Similarity Caches0
Context Augmentation for Convolutional Neural Networks0
Context-Dependent Diffusion Network for Visual Relationship Detection0
Affordance Labeling and Exploration: A Manifold-Based Approach0
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection0
Show:102550
← PrevPage 16 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified