SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 701725 of 2042 papers

TitleStatusHype
AI Recommendation System for Enhanced Customer Experience: A Novel Image-to-Text Method0
Convolutional Models for Joint Object Categorization and Pose Estimation0
Convex Class Model on Symmetric Positive Definite Manifolds0
A shallow residual neural network to predict the visual cortex response0
Controlled Tactile Exploration and Haptic Object Recognition0
Controlled-rearing studies of newborn chicks and deep neural networks0
A semantics-driven methodology for high-quality image annotation0
AI-Powered GUI Attack and Its Defensive Methods0
Adaptation Across Extreme Variations using Unlabeled Domain Bridges0
6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping0
Contrastive Reasoning in Neural Networks0
A Self-supervised GAN for Unsupervised Few-shot Object Recognition0
Contrastive Object Detection Using Knowledge Graph Embeddings0
Learning Visual Models using a Knowledge Graph as a Trainer0
Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos0
AI-Powered Augmented Reality for Satellite Assembly, Integration and Test0
Artwork Recognition for Panorama Images Based on Optimized ASIFT and Cubic Projection0
Continual Learning for Pose-Agnostic Object Recognition in 3D Point Clouds0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation0
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling0
AI-Powered Assistive Technologies for Visual Impairment0
Continual-Learning-as-a-Service (CLaaS): On-Demand Efficient Adaptation of Predictive Models0
Continual Hyperbolic Learning of Instances and Classes0
ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding0
Contextual Recurrent Convolutional Model for Robust Visual Learning0
Show:102550
← PrevPage 29 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified