SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 12011250 of 2042 papers

TitleStatusHype
Graph-based Asynchronous Event Processing for Rapid Object Recognition0
Graph-Based High-Order Relation Discovery for Fine-Grained Recognition0
Graph Convolutional Networks for Classification with a Structured Label Space0
GFCN: A New Graph Convolutional Network Based on Parallel Flows0
Label Efficient Regularization and Propagation for Graph Node Classification0
Graphical Gaussian Vector for Image Categorization0
GraspCaps: A Capsule Network Approach for Familiar 6DoF Object Grasping0
Grassmannian learning mutual subspace method for image set recognition0
Grounded Language Acquisition From Object and Action Imagery0
Guided SAM: Label-Efficient Part Segmentation0
GuideMe: A Mobile Application based on Global Positioning System and Object Recognition Towards a Smart Tourist Guide0
Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements0
Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains0
Hand Gestures Recognition in Videos Taken with Lensless Camera0
Hand-Object Interaction and Precise Localization in Transitive Action Recognition0
Hand-Priming in Object Localization for Assistive Egocentric Vision0
Haptic in-sensor computing device made of carbon nanotube-polydimethylsiloxane nanocomposites0
Hardening RGB-D Object Recognition Systems against Adversarial Patch Attacks0
Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review0
Hardware Implementation of Hyperbolic Tangent Function using Catmull-Rom Spline Interpolation0
HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition0
Hebbian Semi-Supervised Learning in a Sample Efficiency Setting0
HeteroEdge: Addressing Asymmetry in Heterogeneous Collaborative Autonomous Systems0
HFirst: A Temporal Approach to Object Recognition0
Hidden Patch Attacks for Optical Flow0
Hierarchical Convolutional Features for Visual Tracking0
Hierarchical Deep Learning Architecture For 10K Objects Classification0
Hierarchical Feature Hashing for Fast Dimensionality Reduction0
Hierarchically Compositional Tasks and Deep Convolutional Networks0
Hierarchical Modular Optimization of Convolutional Networks Achieves Representations Similar to Macaque IT and Human Ventral Stream0
Hierarchical Neural Representation of Dreamed Objects Revealed by Brain Decoding with Deep Neural Network Features0
Hierarchical Novelty Detection for Visual Object Recognition0
Hierarchical Object Representation for Open-Ended Object Category Learning and Recognition0
Hierarchical Piecewise-Constant Super-regions0
Hierarchical Prototype Learning for Zero-Shot Recognition0
Higher-order Pooling of CNN Features via Kernel Linearization for Action Recognition0
Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis0
Histograms of Pattern Sets for Image Classification and Object Recognition0
MetaSketch: Wireless Semantic Segmentation by Metamaterial Surfaces0
How Can CNNs Use Image Position for Segmentation?0
How Deep is the Feature Analysis underlying Rapid Visual Categorization?0
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses0
How does task structure shape representations in deep neural networks?0
How good are deep models in understanding the generated images?0
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time0
How Small is Big Enough? Open Labeled Datasets and the Development of Deep Learning0
How to deal with glare for improved perception of Autonomous Vehicles0
How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes0
How well do deep neural networks trained on object recognition characterize the mouse visual system?0
HRItk: The Human-Robot Interaction ToolKit Rapid Development of Speech-Centric Interactive Systems in ROS0
Show:102550
← PrevPage 25 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified