SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately under each of those component tasks.

(Image credit: TensorFlow Object Detection API)

Papers

Showing 851-900 of 2042 papers

| Title | Status | Hype |
|---|---|---|
| Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study | | 0 |
| EZSR: Event-based Zero-Shot Recognition | | 0 |
| Glitch Classification and Clustering for LIGO with Deep Transfer Learning | | 0 |
| Extreme Image Transformations Facilitate Robust Latent Object Representations | | 0 |
| Global Deconvolutional Networks for Semantic Segmentation | | 0 |
| A Real-time Junk Food Recognition System based on Machine Learning | | 0 |
| Going Deeper into Action Recognition: A Survey | | 0 |
| Extreme Image Transformations Affect Humans and Machines Differently | | 0 |
| Gradient-based Laplacian Feature Selection | | 0 |
| Gradients of Counterfactuals | | 0 |
| Exponential Discriminative Metric Embedding in Deep Learning | | 0 |
| Graph-Based High-Order Relation Discovery for Fine-Grained Recognition | | 0 |
| The Origins and Prevalence of Texture Bias in Convolutional Neural Networks | | 0 |
| GFCN: A New Graph Convolutional Network Based on Parallel Flows | | 0 |
| Label Efficient Regularization and Propagation for Graph Node Classification | | 0 |
| Graphical Gaussian Vector for Image Categorization | | 0 |
| GraspCaps: A Capsule Network Approach for Familiar 6DoF Object Grasping | | 0 |
| Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics | | 0 |
| Grassmannian learning mutual subspace method for image set recognition | | 0 |
| Are Accuracy and Robustness Correlated? | | 0 |
| Exploring Temporal Differences in 3D Convolutional Neural Networks | | 0 |
| Exploring Context and Visual Pattern of Relationship for Scene Graph Generation | | 0 |
| Exploration of object recognition from 3D point cloud | | 0 |
| Guided SAM: Label-Efficient Part Segmentation | | 0 |
| GuideMe: A Mobile Application based on Global Positioning System and Object Recognition Towards a Smart Tourist Guide | | 0 |
| Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements | | 0 |
| A randomized gradient-free attack on ReLU networks | | 0 |
| Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains | | 0 |
| Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks | | 0 |
| Hand-Object Interaction and Precise Localization in Transitive Action Recognition | | 0 |
| Hand-Priming in Object Localization for Assistive Egocentric Vision | | 0 |
| Exploiting Temporal Relations on Radar Perception for Autonomous Driving | | 0 |
| Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks | | 0 |
| Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation | | 0 |
| Combining Texture and Shape Cues for Object Recognition With Minimal Supervision | | 0 |
| Hardware Implementation of Hyperbolic Tangent Function using Catmull-Rom Spline Interpolation | | 0 |
| A Random-Fern based Feature Approach for Image Matching | | 0 |
| A Framework For Refining Text Classification and Object Recognition from Academic Articles | | 0 |
| A concatenating framework of shortcut convolutional neural networks | | 0 |
| Hebbian Semi-Supervised Learning in a Sample Efficiency Setting | | 0 |
| HeteroEdge: Addressing Asymmetry in Heterogeneous Collaborative Autonomous Systems | | 0 |
| HFirst: A Temporal Approach to Object Recognition | | 0 |
| 3D Object Recognition with Deep Belief Nets | | 0 |
| Exploiting an Oracle that Reports AUC Scores in Machine Learning Contests | | 0 |
| Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces | | 0 |
| Hierarchical Deep Learning Architecture For 10K Objects Classification | | 0 |
| Exploit Bounding Box Annotations for Multi-label Object Recognition | | 0 |
| Hierarchically Compositional Tasks and Deep Convolutional Networks | | 0 |
| Hierarchical Modular Optimization of Convolutional Networks Achieves Representations Similar to Macaque IT and Human Ventral Stream | | 0 |
| Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | | 0 |

Benchmark Results

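| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Imagen | shape bias | 98.7 | | Unverified |
| 2 | Stable Diffusion | shape bias | 92.7 | | Unverified |
| 3 | Parti | shape bias | 91.7 | | Unverified |
| 4 | ViT-22B-384 | shape bias | 86.4 | | Unverified |
| 5 | ViT-22B-560 | shape bias | 83.8 | | Unverified |
| 6 | CLIP (ViT-B) | shape bias | 79.9 | | Unverified |
| 7 | ViT-22B-224 | shape bias | 78 | | Unverified |
| 8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | | Unverified |
| 9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | | Unverified |
| 10 | SWSL (ResNeXt-101) | shape bias | 49.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 85.55 | | Unverified |
| 2 | SSNN | Accuracy (%) | 78.57 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 85.62 | | Unverified |
| 2 | SSNN | Accuracy (%) | 79.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | | Unverified |
| 2 | yun | Top 5 Accuracy | 14.75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | DY | Top 5 Accuracy | 0.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | AJ2021 | Top 5 Accuracy | 27.68 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SSNN | Accuracy (%) | 94.91 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Faster-RCNN | mAP | 30.39 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Spike-VGG11 | Accuracy (%) | 96 | | Unverified |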