SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 351400 of 2042 papers

TitleStatusHype
DAS: A Deformable Attention to Capture Salient Information in CNNs0
A Proper Orthogonal Decomposition approach for parameters reduction of Single Shot Detector networks0
A Framework for Multi-View Classification of Features0
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations0
Combinatorial clustering and the beta negative binomial process0
A Probabilistic Framework for Dynamic Object Recognition in 3D Environment With A Novel Continuous Ground Estimation Method0
Combinational neural network using Gabor filters for the classification of handwritten digits0
Coloring Objects: Adjective-Noun Visual Semantic Compositionality0
A priori compression of convolutional neural networks for wave simulators0
A Fog Robotic System for Dynamic Visual Servoing0
A General Ambiguity Model for Binary Edge Images with Edge Tracing and its Implementation0
Combined Approach for Image Segmentation0
Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition0
Combining Deep Transfer Learning with Signal-image Encoding for Multi-Modal Mental Wellbeing Classification0
Combining Lexical and Spatial Knowledge to Predict Spatial Relations between Objects in Images0
Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces0
Approximation of dilation-based spatial relations to add structural constraints in neural networks0
Collaborative Descriptors: Convolutional Maps for Preprocessing0
Co-training Transformer with Videos and Images Improves Action Recognition0
A randomized gradient-free attack on ReLU networks0
Are Accuracy and Robustness Correlated?0
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics0
Collaboration Analysis Using Deep Learning0
Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study0
Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification0
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation0
Complete End-To-End Low Cost Solution To a 3D Scanning System with Integrated Turntable0
Complex-valued Iris Recognition Network0
Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?0
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion0
Compositional Embeddings for Multi-Label One-Shot Learning0
Compositional Hierarchical Tensor Factorization: Representing Hierarchical Intrinsic and Extrinsic Causal Factors0
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs0
Applications of Probabilistic Programming (Master's thesis, 2015)0
Compression of Deep Neural Networks on the Fly0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis0
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data0
Connecting metrics for shape-texture knowledge in computer vision0
Consistency of Silhouettes and Their Duals0
Instance Scale Normalization for image understanding0
Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models0
Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions0
Application of Faster R-CNN model on Human Running Pattern Recognition0
CONTEMPLATING REAL-WORLDOBJECT RECOGNITION0
Content Placement in Networks of Similarity Caches0
Context Augmentation for Convolutional Neural Networks0
Context-Dependent Diffusion Network for Visual Relationship Detection0
Affordance Labeling and Exploration: A Manifold-Based Approach0
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection0
Show:102550
← PrevPage 8 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified