SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 17511800 of 2042 papers

TitleStatusHype
CloudFort: Enhancing Robustness of 3D Point Cloud Classification Against Backdoor Attacks via Spatial Partitioning and Ensemble Prediction0
Clustering Images by Unmasking - A New Baseline0
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis0
CNN-based search model underestimates attention guidance by simple visual features0
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data0
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs0
Collaboration Analysis Using Deep Learning0
Collaborative Descriptors: Convolutional Maps for Preprocessing0
Coloring Objects: Adjective-Noun Visual Semantic Compositionality0
Combinational neural network using Gabor filters for the classification of handwritten digits0
Combinatorial clustering and the beta negative binomial process0
Combined Approach for Image Segmentation0
Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition0
Combining Deep Transfer Learning with Signal-image Encoding for Multi-Modal Mental Wellbeing Classification0
Combining Lexical and Spatial Knowledge to Predict Spatial Relations between Objects in Images0
Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces0
Combining Texture and Shape Cues for Object Recognition With Minimal Supervision0
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics0
Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study0
Comparing Photorealism in Game Engines for Synthetic Maritime Computer Vision Datasets0
Complete End-To-End Low Cost Solution To a 3D Scanning System with Integrated Turntable0
Complex-valued Iris Recognition Network0
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion0
Compositional Embeddings for Multi-Label One-Shot Learning0
Compositional Hierarchical Tensor Factorization: Representing Hierarchical Intrinsic and Extrinsic Causal Factors0
Compressing Deep Convolutional Networks using Vector Quantization0
Compression of Deep Neural Networks on the Fly0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
Connecting metrics for shape-texture knowledge in computer vision0
Consistency of Silhouettes and Their Duals0
Instance Scale Normalization for image understanding0
Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models0
Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions0
CONTEMPLATING REAL-WORLDOBJECT RECOGNITION0
Content Placement in Networks of Similarity Caches0
Context Augmentation for Convolutional Neural Networks0
Context-Dependent Diffusion Network for Visual Relationship Detection0
Context-driven Visual Object Recognition based on Knowledge Graphs0
Context-LGM: Leveraging Object-Context Relation for Context-Aware Object Recognition0
Contextual Recurrent Convolutional Model for Robust Visual Learning0
Continual Hyperbolic Learning of Instances and Classes0
Continual-Learning-as-a-Service (CLaaS): On-Demand Efficient Adaptation of Predictive Models0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation0
Continual Learning for Pose-Agnostic Object Recognition in 3D Point Clouds0
Learning Visual Models using a Knowledge Graph as a Trainer0
Contrastive Object Detection Using Knowledge Graph Embeddings0
Contrastive Reasoning in Neural Networks0
Controlled-rearing studies of newborn chicks and deep neural networks0
Controlled Tactile Exploration and Haptic Object Recognition0
Convex Class Model on Symmetric Positive Definite Manifolds0
Show:102550
← PrevPage 36 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified