SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 801850 of 2042 papers

TitleStatusHype
Are Face and Object Recognition Independent? A Neurocomputational Modeling Exploration0
Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations0
Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels0
Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition0
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion0
Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?0
Forecasting Hands and Objects in Future Frames0
Fashioning with Networks: Neural Style Transfer to Design Clothes0
Farm land weed detection with region-based deep convolutional neural networks0
Fourier descriptors based on the structure of the human primary visual cortex with applications to object recognition0
Foveated Downsampling Techniques0
Complex-valued Iris Recognition Network0
Complete End-To-End Low Cost Solution To a 3D Scanning System with Integrated Turntable0
Factorization of View-Object Manifolds for Joint Object Recognition and Pose Estimation0
Face-space Action Recognition by Face-Object Interactions0
Face processing emerges from object-trained convolutional neural networks0
From the Virtual to the RealWorld: Referring to Objects in Real-World Spatial Scenes0
From Virtual to Real: A Framework for Verbal Interaction with Robots0
Comparing Photorealism in Game Engines for Synthetic Maritime Computer Vision Datasets0
From Zero-shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis0
A recurrent multi-scale approach to RBG-D Object Recognition0
A Convolutional Neural Network based Live Object Recognition System as Blind Aid0
Face Identification with Second-Order Pooling0
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset0
Fully-Connected CRFs with Non-Parametric Pairwise Potential0
FusionNet: 3D Object Classification Using Multiple Data Representations0
Fusion of Inverse Synthetic Aperture Radar and Camera Images for Automotive Target Tracking0
Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study0
EZSR: Event-based Zero-Shot Recognition0
GBCNs: Genetic Binary Convolutional Networks for Enhancing the Performance of 1-bit DCNNs0
Extreme Image Transformations Facilitate Robust Latent Object Representations0
A Real-time Junk Food Recognition System based on Machine Learning0
Extreme Image Transformations Affect Humans and Machines Differently0
Generalization Boosted Adapter for Open-Vocabulary Segmentation0
Generalized Adaptive Dictionary Learning via Domain Shift Minimization0
Generalized K-fan Multimodal Deep Model with Shared Representations0
Generalized Lasso based Approximation of Sparse Coding for Visual Recognition0
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval0
Exponential Discriminative Metric Embedding in Deep Learning0
The Origins and Prevalence of Texture Bias in Convolutional Neural Networks0
Generating Clear Images From Images With Distortions Caused by Adverse Weather Using Generative Adversarial Networks0
Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines0
Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics0
Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models0
Are Accuracy and Robustness Correlated?0
Exploring Temporal Differences in 3D Convolutional Neural Networks0
Exploring Context and Visual Pattern of Relationship for Scene Graph Generation0
GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing0
Exploration of object recognition from 3D point cloud0
A randomized gradient-free attack on ReLU networks0
Show:102550
← PrevPage 17 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified