SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 14511475 of 2042 papers

TitleStatusHype
Dynamic texture recognition using time-causal and time-recursive spatio-temporal receptive fields0
Graph Convolutional Networks for Classification with a Structured Label Space0
Eigen-Distortions of Hierarchical Representations0
A concatenating framework of shortcut convolutional neural networks0
Understanding Low- and High-Level Contributions to Fixation Prediction0
Deep Scene Image Classification With the MFAFVNet0
Deep Competitive Pathway NetworksCode0
Are we done with object recognition? The iCub robot's perspectiveCode0
A Generic Regression Framework for Pose Recognition on Color and Depth Images0
Combinational neural network using Gabor filters for the classification of handwritten digits0
Recognizing Objects In-the-wild: Where Do We Stand?0
Object Recognition from very few Training Examples for Enhancing Bicycle Maps0
Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions0
An Adaptive Sampling Scheme to Efficiently Train Fully Convolutional Networks for Semantic Segmentation0
Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach0
Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition0
Distributed Deep Neural Networks over the Cloud, the Edge and End DevicesCode1
Complete End-To-End Low Cost Solution To a 3D Scanning System with Integrated Turntable0
ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids0
ScatterNet Hybrid Deep Learning (SHDL) Network For Object Classification0
Automatic Dataset Augmentation0
One-Shot Concept Learning by Simulating Evolutionary Instinct Development0
CNN Fixations: An unraveling approach to visualize the discriminative image regionsCode0
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects0
Knock-Knock: Acoustic Object Recognition by using Stacked Denoising Autoencoders0
Show:102550
← PrevPage 59 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified