SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 11761200 of 2042 papers

TitleStatusHype
Recurrent Connections Aid Occluded Object Recognition by Discounting Occluders0
Discriminative Embedding Autoencoder with a Regressor Feedback for Zero-Shot Learning0
Human Pose Estimation for Real-World Crowded ScenariosCode0
Mitigating the Hubness Problem for Zero-Shot Learning of 3D Objects0
A Performance Evaluation of Correspondence Grouping Methods for 3D Rigid Data Matching0
Region-Manipulated Fusion Networks for Pancreatitis Recognition0
Neural Image Captioning0
One Network for Multi-Domains: Domain Adaptive Hashing with Intersectant Generative Adversarial NetworkCode0
NetTailor: Tuning the Architecture, Not Just the WeightsCode0
Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching0
A shallow residual neural network to predict the visual cortex response0
Topology Maintained Structure Encoding0
Mutual exclusivity as a challenge for deep neural networks0
Integrating Knowledge and Reasoning in Image Understanding0
Intrinsic dimension estimation for locally undersampled dataCode0
MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks0
Learning Adaptive Classifiers Synthesis for Generalized Few-Shot LearningCode0
Learning Representations of Graph Data -- A Survey0
Does Object Recognition Work for Everyone?0
Farm land weed detection with region-based deep convolutional neural networks0
Adaptation Across Extreme Variations using Unlabeled Domain Bridges0
Efficient Codebook and Factorization for Second Order Representation Learning0
Dominant Set Clustering and Pooling for Multi-View 3D Object RecognitionCode0
Y-GAN: A Generative Adversarial Network for Depthmap Estimation from Multi-camera Stereo Images0
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis0
Show:102550
← PrevPage 48 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified