SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately for each component task.
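
As a rough illustration of how the two component tasks fit together, the sketch below runs a detector to propose boxes and then a classifier to label each cropped region. Every name here (`Recognition`, `crop`, `recognize`, `toy_detect`, `toy_classify`) is hypothetical and chosen for illustration only; the toy detector and classifier are stand-ins, not models from any paper listed on this page.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Assumed conventions (not from the source page):
# a box is (x0, y0, x1, y1) with exclusive upper bounds,
# an image is a list of rows of grayscale pixel values.
Box = Tuple[int, int, int, int]
Image = Sequence[Sequence[int]]


@dataclass
class Recognition:
    box: Box
    label: str


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the region covered by `box` out of the image."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[Image], str],
) -> List[Recognition]:
    """Detection proposes boxes; classification labels each crop."""
    return [Recognition(box, classify(crop(image, box))) for box in detect(image)]


def toy_detect(image: Image) -> List[Box]:
    """Stand-in detector: one box around all non-zero pixels, if any."""
    coords = [(x, y) for y, row in enumerate(image) for x, v in enumerate(row) if v > 0]
    if not coords:
        return []
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return [(min(xs), min(ys), max(xs) + 1, max(ys) + 1)]


def toy_classify(patch: Image) -> str:
    """Stand-in classifier: label a crop by its mean brightness."""
    pixels = [v for row in patch for v in row]
    return "bright" if sum(pixels) / len(pixels) > 127 else "dark"
```

In a real system the two callables would be replaced by trained models; the point of the sketch is only that each stage can be evaluated on its own, which is why the results are split by component task.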

(Image credit: TensorFlow Object Detection API)

Papers

Showing 1801–1825 of 2042 papers

Title | Status | Hype
SegICP: Integrated Deep Semantic Segmentation and Pose Estimation | Code | 0
Multiple Object Recognition with Visual Attention | Code | 0
SharpNet: Fast and Accurate Recovery of Occluding Contours in Monocular Depth Estimation | Code | 0
Multi-level 3D CNN for Learning Multi-scale Spatial Features | Code | 0
Image Captioning using Deep Neural Architectures | Code | 0
Improved object recognition using neural networks trained to mimic the brain's statistical properties | Code | 0
Multiscale Dubuc: A New Similarity Measure for Time Series | Code | 0
Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object Recognition | Code | 0
A Dataset for Crucial Object Recognition in Blind and Low-Vision Individuals' Navigation | Code | 0
ImageNet Classification with Deep Convolutional Neural Networks | Code | 0
Food Image Recognition by Using Convolutional Neural Networks (CNNs) | Code | 0
Prediction Surface Uncertainty Quantification in Object Detection Models for Autonomous Driving | Code | 0
Fit to Measure: Reasoning about Sizes for Robust Object Recognition | Code | 0
Multi-stage Deep Classifier Cascades for Open World Recognition | Code | 0
Unsupervised Domain Adaptation using Feature-Whitening and Consensus Loss | Code | 0
Image Privacy Prediction Using Deep Neural Networks | Code | 0
Privacy Leakage of SIFT Features via Deep Generative Model based Image Reconstruction | Code | 0
Image Style Transfer Using Convolutional Neural Networks | Code | 0
Imagine2touch: Predictive Tactile Sensing for Robotic Manipulation using Efficient Low-Dimensional Signals | Code | 0
SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving | Code | 0
Self-supervised Domain Adaptation for Computer Vision Tasks | Code | 0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models | Code | 0
Probing Multimodal Large Language Models for Global and Local Semantic Representations | Code | 0
Transfer Learning based Detection of Diabetic Retinopathy from Small Dataset | Code | 0
MVP-Bench: Can Large Vision-Language Models Conduct Multi-level Visual Perception Like Humans? | Code | 0
Page 73 of 82

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Imagen | shape bias | 98.7 | - | Unverified
2 | Stable Diffusion | shape bias | 92.7 | - | Unverified
3 | Parti | shape bias | 91.7 | - | Unverified
4 | ViT-22B-384 | shape bias | 86.4 | - | Unverified
5 | ViT-22B-560 | shape bias | 83.8 | - | Unverified
6 | CLIP (ViT-B) | shape bias | 79.9 | - | Unverified
7 | ViT-22B-224 | shape bias | 78 | - | Unverified
8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | - | Unverified
9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | - | Unverified
10 | SWSL (ResNeXt-101) | shape bias | 49.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.55 | - | Unverified
2 | SSNN | Accuracy (%) | 78.57 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 85.62 | - | Unverified
2 | SSNN | Accuracy (%) | 79.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | - | Unverified
2 | yun | Top 5 Accuracy | 14.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | DY | Top 5 Accuracy | 0.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | - | Unverified
2 | AJ2021 | Top 5 Accuracy | 27.68 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SSNN | Accuracy (%) | 94.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Faster-RCNN | mAP | 30.39 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Spike-VGG11 | Accuracy (%) | 96 | - | Unverified