SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 14511500 of 2042 papers

TitleStatusHype
What's in my Room? Object Recognition on Indoor Panoramic Images0
What takes the brain so long: Object recognition at the level of minimal images develops for up to seconds of presentation time0
What you need to know about the state-of-the-art computational models of object-vision: A tour through the models0
When Regression Meets Manifold Learning for Object Recognition and Pose Estimation0
Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks0
Why The Brain Separates Face Recognition From Object Recognition0
Winograd Convolution for Deep Neural Networks: Efficient Point Selection0
X-model: Improving Data Efficiency in Deep Learning with A Minimax Model0
YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation0
YOLO and K-Means Based 3D Object Detection Method on Image and Point Cloud0
You Only Speak Once to See0
Your head is there to move you around: Goal-driven models of the primate dorsal pathway0
Zero-Aliasing Correlation Filters for Object Recognition0
Zero-Knowledge Zero-Shot Learning for Novel Visual Category Discovery0
Zero-shot counting with a dual-stream neural network model0
Zero Shot Hashing0
Zero-Shot Learning from Adversarial Feature Residual to Compact Visual Feature0
Zero-shot Learning with Deep Neural Networks for Object Recognition0
Zero-shot object prediction using semantic scene knowledge0
Zero-Shot Object Recognition by Semantic Manifold Distance0
Zero-Shot Object Recognition System based on Topic Model0
Zero-shot recognition with unreliable attributes0
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages?0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM0
ZSpeedL -- Evaluating the Performance of Zero-Shot Learning Methods using Low-Power Devices0
Context-Aware Zero-Shot Learning for Object Recognition0
PCA-RECT: An Energy-efficient Object Detection Approach for Event Cameras0
VGG Fine-tuning for Cooking State Recognition0
Y-GAN: A Generative Adversarial Network for Depthmap Estimation from Multi-camera Stereo Images0
1-HKUST: Object Detection in ILSVRC 20140
1 Million Captioned Dutch Newspaper Images0
A General Ambiguity Model for Binary Edge Images with Edge Tracing and its Implementation0
Source-Free Domain-Invariant Performance Prediction0
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence0
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision0
AW-GATCN: Adaptive Weighted Graph Attention Convolutional Network for Event Camera Data Joint Denoising and Object Recognition0
2D Image Features Detector And Descriptor Selection Expert System0
300 GHz Radar Object Recognition based on Deep Neural Networks and Transfer Learning0
3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data0
3D Object Detection Method Based on YOLO and K-Means for Image and Point Clouds0
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations0
3D Object Recognition with Deep Belief Nets0
6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping0
A 51.3 TOPS/W, 134.4 GOPS In-memory Binary Image Filtering in 65nm CMOS0
AAD: Adaptive Anomaly Detection through traffic surveillance videos0
A Bayesian Framework for Active Tactile Object Recognition, Pose Estimation and Shape Transfer Learning0
A Benchmark Grocery Dataset of Realworld Point Clouds From Single View0
A biologically plausible network for the computation of orientation dominance0
Abnormal Object Recognition: A Comprehensive Study0
Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses0
Show:102550
← PrevPage 30 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified