SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 651675 of 2042 papers

TitleStatusHype
Efficient Codebook and Factorization for Second Order Representation Learning0
Efficient Estimation of Regularized Tyler's M-Estimator Using Approximate LOOCV0
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding0
Efficient Gesture Recognition for the Assistance of Visually Impaired People using Multi-Head Neural Networks0
A neuromorphic approach to image processing and machine vision0
Efficient Image Categorization with Sparse Fisher Vector0
Efficient Multi-Band Temporal Video Filter for Reducing Human-Robot Interaction0
Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition0
Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images0
Efficient Point-to-Subspace Query in ^1 with Application to Robust Object Instance Recognition0
Efficient visual object representation using a biologically plausible spike-latency code and winner-take-all inhibition0
Egocentric Audio-Visual Noise Suppression0
Egocentric Height Estimation0
Egocentric Hierarchical Visual Semantics0
EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision0
Advancing Egocentric Video Question Answering with Multimodal Large Language Models0
Eigen-Distortions of Hierarchical Representations0
EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More0
Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition0
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices0
Embodied vision for learning object representations0
Emergent communication for AR0
EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition0
300 GHz Radar Object Recognition based on Deep Neural Networks and Transfer Learning0
Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching0
Show:102550
← PrevPage 27 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified