SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 501525 of 2042 papers

TitleStatusHype
Dynamic Rectification Knowledge DistillationCode0
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from VideosCode0
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training AcceleratorsCode0
A Dataset for Crucial Object Recognition in Blind and Low-Vision Individuals' NavigationCode0
Efficient Event Stream Super-Resolution with Recursive Multi-Branch FusionCode0
Texture Synthesis Using Convolutional Neural NetworksCode0
Dominant Set Clustering and Pooling for Multi-View 3D Object RecognitionCode0
An Analysis of Unsupervised Pre-training in Light of Recent AdvancesCode0
Don't Judge by the Look: Towards Motion Coherent Video RepresentationCode0
Domain Generalization via Model-Agnostic Learning of Semantic FeaturesCode0
Do Pre-trained Vision-Language Models Encode Object States?Code0
Experiments with mmWave Automotive Radar Test-bedCode0
Domain-aware Triplet loss in Domain GeneralizationCode0
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech RecognitionCode0
Domain Generalization by Solving Jigsaw PuzzlesCode0
Mixed Evidence for Gestalt Grouping in Deep Neural NetworksCode0
Do Deep Neural Networks Suffer from Crowding?Code0
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configurationCode0
Domain Generalization by Solving Jigsaw PuzzlesCode0
Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object ClassificationCode0
Task-generalizable Adversarial Attack based on Perceptual MetricCode0
Do deep nets really need weight decay and dropout?Code0
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOSCode0
Temporal-Coded Deep Spiking Neural Network with Easy Training and Robust PerformanceCode0
Directional Regularized Tensor Modeling for Video Rain Streaks RemovalCode0
Show:102550
← PrevPage 21 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified