SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 201225 of 2042 papers

TitleStatusHype
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoCode1
Person Re-Identification with a Locally Aware TransformerCode1
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNsCode1
ProxyMix: Proxy-based Mixup Training with Label Refinery for Source-Free Domain AdaptationCode1
RAMP-CNN: A Novel Neural Network for Enhanced Automotive Radar Object RecognitionCode1
Recognize Any RegionsCode1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual DependencyCode1
Robust and Efficient Post-Processing for Video Object Detection (REPP)Code1
Causal Transportability for Visual RecognitionCode1
Billion-scale semi-supervised learning for image classificationCode1
Bilateral Event Mining and Complementary for Event Stream Super-ResolutionCode1
Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like?Code1
Self-supervised learning of video representations from a child's perspectiveCode1
Self-Supervised Learning with Kernel Dependence MaximizationCode1
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking ConsistencyCode1
SL-DML: Signal Level Deep Metric Learning for Multimodal One-Shot Action RecognitionCode1
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed DataCode1
Single Shot MC Dropout ApproximationCode1
Contributions of Shape, Texture, and Color in Visual RecognitionCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance SegmentationCode1
An Adaptive Sampling Scheme to Efficiently Train Fully Convolutional Networks for Semantic Segmentation0
An Adaptive Descriptor Design for Object Recognition in the Wild0
A biologically plausible network for the computation of orientation dominance0
A Multisensory Learning Architecture for Rotation-invariant Object Recognition0
Show:102550
← PrevPage 9 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified