SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 10511075 of 2042 papers

TitleStatusHype
Applications of knowledge graphs for food science and industry0
Towards Instance Segmentation with Object Priority: Prominent Object Detection and Recognition0
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation0
Towards ontology driven learning of visual concept detectors0
Towards Open World Recognition0
Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors0
Towards real-time object recognition and pose estimation in point clouds0
Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation0
Towards the Design of an End-to-End Automated System for Image and Video-based Recognition0
Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time0
Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition0
TrackVLA: Embodied Visual Tracking in the Wild0
Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision0
Training Deep Spiking Neural Networks0
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts0
Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods0
Training the Untrainable: Introducing Inductive Bias via Representational Alignment0
Transferable Adversarial Attacks on Black-Box Vision-Language Models0
Transfer Learning for Material Classification using Convolutional Networks0
Transferred Fusion Learning using Skipped Networks0
Transferring Landmark Annotations for Cross-Dataset Face Alignment0
Transformational Sparse Coding0
Transformer-Based Microbubble Localization0
Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection0
Transformer in Touch: A Survey0
Show:102550
← PrevPage 43 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified