SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 10511100 of 2042 papers

TitleStatusHype
Applications of knowledge graphs for food science and industry0
Towards Instance Segmentation with Object Priority: Prominent Object Detection and Recognition0
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation0
Towards ontology driven learning of visual concept detectors0
Towards Open World Recognition0
Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors0
Towards real-time object recognition and pose estimation in point clouds0
Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation0
Towards the Design of an End-to-End Automated System for Image and Video-based Recognition0
Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time0
Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition0
TrackVLA: Embodied Visual Tracking in the Wild0
Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision0
Training Deep Spiking Neural Networks0
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts0
Training Skinny Deep Neural Networks with Iterative Hard Thresholding Methods0
Training the Untrainable: Introducing Inductive Bias via Representational Alignment0
Transferable Adversarial Attacks on Black-Box Vision-Language Models0
Transfer Learning for Material Classification using Convolutional Networks0
Transferred Fusion Learning using Skipped Networks0
Transferring Landmark Annotations for Cross-Dataset Face Alignment0
Transformational Sparse Coding0
Transformer-Based Microbubble Localization0
Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection0
Transformer in Touch: A Survey0
Transparency and Explanation in Deep Reinforcement Learning Neural Networks0
MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding0
TULIP: Towards Unified Language-Image Pretraining0
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model0
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking0
Understanding Adversarial Examples Through Deep Neural Network's Response Surface and Uncertainty Regions0
Understanding Bayesian Rooms Using Composite 3D Object Models0
Understanding How Blind Users Handle Object Recognition Errors: Strategies and Challenges0
Understanding Low- and High-Level Contributions to Fixation Prediction0
Understanding Tools: Task-Oriented Object Modeling, Learning and Recognition0
Unity of Opposites: SelfNorm and CrossNorm for Model Robustness0
Universal adversarial perturbation for remote sensing images0
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition0
Unsupervised AER Object Recognition Based on Multiscale Spatio-Temporal Features and Spiking Neurons0
Unsupervised Cross-Domain Recognition by Identifying Compact Joint Subspaces0
Unsupervised Domain Adaptation using Regularized Hyper-graph Matching0
Unsupervised Domain Adaptation using Graph Transduction Games0
Unsupervised feature learning by augmenting single images0
Unsupervised Feature Learning by Deep Sparse Coding0
Unsupervised Feature Learning for Event Data: Direct vs Inverse Problem Formulation0
Unsupervised Feature Learning with C-SVDDNet0
Unsupervised Foveal Vision Neural Networks with Top-Down Attention0
Unsupervised Generation of a Viewpoint Annotated Car Dataset From Videos0
Unsupervised Learning of Invariant Representations in Hierarchical Architectures0
Unsupervised Learning using Pretrained CNN and Associative Memory Bank0
Show:102550
← PrevPage 22 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified