SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 651700 of 2042 papers

TitleStatusHype
A Multi-viewpoint Outdoor Dataset for Human Action RecognitionCode0
MetaCOG: A Hierarchical Probabilistic Model for Learning Meta-Cognitive Visual RepresentationsCode0
Empowering Local Communities Using Artificial Intelligence0
The Challenge of Appearance-Free Object Tracking with Feedforward Neural Networks0
Self-Supervised Modality-Invariant and Modality-Specific Feature Learning for 3D Objects0
NODEAttack: Adversarial Attack on the Energy Consumption of Neural ODEs0
Disentangling Properties of Contrastive Methods0
Lifelong 3D Object Recognition and Grasp Synthesis Using Dual Memory Recurrent Self-Organization NetworksCode0
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?Code0
Superquadric Object Representation for Optimization-based Semantic SLAM0
Class incremental learning for video action classification0
ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations0
Object recognition for robotics from tactile time series data utilising different neural network architectures0
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker IdentificationCode0
Temporal RoI Align for Video Object Recognition0
Datasets: A Community Library for Natural Language ProcessingCode3
Capturing the objects of vision with neural networks0
Voxel Transformer for 3D Object DetectionCode1
Recognition Awareness: An Application of Latent Cognizance to Open-Set Recognition0
An Underwater Image Semantic Segmentation Method Focusing on Boundaries and a Real Underwater Scene Semantic Segmentation DatasetCode0
Airplane Type Identification Based on Mask RCNN; An Approach to Reduce Airport Traffic Congestion0
Reducing Label Effort: Self-Supervised meets Active Learning0
OOWL500: Overcoming Dataset Collection Bias in the Wild0
Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks0
VisBuddy -- A Smart Wearable Assistant for the Visually Challenged0
MetaSketch: Wireless Semantic Segmentation by Metamaterial Surfaces0
Patchwork: Concentric Zone-based Region-wise Ground Segmentation with Ground Likelihood Estimation Using a 3D LiDAR SensorCode1
Learning Canonical 3D Object Representation for Fine-Grained Recognition0
An optical biomimetic eyes with interested object imaging0
A Framework for Multi-View Classification of Features0
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study0
To Boost or not to Boost: On the Limits of Boosted Neural Networks0
Language Models as Zero-shot Visual Semantic Learners0
A 51.3 TOPS/W, 134.4 GOPS In-memory Binary Image Filtering in 65nm CMOS0
Deep Machine Learning Based Egyptian Vehicle License Plate Recognition Systems0
Saliency for free: Saliency prediction as a side-effect of object recognition0
Spectral Processing and Optimization of Static and Dynamic 3D Geometries0
The Foes of Neural Network's Data Efficiency Among Unnecessary Input Dimensions0
Applications of knowledge graphs for food science and industry0
Prediction Surface Uncertainty Quantification in Object Detection Models for Autonomous DrivingCode0
On the Challenges of Open World Recognitionunder Shifting Visual DomainsCode1
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question AnsweringCode1
MixStyle Neural Networks for Domain Generalization and Adaptation0
Hebbian learning with gradients: Hebbian convolutional neural networks with modern deep learning frameworksCode1
Parasitic Egg Detection and Classification in Low-cost Microscopic Images using Transfer Learning0
Understanding Adversarial Examples Through Deep Neural Network's Response Surface and Uncertainty Regions0
In-distribution adversarial attacks on object recognition models using gradient-free searchCode0
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database0
SISA: Securing Images by Selective Alteration0
Show:102550
← PrevPage 14 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified