SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 601650 of 2042 papers

TitleStatusHype
A newborn embodied Turing test for view-invariant object recognition0
BSED: Baseline Shapley-Based Explainable Detector0
Domain Adaptation on the Statistical Manifold0
Domain Adaptive Neural Networks for Object Recognition0
Building Machines That Learn and Think Like People0
Domain Generalization: A Survey0
Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation0
Adversarial Attack on Facial Recognition using Visible Light0
Empowering Knowledge Distillation via Open Set Recognition for Robust 3D Point Cloud Classification0
A New Urban Objects Detection Framework Using Weakly Annotated Sets0
C3PO: Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing0
Domain-invariant Face Recognition using Learned Low-rank Transformation0
Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection0
DIRL: Domain-Invariant Representation Learning for Sim-to-Real Transfer0
Domain-robust VQA with diverse datasets and methods but no target labels0
Can Boosting with SVM as Week Learners Help?0
Can domain adaptation make object recognition work for everyone?0
1 Million Captioned Dutch Newspaper Images0
Do semantic parts emerge in Convolutional Neural Networks?0
Bridging the Gap Between Computational Photography and Visual Recognition0
Do We Need More Training Data?0
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments0
Dreaming with ARC0
DTM: Deformable Template Matching0
Dual-attention Focused Module for Weakly Supervised Object Localization0
Dual coordinate solvers for large-scale structural SVMs0
Empowering Local Communities Using Artificial Intelligence0
Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery0
Dual Pose-invariant Embeddings: Learning Category and Object-specific Discriminative Representations for Recognition and Retrieval0
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset0
DuEqNet: Dual-Equivariance Network in Outdoor 3D Object Detection for Autonomous Driving0
Duplex Generative Adversarial Network for Unsupervised Domain Adaptation0
DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment0
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation0
DNN Quantization with Attention0
Cascade Region Proposal and Global Context for Deep Object Detection0
Dynamic reshaping of functional brain networks during visual object recognition0
Dynamic texture recognition using time-causal and time-recursive spatio-temporal receptive fields0
Diversity in Object Proposals0
Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition0
Categories and Functional Units: An Infinite Hierarchical Model for Brain Activations0
eCNN: A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference0
A neuromorphic approach to image processing and machine vision0
Edge Aware Learning for 3D Point Cloud0
Edge Detection Based Shape Identification0
EdgeOL: Efficient in-situ Online Learning on Edge Devices0
Advancing Egocentric Video Question Answering with Multimodal Large Language Models0
Effects of Real-Life Traffic Sign Alteration on YOLOv7- an Object Recognition Model0
Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition0
300 GHz Radar Object Recognition based on Deep Neural Networks and Transfer Learning0
Show:102550
← PrevPage 13 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified