SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 19011950 of 2042 papers

TitleStatusHype
Development of Open Informal Dataset Affecting Autonomous Driving0
DeViSE: A Deep Visual-Semantic Embedding Model0
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition0
Dimensionality Reduction for Data in Multiple Feature Representations0
DipMe: Haptic Recognition of Granular Media for Tangible Interactive Applications0
Direct Object Recognition Without Line-of-Sight Using Optical Coherence0
Disaggregated Deep Learning via In-Physics Computing at Radio Frequency0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
Discrete Potts Model for Generating Superpixels on Noisy Images0
Discriminate-and-Rectify Encoders: Learning from Image Transformation Sets0
Discriminative Embedding Autoencoder with a Regressor Feedback for Zero-Shot Learning0
Discriminative Ferns Ensemble for Hand Pose Recognition0
Discriminatively Trained Sparse Code Gradients for Contour Detection0
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition0
Disentangled Deep Autoencoding Regularization for Robust Image Classification0
Disentangling Properties of Contrastive Methods0
Distributed Coding of Multiview Sparse Sources with Joint Recovery0
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN0
Diversity in Object Proposals0
DNN Quantization with Attention0
Does Object Recognition Work for Everyone?0
Does Progress On Object Recognition Benchmarks Improve Real-World Generalization?0
Domain Adaptation on the Statistical Manifold0
Domain Adaptive Neural Networks for Object Recognition0
Domain Generalization: A Survey0
Domain-invariant Face Recognition using Learned Low-rank Transformation0
Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection0
DIRL: Domain-Invariant Representation Learning for Sim-to-Real Transfer0
Domain-robust VQA with diverse datasets and methods but no target labels0
Do semantic parts emerge in Convolutional Neural Networks?0
Do We Need More Training Data?0
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments0
Dreaming with ARC0
DTM: Deformable Template Matching0
Dual-attention Focused Module for Weakly Supervised Object Localization0
Dual coordinate solvers for large-scale structural SVMs0
Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery0
Dual Pose-invariant Embeddings: Learning Category and Object-specific Discriminative Representations for Recognition and Retrieval0
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset0
DuEqNet: Dual-Equivariance Network in Outdoor 3D Object Detection for Autonomous Driving0
Duplex Generative Adversarial Network for Unsupervised Domain Adaptation0
DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment0
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation0
Dynamic reshaping of functional brain networks during visual object recognition0
Dynamic texture recognition using time-causal and time-recursive spatio-temporal receptive fields0
Easy Batch Normalization0
eCNN: A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference0
ECOR: Explainable CLIP for Object Recognition0
Edge Aware Learning for 3D Point Cloud0
Edge Detection Based Shape Identification0
Show:102550
← PrevPage 39 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified