SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 901950 of 2042 papers

TitleStatusHype
Privacy Leakage of SIFT Features via Deep Generative Model based Image ReconstructionCode0
Deep Learning Techniques for Geospatial Data Analysis0
Zero-Shot Learning from Adversarial Feature Residual to Compact Visual Feature0
All About Knowledge Graphs for Actions0
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds0
Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting for Label ImbalanceCode0
Vision at A Glance: Interplay between Fine and Coarse Information Processing Pathways0
Line-Circle-Square (LCS): A Multilayered Geometric Filter for Edge-Based DetectionCode0
A Self-supervised GAN for Unsupervised Few-shot Object Recognition0
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance SegmentationCode1
Offline Meta-Reinforcement Learning with Advantage WeightingCode1
PAM:Point-wise Attention Module for 6D Object Pose Estimation0
Adversarial Examples on Object Recognition: A Comprehensive Survey0
Webly Supervised Semantic Embeddings for Large Scale Zero-Shot Learning0
Active Perception using Light Curtains for Autonomous Driving0
More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for Object Recognition0
MOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV Videos0
Multiple Class Novelty Detection Under Data Distribution Shift0
TactileSGNet: A Spiking Graph Neural Network for Event-based Tactile Object RecognitionCode1
Self-supervised Visual Attribute Learning for Fashion Compatibility0
Feature Learning for Accelerometer based Gait RecognitionCode0
Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases0
Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve0
Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains0
Self-Supervised Learning Across Domains0
Parkinson's Disease Detection with Ensemble Architectures based on ILSVRC Models0
Learning from Extrinsic and Intrinsic Supervisions for Domain Generalization0
Adaptive Hierarchical Decomposition of Large Deep Networks0
The Effect of Top-Down Attention in Occluded Object Recognition0
Do Adversarially Robust ImageNet Models Transfer Better?Code1
When and how CNNs generalize to out-of-distribution category-viewpoint combinationsCode1
Hardware Implementation of Hyperbolic Tangent Function using Catmull-Rom Spline Interpolation0
Nested Learning For Multi-Granular Tasks0
Seeing eye-to-eye? A comparison of object recognition performance in humans and deep convolutional neural networks under image manipulation0
Quaternion Capsule NetworksCode0
Single Shot MC Dropout ApproximationCode1
Are Labels Always Necessary for Classifier Accuracy Evaluation?0
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks0
Parkinson's Disease Detection Using Ensemble Architecture from MR Images0
Beyond accuracy: quantifying trial-by-trial behaviour of CNNs and humans by measuring error consistencyCode0
Fast Training of Deep Networks with One-Class CNNs0
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion0
A Comparative Analysis on Bangla Handwritten Digit Recognition with Data Augmentation and Non-Augmentation ProcessCode0
Teaching CNNs to mimic Human Visual Cognitive Process & regularise Texture-Shape biasCode0
Hierarchically Compositional Tasks and Deep Convolutional Networks0
Object recognition through pose and shape estimation0
Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation0
RP2K: A Large-Scale Retail Product Dataset for Fine-Grained Image Classification0
Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group SoftmaxCode1
Enhancing Few-Shot Image Classification with Unlabelled Examples0
Show:102550
← PrevPage 19 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified