SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 351400 of 2042 papers

TitleStatusHype
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Context-Aware Zero-Shot RecognitionCode0
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural ImagesCode0
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?Code0
Characterizing and evaluating adversarial examples for Offline Handwritten Signature VerificationCode0
Human Pose Estimation for Real-World Crowded ScenariosCode0
Finding Tiny FacesCode0
Collaboratively Weighting Deep and Classic Representation via L2 Regularization for Image ClassificationCode0
Fine-grained Attention and Feature-sharing Generative Adversarial Networks for Single Image Super-ResolutionCode0
ImageNet Classification with Deep Convolutional Neural NetworksCode0
Generalizing to unseen domains via distribution matchingCode0
Image Privacy Prediction Using Deep Neural NetworksCode0
Central Moment Discrepancy (CMD) for Domain-Invariant Representation LearningCode0
FewSOL: A Dataset for Few-Shot Object Learning in Robotic EnvironmentsCode0
Foveated Instance SegmentationCode0
Fast Feature Fool: A data independent approach to universal adversarial perturbationsCode0
CBM: Curriculum by MaskingCode0
Feature Learning by Multidimensional Scaling and its Applications in Object RecognitionCode0
Faster gaze prediction with dense networks and Fisher pruningCode0
Comparative evaluation of CNN architectures for Image Caption GenerationCode0
Feature Learning for Accelerometer based Gait RecognitionCode0
A Framework of Transfer Learning in Object Detection for Embedded SystemsCode0
Causal importance of orientation selectivity for generalization in image recognitionCode0
Interpreting Adversarially Trained Convolutional Neural NetworksCode0
Facial Expression Recognition Research Based on Deep LearningCode0
Exploring Novel Object Recognition and Spontaneous Location Recognition Machine Learning Analysis Techniques in Alzheimer's MiceCode0
A Comparative Analysis on Bangla Handwritten Digit Recognition with Data Augmentation and Non-Augmentation ProcessCode0
Investigating the Nature of 3D Generalization in Deep Neural NetworksCode0
Feature Pyramid GridsCode0
Foveation in the Era of Deep LearningCode0
Kernel Manifold AlignmentCode0
Knowledge-driven Active LearningCode0
Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language ModelsCode0
MetaCOG: A Hierarchical Probabilistic Model for Learning Meta-Cognitive Visual RepresentationsCode0
EXOT: Exit-aware Object Tracker for Safe Robotic Manipulation of Moving ObjectCode0
Cartesian K-MeansCode0
Captioning Images with Diverse ObjectsCode0
Canonical Saliency Maps: Decoding Deep Face ModelsCode0
Ensemble learning in CNN augmented with fully connected subnetworksCode0
Experiments with mmWave Automotive Radar Test-bedCode0
Can Large Language Models Grasp Event Signals? Exploring Pure Zero-Shot Event-based RecognitionCode0
Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch RecognitionCode0
Are Vision Transformers More Data Hungry Than Newborn Visual Systems?Code0
Efficient Event Stream Super-Resolution with Recursive Multi-Branch FusionCode0
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training AcceleratorsCode0
Learning Where to Edit Vision TransformersCode0
Probing Human Visual Robustness with Neurally-Guided Deep Neural NetworksCode0
End-to-End Learning of Representations for Asynchronous Event-Based DataCode0
Dynamic Rectification Knowledge DistillationCode0
BViT: Broad Attention based Vision TransformerCode0
Show:102550
← PrevPage 8 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified