SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 16511700 of 2042 papers

TitleStatusHype
Pose-Invariant Object Recognition for Event-Based Vision with Slow-ELM0
Positive-Unlabeled Domain Adaptation0
Pragmatic descriptions of perceptual stimuli0
Predicting beauty, liking, and aesthetic quality: A comparative analysis of image databases for visual aesthetics research0
Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving0
Predicting When Saliency Maps Are Accurate and Eye Fixations Consistent0
Pre-Trained Convolutional Neural Network Features for Facial Expression Recognition0
Proactive Adversarial Defense: Harnessing Prompt Tuning in Vision-Language Models to Detect Unseen Backdoored Images0
Importance Filtered Cross-Domain Adaptation0
Procedural Text Generation from an Execution Video0
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks0
Projection: A Mechanism for Human-like Reasoning in Artificial Intelligence0
PROTOTYPE-ASSISTED ADVERSARIAL LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION0
Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks0
PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation0
Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency0
Quantifying Adversarial Sensitivity of a Model as a Function of the Image Distribution0
Quantifying Translation-Invariance in Convolutional Neural Networks0
Quantum Doubly Stochastic Transformers0
Query Adaptive Similarity Measure for RGB-D Object Recognition0
Radar-based Materials Classification Using Deep Wavelet Scattering Transform: A Comparison of Centimeter vs. Millimeter Wave Units0
RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline0
Randomized Kernel Multi-view Discriminant Analysis0
Ranking to Learn: Feature Ranking and Selection via Eigenvector Centrality0
RAPTOR: Refined Approach for Product Table Object Recognition0
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing0
Object-Size-Driven Design of Convolutional Neural Networks: Virtual Axle Detection based on Raw Data0
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation0
Reading Ancient Coin Legends: Object Recognition vs. OCR0
Achieving More Human Brain-Like Vision via Human EEG Representational Alignment0
Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement0
Real-time Monocular Object SLAM0
Real Time Surveillance for Low Resolution and Limited-Data Scenarios: An Image Set Classification Approach0
Real-world Object Recognition with Off-the-shelf Deep Conv Nets: How Many Objects can iCub Learn?0
Reason from Context with Self-supervised Learning0
Recognition Awareness: An Application of Latent Cognizance to Open-Set Recognition0
Recognizing Objects From Any View With Object and Viewer-Centered Representations0
Recognizing Objects In-the-wild: Where Do We Stand?0
Recognizing Open-Vocabulary Relations between Objects in Images0
Recognizing RGB Images by Learning from RGB-D Data0
Recurrent 3D Attentional Networks for End-to-End Active Object Recognition0
Recurrent Connections Aid Occluded Object Recognition by Discounting Occluders0
Recurrent Connectivity Aids Recognition of Partly Occluded Objects0
Recurrent Convolutional Neural Network for Object Recognition0
Recurrent Feedback Improves Recognition of Partially Occluded Objects0
Generalized Relevance Learning Grassmann QuantizationCode0
Generate To Adapt: Aligning Domains using Generative Adversarial NetworksCode0
A Probabilistic Theory of Deep LearningCode0
3D Object Recognition with Ensemble Learning --- A Study of Point Cloud-Based Deep Learning ModelsCode0
Parsing Geometry Using Structure-Aware Shape TemplatesCode0
Show:102550
← PrevPage 34 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified