SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 801825 of 2042 papers

TitleStatusHype
Context-LGM: Leveraging Object-Context Relation for Context-Aware Object Recognition0
Flexible deep transfer learning by separate feature embeddings and manifold alignment0
Flexible ViG: Learning the Self-Saliency for Flexible Object Recognition0
AI-based Density Recognition0
Active Perception using Light Curtains for Autonomous Driving0
ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection0
Forecasting Hands and Objects in Future Frames0
Global Deconvolutional Networks for Semantic Segmentation0
Formula-Supervised Visual-Geometric Pre-training0
Fourier descriptors based on the structure of the human primary visual cortex with applications to object recognition0
Foveated Downsampling Techniques0
Continual Hyperbolic Learning of Instances and Classes0
Continual-Learning-as-a-Service (CLaaS): On-Demand Efficient Adaptation of Predictive Models0
Illumination-Invariant Image from 4-Channel Images: The Effect of Near-Infrared Data in Shadow Removal0
Context-Aware Zero-Shot Learning for Object Recognition0
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured0
From the Virtual to the RealWorld: Referring to Objects in Real-World Spatial Scenes0
From Virtual to Real: A Framework for Verbal Interaction with Robots0
From Visual Attributes to Adjectives through Decompositional Distributional Semantics0
From Zero-shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis0
Efficient Online ML API Selection for Multi-Label Classification Tasks0
Geo-locating Road Objects using Inverse Haversine Formula with NVIDIA Driveworks0
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition0
Discriminatively Trained Sparse Code Gradients for Contour Detection0
Brain-Like Object Recognition Neural Networks are more robustness to common corruptions0
Show:102550
← PrevPage 33 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified