SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 18011850 of 2042 papers

TitleStatusHype
SegICP: Integrated Deep Semantic Segmentation and Pose EstimationCode0
Multiple Object Recognition with Visual AttentionCode0
SharpNet: Fast and Accurate Recovery of Occluding Contours in Monocular Depth EstimationCode0
Multi-level 3D CNN for Learning Multi-scale Spatial FeaturesCode0
Image Captioning using Deep Neural ArchitecturesCode0
Improved object recognition using neural networks trained to mimic the brain's statistical propertiesCode0
Multiscale Dubuc: A New Similarity Measure for Time SeriesCode0
Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object RecognitionCode0
A Dataset for Crucial Object Recognition in Blind and Low-Vision Individuals' NavigationCode0
ImageNet Classification with Deep Convolutional Neural NetworksCode0
Food Image Recognition by Using Convolutional Neural Networks (CNNs)Code0
Prediction Surface Uncertainty Quantification in Object Detection Models for Autonomous DrivingCode0
Fit to Measure: Reasoning about Sizes for Robust Object RecognitionCode0
Multi-stage Deep Classifier Cascades for Open World RecognitionCode0
Unsupervised Domain Adaptation using Feature-Whitening and Consensus LossCode0
Image Privacy Prediction Using Deep Neural NetworksCode0
Privacy Leakage of SIFT Features via Deep Generative Model based Image ReconstructionCode0
Image Style Transfer Using Convolutional Neural NetworksCode0
Imagine2touch: Predictive Tactile Sensing for Robotic Manipulation using Efficient Low-Dimensional SignalsCode0
SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous DrivingCode0
Self-supervised Domain Adaptation for Computer Vision TasksCode0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Probing Multimodal Large Language Models for Global and Local Semantic RepresentationsCode0
Transfer Learning based Detection of Diabetic Retinopathy from Small DatasetCode0
MVP-Bench: Can Large Vision--Language Models Conduct Multi-level Visual Perception Like Humans?Code0
MVT: Multi-view Vision Transformer for 3D Object RecognitionCode0
Characterizing and evaluating adversarial examples for Offline Handwritten Signature VerificationCode0
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOSCode0
Naturally Computed Scale Invariance in the Residual Stream of ResNet18Code0
Projected Distribution Loss for Image EnhancementCode0
Improving Annotation for 3D Pose Dataset of Fine-Grained Object CategoriesCode0
Fine-grained Attention and Feature-sharing Generative Adversarial Networks for Single Image Super-ResolutionCode0
Central Moment Discrepancy (CMD) for Domain-Invariant Representation LearningCode0
CBM: Curriculum by MaskingCode0
Continual egocentric object recognitionCode0
SwiDeN : Convolutional Neural Networks For Depiction Invariant Object RecognitionCode0
Finding Tiny FacesCode0
THOR2: Topological Analysis for 3D Shape and Color-Based Human-Inspired Object Recognition in Unseen EnvironmentsCode0
NetTailor: Tuning the Architecture, Not Just the WeightsCode0
Improving Pre-Trained Weights Through Meta-Heuristics Fine-TuningCode0
FewSOL: A Dataset for Few-Shot Object Learning in Robotic EnvironmentsCode0
ContextMix: A context-aware data augmentation method for industrial visual inspection systemsCode0
Improving Robustness to Model Inversion Attacks via Sparse Coding ArchitecturesCode0
Context-Aware Zero-Shot RecognitionCode0
Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position PredictivityCode0
Inception Recurrent Convolutional Neural Network for Object RecognitionCode0
Putting visual object recognition in contextCode0
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape RecognitionCode0
Temporal-Coded Deep Spiking Neural Network with Easy Training and Robust PerformanceCode0
PydMobileNet: Improved Version of MobileNets with Pyramid Depthwise Separable ConvolutionCode0
Show:102550
← PrevPage 37 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified