SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 701750 of 2042 papers

TitleStatusHype
Diversity in Object Proposals0
CNN-based search model underestimates attention guidance by simple visual features0
Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition0
A neuromorphic approach to image processing and machine vision0
Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities0
Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching0
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users0
Evaluating Progress on Machine Learning for Longitudinal Electronic Healthcare Data0
Evaluation of Environmental Conditions on Object Detection using Oriented Bounding Boxes for AR Applications0
EvConv: Fast CNN Inference on Event Camera Inputs For High-Speed Robot Perception0
Advancing Egocentric Video Question Answering with Multimodal Large Language Models0
300 GHz Radar Object Recognition based on Deep Neural Networks and Transfer Learning0
Few-shot target-driven instance detection based on open-vocabulary object detection models0
EventDance++: Language-guided Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
EventF2S: Asynchronous and Sparse Spiking AER Framework using Neuromorphic-Friendly Algorithm0
Fine-grained 3D object recognition: an approach and experiments0
EV-Flying: an Event-based Dataset for In-The-Wild Recognition of Flying Objects0
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN0
BranchConnect: Large-Scale Visual Recognition with Learned Branch Connections0
A Voxel Graph CNN for Object Classification with Event Cameras0
Exact neural mass model for synaptic-based working memory0
Combinatorial clustering and the beta negative binomial process0
Expanding a robot's life: Low power object recognition via FPGA-based DCNN deployment0
Distributed Coding of Multiview Sparse Sources with Joint Recovery0
Combined Approach for Image Segmentation0
A Neuro-AI Interface: Learning DNNs from the Human Brain0
Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations0
A Dual-hierarchy Semantic Graph for Robust Object Recognition0
Explaining Clinical Decision Support Systems in Medical Imaging using Cycle-Consistent Activation Maximization0
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness0
Exploit Bounding Box Annotations for Multi-label Object Recognition0
Feature Space Transfer for Data Augmentation0
Exploiting an Oracle that Reports AUC Scores in Machine Learning Contests0
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation0
Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks0
A Neural Spiking Approach Compared to Deep Feedforward Networks on Stepwise Pixel Erasement0
Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks0
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism0
Exploring Context and Visual Pattern of Relationship for Scene Graph Generation0
A randomized gradient-free attack on ReLU networks0
Exploring Temporal Differences in 3D Convolutional Neural Networks0
The Origins and Prevalence of Texture Bias in Convolutional Neural Networks0
Feature Evaluation of Deep Convolutional Neural Networks for Object Recognition and Detection0
Exponential Discriminative Metric Embedding in Deep Learning0
Extreme Image Transformations Affect Humans and Machines Differently0
Extreme Image Transformations Facilitate Robust Latent Object Representations0
EZSR: Event-based Zero-Shot Recognition0
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset0
Disentangling Properties of Contrastive Methods0
Show:102550
← PrevPage 15 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified