SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 20012042 of 2042 papers

TitleStatusHype
Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities0
Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching0
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users0
Evaluating Progress on Machine Learning for Longitudinal Electronic Healthcare Data0
Evaluation of Environmental Conditions on Object Detection using Oriented Bounding Boxes for AR Applications0
EvConv: Fast CNN Inference on Event Camera Inputs For High-Speed Robot Perception0
EventDance++: Language-guided Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
EventF2S: Asynchronous and Sparse Spiking AER Framework using Neuromorphic-Friendly Algorithm0
EV-Flying: an Event-based Dataset for In-The-Wild Recognition of Flying Objects0
A Voxel Graph CNN for Object Classification with Event Cameras0
Exact neural mass model for synaptic-based working memory0
Expanding a robot's life: Low power object recognition via FPGA-based DCNN deployment0
Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations0
Explaining Clinical Decision Support Systems in Medical Imaging using Cycle-Consistent Activation Maximization0
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness0
Exploit Bounding Box Annotations for Multi-label Object Recognition0
Exploiting an Oracle that Reports AUC Scores in Machine Learning Contests0
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation0
Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks0
Exploiting Temporal Relations on Radar Perception for Autonomous Driving0
Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks0
Exploration of object recognition from 3D point cloud0
Exploring Context and Visual Pattern of Relationship for Scene Graph Generation0
Exploring Temporal Differences in 3D Convolutional Neural Networks0
The Origins and Prevalence of Texture Bias in Convolutional Neural Networks0
Exponential Discriminative Metric Embedding in Deep Learning0
Extreme Image Transformations Affect Humans and Machines Differently0
Extreme Image Transformations Facilitate Robust Latent Object Representations0
EZSR: Event-based Zero-Shot Recognition0
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset0
Face Identification with Second-Order Pooling0
Face processing emerges from object-trained convolutional neural networks0
Face-space Action Recognition by Face-Object Interactions0
Factorization of View-Object Manifolds for Joint Object Recognition and Pose Estimation0
Farm land weed detection with region-based deep convolutional neural networks0
Fashioning with Networks: Neural Style Transfer to Design Clothes0
Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition0
Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels0
Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations0
Fast Fourier Transformation for Optimizing Convolutional Neural Networks in Object Recognition0
Fast Neuromimetic Object Recognition using FPGA Outperforms GPU Implementations0
Show:102550
← PrevPage 41 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified