SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 9511000 of 2042 papers

TitleStatusHype
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness0
Combining Lexical and Spatial Knowledge to Predict Spatial Relations between Objects in Images0
A Proper Orthogonal Decomposition approach for parameters reduction of Single Shot Detector networks0
Explaining Clinical Decision Support Systems in Medical Imaging using Cycle-Consistent Activation Maximization0
Combining Deep Transfer Learning with Signal-image Encoding for Multi-Modal Mental Wellbeing Classification0
Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations0
Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition0
Image Optimization and Prediction0
A Framework for Multi-View Classification of Features0
Combined Approach for Image Segmentation0
Expanding a robot's life: Low power object recognition via FPGA-based DCNN deployment0
Combinatorial clustering and the beta negative binomial process0
Imaging-free object recognition enabled by optical coherence0
Immersive Language Exploration with Object Recognition and Augmented Reality0
A Probabilistic Framework for Dynamic Object Recognition in 3D Environment With A Novel Continuous Ground Estimation Method0
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data0
Improved Deep Learning of Object Category using Pose Information0
Improved Deep Metric Learning with Multi-class N-pair Loss Objective0
Exact neural mass model for synaptic-based working memory0
Improved Inception-Residual Convolutional Neural Network for Object Recognition0
Improved training of binary networks for human pose estimation and image recognition0
Improved visible to IR image transformation using synthetic data augmentation with cycle-consistent adversarial networks0
A Voxel Graph CNN for Object Classification with Event Cameras0
Combinational neural network using Gabor filters for the classification of handwritten digits0
Coloring Objects: Adjective-Noun Visual Semantic Compositionality0
A priori compression of convolutional neural networks for wave simulators0
Enhancing Few-Shot Image Classification with Unlabelled Examples0
Improving Gibbs Sampler Scan Quality with DoGS0
Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations0
A Fog Robotic System for Dynamic Visual Servoing0
Improving neural networks with bunches of neurons modeled by Kumaraswamy units: Preliminary study0
Improving Performance of Object Detection using the Mechanisms of Visual Recognition in Humans0
EV-Flying: an Event-based Dataset for In-The-Wild Recognition of Flying Objects0
EventF2S: Asynchronous and Sparse Spiking AER Framework using Neuromorphic-Friendly Algorithm0
Improving Robustness of Feature Representations to Image Deformations using Powered Convolution in CNNs0
EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
Improving the Accuracy and Robustness of CNNs Using a Deep CCA Neural Data Regularizer0
Collaborative Descriptors: Convolutional Maps for Preprocessing0
Approximation of dilation-based spatial relations to add structural constraints in neural networks0
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects0
EventDance++: Language-guided Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition0
Incorporating Network Built-in Priors in Weakly-supervised Semantic Segmentation0
Incorporating Semantic Attention in Video Description Generation0
Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection0
Incorporating Textual Evidence in Visual Storytelling0
Incremental Learning for Robot Perception through HRI0
Collaboration Analysis Using Deep Learning0
Inducing Functions through Reinforcement Learning without Task Specification0
Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification0
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation0
Show:102550
← PrevPage 20 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified