SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 13511400 of 2042 papers

TitleStatusHype
Latent Bi-constraint SVM for Video-based Object Recognition0
Latent Cognizance: What Machine Really Learns0
Latent Constrained Correlation Filter0
Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning0
Layerwise complexity-matched learning yields an improved model of cortical area V20
Learn and Search: An Elegant Technique for Object Lookup using Contrastive Learning0
Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks0
Learning about Canonical Views from Internet Image Collections0
Learning a discriminative hidden part model for human action recognition0
Learning and Calibrating Per-Location Classifiers for Visual Place Recognition0
Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks0
Learning Attributes Equals Multi-Source Domain Generalization0
Learning Beyond Human Expertise with Generative Models for Dental Restorations0
Learning by Asking Questions for Knowledge-based Novel Object Recognition0
Learning Canonical 3D Object Representation for Fine-Grained Recognition0
Learning Collections of Part Models for Object Recognition0
Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks0
Learning data association without data association: An EM approach to neural assignment prediction0
Learning Deep Features for Scene Recognition using Places Database0
Learning Descriptors for Object Recognition and 3D Pose Estimation0
Learning Detailed Face Reconstruction from a Single Image0
Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels0
Learning from Extrinsic and Intrinsic Supervisions for Domain Generalization0
Learning From Less Data: A Unified Data Subset Selection and Active Learning Framework for Computer Vision0
Learning From Less Data: Diversified Subset Selection and Active Learning in Image Classification Tasks0
Learning Good Representation via Continuous Attention0
Learning Hard Alignments with Variational Inference0
Learning Hierarchical Sparse Representations using Iterative Dictionary Learning and Dimension Reduction0
Learning Illuminant Estimation from Object Recognition0
Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics0
Learning in Markov Random Fields using Tempered Transitions0
Learning invariant representations and applications to face verification0
Learning Localized Geometric Features Using 3D-CNN: An Application to Manufacturability Analysis of Drilled Holes0
Learning Multi-Scale Representations for Material Classification0
Learning Probabilistic Intersection Traffic Models for Trajectory Prediction0
Learning Receptive Fields for Pooling from Tensors of Feature Response0
Learning Relationships for Multi-View 3D Object Recognition0
Learning Representations of Graph Data -- A Survey0
Learning Robust Object Recognition Using Composed Scenes from Generative Models0
Learning Scalable Discriminative Dictionary with Sample Relatedness0
Learning Scene Gist with Convolutional Neural Networks to Improve Object Recognition0
Learning Semantic Similarities for Prototypical Classifiers0
Learning the Redundancy-free Features for Generalized Zero-Shot Object Recognition0
Learning to Combine Mid-Level Cues for Object Proposal Generation0
Learning to Learn: How to Continuously Teach Humans and Machines0
Learning to Learn with Compound HD Models0
Learning to Localize and Align Fine-Grained Actions to Sparse Instructions0
Learning to Recognize Objects by Retaining other Factors of Variation0
Learning to Represent Image and Text with Denotation Graph0
Learning to see across Domains and Modalities0
Show:102550
← PrevPage 28 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified