SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 401450 of 2042 papers

TitleStatusHype
AI-based Density Recognition0
Active Perception using Light Curtains for Autonomous Driving0
Contextual Recurrent Convolutional Model for Robust Visual Learning0
Artistic Object Recognition by Unsupervised Style Adaptation0
Continual Hyperbolic Learning of Instances and Classes0
Continual-Learning-as-a-Service (CLaaS): On-Demand Efficient Adaptation of Predictive Models0
Collaborative Descriptors: Convolutional Maps for Preprocessing0
Continual Learning for Pose-Agnostic Object Recognition in 3D Point Clouds0
Artwork Recognition for Panorama Images Based on Optimized ASIFT and Cubic Projection0
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling0
Learning Visual Models using a Knowledge Graph as a Trainer0
Contrastive Object Detection Using Knowledge Graph Embeddings0
Contrastive Reasoning in Neural Networks0
Deep-Learning Convolutional Neural Networks for scattered shrub detection with Google Earth Imagery0
Collaboration Analysis Using Deep Learning0
Controlled Tactile Exploration and Haptic Object Recognition0
Convex Class Model on Symmetric Positive Definite Manifolds0
Convolutional Models for Joint Object Categorization and Pose Estimation0
Convolutional Networks with Dense Connectivity0
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future0
Approximate Log-Hilbert-Schmidt Distances Between Covariance Operators for Image Classification0
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation0
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs0
Convolutional Prototype Learning for Zero-Shot Recognition0
Convolutional Spike Timing Dependent Plasticity based Feature Learning in Spiking Neural Networks0
Convolutional Tables Ensemble: classification in microseconds0
Co-occurrence matrix analysis-based semi-supervised training for object detection0
Cooking Object's State Identification Without Using Pretrained Model0
Coordinating Cross-modal Distillation for Molecular Property Prediction0
A Spike Learning System for Event-driven Object Recognition0
Applications of Probabilistic Programming (Master's thesis, 2015)0
CortexNet: a Generic Network Family for Robust Visual Temporal Representations0
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation0
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection0
Co-training Transformer with Videos and Images Improves Action Recognition0
A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis0
Counting the learnable functions of structured data0
CPWC: Contextual Point Wise Convolution for Object Recognition0
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring In Data0
Application of Faster R-CNN model on Human Running Pattern Recognition0
Affordance Labeling and Exploration: A Manifold-Based Approach0
Deep learning based infrared small object segmentation: Challenges and future directions0
CURL: Co-trained Unsupervised Representation Learning for Image Classification0
A Study of Image Pre-processing for Faster Object Recognition0
CVSNet: A Computer Implementation for Central Visual System of The Brain0
A Survey of Task-Based Machine Learning Content Extraction Services for VIDINT0
DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection0
Deep Learning for Material recognition: most recent advances and open challenges0
DAS: A Deformable Attention to Capture Salient Information in CNNs0
Deep Learning for the Classification of Lung Nodules0
Show:102550
← PrevPage 9 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified