SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 651700 of 2042 papers

TitleStatusHype
Data-driven Verification of DNNs for Object Recognition0
Data-Driven 3D Voxel Patterns for Object Category Recognition0
A temporal neural network model for object recognition using a biologically plausible decision making layer0
Data Augmentation by Selecting Mixed Classes Considering Distance Between Classes0
DAS: A Deformable Attention to Capture Salient Information in CNNs0
A Systematic Study on Object Recognition Using Millimeter-wave Radar0
DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection0
A Survey on Recent Advances of Computer Vision Algorithms for Egocentric Video0
`Lighter' Can Still Be Dark: Modeling Comparative Color Descriptions0
Adaptive Color Attributes for Real-Time Visual Tracking0
AAD: Adaptive Anomaly Detection through traffic surveillance videos0
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence0
CVSNet: A Computer Implementation for Central Visual System of The Brain0
A Survey of Task-Based Machine Learning Content Extraction Services for VIDINT0
CURL: Co-trained Unsupervised Representation Learning for Image Classification0
A Study of Image Pre-processing for Faster Object Recognition0
Algorithms for Object Detection in Substations0
Crowdsourcing in Computer Vision0
A Study for Universal Adversarial Attacks on Texture Recognition0
ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition0
Adaptive Active Learning for Image Classification0
CPWC: Contextual Point Wise Convolution for Object Recognition0
Counting the learnable functions of structured data0
Early or Late Fusion Matters: Efficient RGB-D Fusion in Vision Transformers for 3D Object Recognition0
Co-training Transformer with Videos and Images Improves Action Recognition0
A large comparison of feature-based approaches for buried target classification in forward-looking ground-penetrating radar0
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection0
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation0
Assessing The Importance Of Colours For CNNs In Object Recognition0
CortexNet: a Generic Network Family for Robust Visual Temporal Representations0
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition0
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels0
A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition0
A 51.3 TOPS/W, 134.4 GOPS In-memory Binary Image Filtering in 65nm CMOS0
Coordinating Cross-modal Distillation for Molecular Property Prediction0
A Spike Learning System for Event-driven Object Recognition0
Cooking Object's State Identification Without Using Pretrained Model0
Co-occurrence matrix analysis-based semi-supervised training for object detection0
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation0
Airplane Type Identification Based on Mask RCNN; An Approach to Reduce Airport Traffic Congestion0
Convolutional Tables Ensemble: classification in microseconds0
Convolutional Spike Timing Dependent Plasticity based Feature Learning in Spiking Neural Networks0
Convolutional Prototype Learning for Zero-Shot Recognition0
Convolutional Neural Networks with Intra-Layer Recurrent Connections for Scene Labeling0
A Simulation Method for MMW Radar Sensing in Traffic Intersection Based on BART Algorithm0
AIris: An AI-powered Wearable Assistive Device for the Visually Impaired0
Convolutional Neural Networks Trained to Identify Words Provide a Surprisingly Good Account of Visual Form Priming Effects0
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future0
Convolutional Networks with Dense Connectivity0
A ``Shape Aware'' Model for semi-supervised Learning of Objects and its Context0
Show:102550
← PrevPage 14 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified