SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 14511500 of 2042 papers

TitleStatusHype
Meta-neural-network for Realtime and Passive Deep-learning-based Object Recognition0
Metric Learning as a Service with Covariance Embedding0
MIML-FCN+: Multi-instance Multi-label Learning via Fully Convolutional Networks with Privileged Information0
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds0
Minimal Images in Deep Neural Networks: Fragile Object Recognition in Natural Images0
MirBot: A collaborative object recognition system for smartphones using convolutional neural networks0
Misalignment Resilient Diffractive Optical Networks0
Missing Modalities Imputation via Cascaded Residual Autoencoder0
Mitigating the Hubness Problem for Zero-Shot Learning of 3D Objects0
MixStyle Neural Networks for Domain Generalization and Adaptation0
MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition0
Modality and Component Aware Feature Fusion For RGB-D Scene Classification0
Model alignment using inter-modal bridges0
Model-based active learning to detect isometric deformable objects in the wild with deep architectures0
Model compression as constrained optimization, with application to neural nets. Part I: general framework0
Modeling biological face recognition with deep convolutional neural networks0
Modeling Human Development: Effects of Blurred Vision on Category Learning in CNNs0
Modeling infant object perception as program induction0
Modeling Object Recognition in Newborn Chicks using Deep Neural Networks0
Modeling the Contribution of Central Versus Peripheral Vision in Scene, Object, and Face Recognition0
Modeling the Sequence of Brain Volumes by Local Mesh Models for Brain Decoding0
Modelling Local Deep Convolutional Neural Network Features to Improve Fine-Grained Image Classification0
Modelling of LIDAR sensor disturbances by solid airborne particles0
Monocular SLAM Supported Object Recognition0
MonoStream: A Minimal-Hardware High Accuracy Device-free WLAN Localization System0
More Than Accuracy: Towards Trustworthy Machine Learning Interfaces for Object Recognition0
More Words and Bigger Pictures0
MOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV Videos0
Motion Equivariance OF Event-based Camera Data with the Temporal Normalization Transform0
Motion Mapping Cognition: A Nondecomposable Primary Process in Human Vision0
MotionRec: A Unified Deep Framework for Moving Object Recognition0
Motion Segmentation from a Moving Monocular Camera0
Multi-3D-Models Registration-Based Augmented Reality (AR) Instructions for Assembly0
MultiCrossViT: Multimodal Vision Transformer for Schizophrenia Prediction using Structural MRI and Functional Network Connectivity Data0
Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning0
Multi-label Multiple Kernel Learning by Stochastic Approximation: Application to Visual Object Recognition0
Multi-label Object Attribute Classification using a Convolutional Neural Network0
Multilevel Context Representation for Improving Object Recognition0
Multimodal Skip-gram Using Convolutional Pseudowords0
Multiple Class Novelty Detection Under Data Distribution Shift0
Multiple Instance Learning Convolutional Neural Networks for Object Recognition0
Multiple Kernel Sparse Representations for Supervised and Unsupervised Learning0
Multiple Moving Object Recognitions in video based on Log Gabor-PCA Approach0
Multipod Convolutional Network0
Multiscale Adaptive Representation of Signals: I. The Basic Framework0
Multi-Scale Saliency Detection using Dictionary Learning0
Multi-Scale Weight Sharing Network for Image Recognition0
Multisource Region Attention Network for Fine-Grained Object Recognition in Remote Sensing Imagery0
Multitask Network for Joint Object Detection, Semantic Segmentation and Human Pose Estimation in Vehicle Occupancy Monitoring0
Multi-View Harmonized Bilinear Network for 3D Object Recognition0
Show:102550
← PrevPage 30 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified