SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 251275 of 2042 papers

TitleStatusHype
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition0
Textureless Object Recognition: An Edge-based Approach0
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation0
LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition0
MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual GroundingCode1
Dual Pose-invariant Embeddings: Learning Category and Object-specific Discriminative Representations for Recognition and Retrieval0
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model0
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments0
Probing Multimodal Large Language Models for Global and Local Semantic RepresentationsCode0
CLoVe: Encoding Compositional Language in Contrastive Vision-Language ModelsCode1
ISCUTE: Instance Segmentation of Cables Using Text Embedding0
SpikeNAS: A Fast Memory-Aware Neural Architecture Search Framework for Spiking Neural Network-based Autonomous Agents0
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection0
Optimizing Sparse Convolution on GPUs with CUDA for 3D Point Cloud Processing in Embedded Systems0
A Benchmark Grocery Dataset of Realworld Point Clouds From Single View0
Logical recognition method for solving the problem of identification in the Internet of Things0
A comparison between humans and AI at recognizing objects in unusual posesCode0
SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language ModelsCode1
Motion Mapping Cognition: A Nondecomposable Primary Process in Human Vision0
Self-supervised learning of video representations from a child's perspectiveCode1
Lightweight Pixel Difference Networks for Efficient Visual Representation LearningCode4
Local Feature Matching Using Deep Learning: A SurveyCode2
EdgeOL: Efficient in-situ Online Learning on Edge Devices0
Achieving More Human Brain-Like Vision via Human EEG Representational Alignment0
EventF2S: Asynchronous and Sparse Spiking AER Framework using Neuromorphic-Friendly Algorithm0
Show:102550
← PrevPage 11 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified