SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 5175 of 2042 papers

TitleStatusHype
MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection0
Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving0
Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models0
TULIP: Towards Unified Language-Image Pretraining0
Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation0
OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions0
Seeing What's Not There: Spurious Correlation in Multimodal LLMs0
Object-Centric World Model for Language-Guided Manipulation0
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation0
Identity documents recognition and detection using semantic segmentation with convolutional neural network0
Deep learning based infrared small object segmentation: Challenges and future directions0
RAPTOR: Refined Approach for Product Table Object Recognition0
Revealing Bias Formation in Deep Neural Networks Through the Geometric Mechanisms of Human Visual Decoupling0
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models0
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition0
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal ModelsCode1
DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability0
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics0
Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities0
NUDT4MSTAR: A Large Dataset and Benchmark Towards Remote Sensing Object Recognition in the WildCode2
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement0
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression0
AI-Powered Assistive Technologies for Visual Impairment0
Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time0
Guided SAM: Label-Efficient Part Segmentation0
Show:102550
← PrevPage 3 of 82Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified