SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 351400 of 2042 papers

TitleStatusHype
Human Action Recognition in Still Images Using ConViT0
A Novel Explainable Artificial Intelligence Model in Image Classification problem0
Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning0
Object Recognition System on a Tactile Device for Visually Impaired0
Predicting beauty, liking, and aesthetic quality: A comparative analysis of image databases for visual aesthetics research0
Look, Remember and Reason: Grounded reasoning in videos with language models0
Evaluation of Environmental Conditions on Object Detection using Oriented Bounding Boxes for AR Applications0
Fine-grained 3D object recognition: an approach and experiments0
Towards Language Models That Can See: Computer Vision Through the LENS of Natural LanguageCode2
Regulation of Mouse Learning and Mood by the Anti-Inflammatory Cytokine Interleukin-100
Object Detection based on the Collection of Geometric Evidence0
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
Resume Information Extraction via Post-OCR Text Processing0
Object Topological Character Acquisition by Inductive Learning0
Sample-Efficient Learning of Novel Visual ConceptsCode0
Bayesian and Neural Inference on LSTM-based Object Recognition from Tactile and Kinesthetic InformationCode0
EventCLIP: Adapting CLIP for Event-based Object RecognitionCode1
EXOT: Exit-aware Object Tracker for Safe Robotic Manipulation of Moving ObjectCode0
A newborn embodied Turing test for view-invariant object recognition0
Performance-optimized deep neural networks are evolving into worse models of inferotemporal visual cortex0
Adversarial alignment: Breaking the trade-off between the strength of an attack and its relevance to human perception0
The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects0
CVSNet: A Computer Implementation for Central Visual System of The Brain0
A Framework For Refining Text Classification and Object Recognition from Academic Articles0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?0
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated InteractionsCode0
Target-Aware Generative Augmentations for Single-Shot AdaptationCode0
CNN-based Methods for Object Recognition with High-Resolution Tactile SensorsCode0
Paxion: Patching Action Knowledge in Video-Language Foundation ModelsCode1
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses0
Learning Semi-supervised Gaussian Mixture Models for Generalized Category DiscoveryCode1
Effects of Real-Life Traffic Sign Alteration on YOLOv7- an Object Recognition Model0
Egocentric Hierarchical Visual Semantics0
Persistent Homology Meets Object Unity: Object Recognition in ClutterCode0
GAANet: Ghost Auto Anchor Network for Detecting Varying Size Drones in DarkCode0
HeteroEdge: Addressing Asymmetry in Heterogeneous Collaborative Autonomous Systems0
A Systematic Study on Object Recognition Using Millimeter-wave Radar0
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN0
Neurosymbolic AI - Why, What, and How0
Discover and Cure: Concept-aware Mitigation of Spurious CorrelationCode1
Deep Graph Reprogramming0
From Chaos Comes Order: Ordering Event Representations for Object Recognition and DetectionCode1
Investigating the Nature of 3D Generalization in Deep Neural NetworksCode0
Optimizing Multi-Domain Performance with Active Learning-based Improvement Strategies0
Pinpointing Why Object Recognition Performance Degrades Across Income Levels and GeographiesCode0
A priori compression of convolutional neural networks for wave simulators0
Boosting Cross-task Transferability of Adversarial Patches with Visual Relations0
Domain Generalization In Robust Invariant RepresentationCode0
What's in a Name? Beyond Class Indices for Image Recognition0
Show:102550
← PrevPage 8 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified