SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, the state-of-the-art tables are recorded separately for each component task.

(Image credit: TensorFlow Object Detection API)
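The split into detection followed by classification can be sketched as a two-stage pipeline. The sketch below is only illustrative: `recognize`, `toy_detect`, and `toy_classify` are hypothetical names, and the toy detector and classifier are trivial stand-ins for trained models, not any method from the papers listed on this page.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]   # (top, left, bottom, right), end-exclusive
Image = Sequence[Sequence[int]]   # grayscale image as rows of pixel values


@dataclass
class Recognition:
    box: Box
    label: str


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the region described by `box` out of the image."""
    top, left, bottom, right = box
    return [list(row[left:right]) for row in image[top:bottom]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[List[List[int]]], str],
) -> List[Recognition]:
    """Stage 1: locate candidate objects. Stage 2: label each cropped region."""
    return [Recognition(box, classify(crop(image, box))) for box in detect(image)]


def toy_detect(image: Image) -> List[Box]:
    """Hypothetical detector: one box around all non-zero pixels, if any."""
    coords = [(r, c) for r, row in enumerate(image) for c, v in enumerate(row) if v]
    if not coords:
        return []
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return [(min(rows), min(cols), max(rows) + 1, max(cols) + 1)]


def toy_classify(patch: List[List[int]]) -> str:
    """Hypothetical classifier: label a patch by its aspect ratio."""
    height = len(patch)
    width = len(patch[0]) if patch else 0
    return "wide" if width > height else "tall"


image = [
    [0, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]
results = recognize(image, toy_detect, toy_classify)
```

In a real system the two callables would be replaced by a trained detector and a trained classifier; keeping them as separate arguments mirrors the way the two component tasks are benchmarked separately.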

Papers

Showing 701–750 of 2042 papers

| Title | Status | Hype |
| --- | --- | --- |
| Wasserstein Barycenter for Multi-Source Domain Adaptation | Code | 1 |
| Graph-Based High-Order Relation Discovery for Fine-Grained Recognition | | 0 |
| Cloud based Scalable Object Recognition from Video Streams using Orientation Fusion and Convolutional Neural Networks | | 0 |
| Place recognition survey: An update on deep learning approaches | | 0 |
| Hidden Patch Attacks for Optical Flow | | 0 |
| The role of temporal cortex in the control of attention | | 0 |
| Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result | | 0 |
| Deep Subdomain Adaptation Network for Image Classification | Code | 1 |
| Self-Supervised Learning with Kernel Dependence Maximization | Code | 1 |
| Modeling Object Recognition in Newborn Chicks using Deep Neural Networks | | 0 |
| Partial success in closing the gap between human and machine vision | Code | 1 |
| A Novel mapping for visual to auditory sensory substitution | | 0 |
| Learning the Precise Feature for Cluster Assignment | Code | 0 |
| On the role of feedback in visual processing: a predictive coding perspective | Code | 0 |
| Person Re-Identification with a Locally Aware Transformer | Code | 1 |
| A Hybrid APM-CPGSO Approach for Constraint Satisfaction Problem Solving: Application to Remote Sensing | | 0 |
| Convolutional Neural Networks with Gated Recurrent Connections | Code | 1 |
| DOCTOR: A Simple Method for Detecting Misclassification Errors | Code | 1 |
| Simultaneous Multi-View Object Recognition and Grasping in Open-Ended Domains | | 0 |
| Statistical Mechanics of Neural Processing of Object Manifolds | | 0 |
| A Voxel Graph CNN for Object Classification with Event Cameras | | 0 |
| Highlight Timestamp Detection Model for Comedy Videos via Multimodal Sentiment Analysis | | 0 |
| GuideMe: A Mobile Application based on Global Positioning System and Object Recognition Towards a Smart Tourist Guide | | 0 |
| Weakly Supervised Instance Attention for Multisource Fine-Grained Object Recognition with an Application to Tree Species Classification | | 0 |
| An online passive-aggressive algorithm for difference-of-squares classification | | 0 |
| Opening Deep Neural Networks with Generative Models | Code | 0 |
| Superpixel-based Knowledge Infusion in Deep Neural Networks for Image Classification | Code | 1 |
| Are Convolutional Neural Networks or Transformers more like human vision? | Code | 1 |
| Brain Inspired Face Recognition: A Computational Framework | | 0 |
| SizeNet: Object Recognition via Object Real Size-based Convolutional Networks | | 0 |
| ORCEA: Object Recognition by Continuous Evidence Assimilation | | 0 |
| Modelling of LIDAR sensor disturbances by solid airborne particles | | 0 |
| BIM Hyperreality: Data Synthesis Using BIM and Hyperrealistic Rendering for Deep Learning | | 0 |
| This Looks Like That... Does it? Shortcomings of Latent Space Prototype Interpretability in Deep Networks | Code | 1 |
| Canonical Saliency Maps: Decoding Deep Face Models | Code | 0 |
| Learning data association without data association: An EM approach to neural assignment prediction | | 0 |
| IPatch: A Remote Adversarial Patch | | 0 |
| ODDObjects: A Framework for Multiclass Unsupervised Anomaly Detection on Masked Objects | Code | 0 |
| RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition | Code | 1 |
| Recurrent Feedback Improves Recognition of Partially Occluded Objects | | 0 |
| SPARK: SPAcecraft Recognition leveraging Knowledge of Space Environment | | 0 |
| Rock Hunting With Martian Machine Vision | | 0 |
| ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition | Code | 1 |
| Artificial and beneficial -- Exploiting artificial images for aerial vehicle detection | | 0 |
| Tuned Compositional Feature Replays for Efficient Stream Learning | Code | 0 |
| Achieving Domain Generalization in Underwater Object Detection by Domain Mixup and Contrastive Learning | | 0 |
| Training Deep Neural Networks via Branch-and-Bound | Code | 0 |
| A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis | | 0 |
| Domain-robust VQA with diverse datasets and methods but no target labels | | 0 |
| CNN-based search model underestimates attention guidance by simple visual features | | 0 |
Page 15 of 41

Benchmark Results

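| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Imagen | shape bias | 98.7 | | Unverified |
| 2 | Stable Diffusion | shape bias | 92.7 | | Unverified |
| 3 | Parti | shape bias | 91.7 | | Unverified |
| 4 | ViT-22B-384 | shape bias | 86.4 | | Unverified |
| 5 | ViT-22B-560 | shape bias | 83.8 | | Unverified |
| 6 | CLIP (ViT-B) | shape bias | 79.9 | | Unverified |
| 7 | ViT-22B-224 | shape bias | 78 | | Unverified |
| 8 | ResNet-50 (L2 eps 5.0 adv trained) | shape bias | 69.5 | | Unverified |
| 9 | ResNet-50 (with strong augmentations) | shape bias | 62.2 | | Unverified |
| 10 | SWSL (ResNeXt-101) | shape bias | 49.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Spike-VGG11 | Accuracy (%) | 85.55 | | Unverified |
| 2 | SSNN | Accuracy (%) | 78.57 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Spike-VGG11 | Accuracy (%) | 85.62 | | Unverified |
| 2 | SSNN | Accuracy (%) | 79.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 18.75 | | Unverified |
| 2 | yun | Top 5 Accuracy | 14.75 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | DY | Top 5 Accuracy | 0.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ObjectNet-Baseline | Top 5 Accuracy | 52.24 | | Unverified |
| 2 | AJ2021 | Top 5 Accuracy | 27.68 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SSNN | Accuracy (%) | 94.91 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Faster-RCNN | mAP | 30.39 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Spike-VGG11 | Accuracy (%) | 96 | | Unverified |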