SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 751800 of 2042 papers

TitleStatusHype
VDM-DA: Virtual Domain Modeling for Source Data-free Domain Adaptation0
Projection: A Mechanism for Human-like Reasoning in Artificial Intelligence0
DNN Quantization with Attention0
Contrastive Reasoning in Neural Networks0
F-SIOL-310: A Robotic Dataset and Benchmark for Few-Shot Incremental Object LearningCode1
Neural Networks for Semantic Gaze Analysis in XR Settings0
Hebbian Semi-Supervised Learning in a Sample Efficiency Setting0
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation0
PatchNet -- Short-range Template Matching for Efficient Video ProcessingCode1
Structure-From-Motion and RGBD Depth Fusion0
A Study of Face Obfuscation in ImageNetCode1
Point Cloud Sampling via Graph Balancing and Gershgorin Disc Alignment0
FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing ImageryCode1
A Discriminative Vectorial Framework for Multi-modal Feature Representation0
Contemplating real-world object classificationCode1
Offboard 3D Object Detection from Point Cloud Sequences0
Deep Clustering by Semantic Contrastive Learning0
Cooking Object's State Identification Without Using Pretrained Model0
Domain Generalization: A Survey0
A Multiclass Boosting Framework for Achieving Fast and Provable Adversarial Robustness0
Self-Supervised Multi-View Learning via Auto-Encoding 3D Transformations0
The Labeled Multiple Canonical Correlation Analysis for Information Fusion0
Achieving Competitive Play Through Bottom-Up Approach in Semantic Segmentation0
Learning Transferable Visual Models From Natural Language SupervisionCode2
CausalX: Causal Explanations and Block Multilinear Factor Analysis0
Comparative evaluation of CNN architectures for Image Caption GenerationCode0
Approximation of dilation-based spatial relations to add structural constraints in neural networks0
The Effects of Image Distribution and Task on Adversarial Robustness0
Efficient Online ML API Selection for Multi-Label Classification Tasks0
Learning Visual Models using a Knowledge Graph as a Trainer0
Grid Cell Path Integration For Movement-Based Visual Object RecognitionCode0
Comparison of semi-supervised deep learning algorithms for audio classificationCode1
An ecologically motivated image dataset for deep learning yields better models of human vision0
Seeing by haptic glance: reinforcement learning-based 3D object Recognition0
A Too-Good-to-be-True Prior to Reduce Shortcut Reliance0
Audiovisual Highlight Detection in Videos0
What does LIME really see in images?Code0
Enhancing efficiency of object recognition in different categorization levels by reinforcement learning in modular spiking neural networks0
Content Placement in Networks of Similarity Caches0
Large Scale Long-tailed Product Recognition System at Alibaba0
Overhead MNIST: A Benchmark Satellite Dataset0
Sill-Net: Feature Augmentation with Separated Illumination RepresentationCode1
Zero-shot Learning with Deep Neural Networks for Object Recognition0
The Effect of Class Definitions on the Transferability of Adversarial Attacks Against Forensic CNNs0
ISP Distillation0
A Spike Learning System for Event-driven Object Recognition0
The Unreasonable Effectiveness of Patches in Deep Convolutional Kernels MethodsCode0
A DCNN-based Arbitrarily-Oriented Object Detector for Quality Control and Inspection Application0
Machine learning with limited data0
Using Shape to Categorize: Low-Shot Learning with an Explicit Shape BiasCode1
Show:102550
← PrevPage 16 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified