SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 17011750 of 2042 papers

TitleStatusHype
BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition0
Bio-inspired learnable divisive normalization for ANNs0
Bio-inspired Unsupervised Learning of Visual Features Leads to Robust Invariant Object Recognition0
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once0
Block-wise Scrambled Image Recognition Using Adaptation Network0
Boosting Cross-task Transferability of Adversarial Patches with Visual Relations0
Boosting Gradient for White-Box Adversarial Attacks0
Boosting Object Recognition in Point Clouds by Saliency Detection0
Boosting with Maximum Adaptive Sampling0
BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition0
BPGrad: Towards Global Optimality in Deep Learning via Branch and Pruning0
Brain Cancer Segmentation Using YOLOv5 Deep Neural Network0
Brain Inspired Face Recognition: A Computational Framework0
Brain-Like Object Recognition Neural Networks are more robustness to common corruptions0
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism0
BranchConnect: Large-Scale Visual Recognition with Learned Branch Connections0
Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition0
Bridging the Gap Between Computational Photography and Visual Recognition0
Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation0
Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture0
BSED: Baseline Shapley-Based Explainable Detector0
Building a visual semantics aware object hierarchy0
Building Machines That Learn and Think Like People0
C3PO: Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing0
CAggNet: Crossing Aggregation Network for Medical Image Segmentation0
Can Boosting with SVM as Week Learners Help?0
Can domain adaptation make object recognition work for everyone?0
Can foundation models actively gather information in interactive environments to test hypotheses?0
Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection0
Capacity limitations of visual search in deep convolutional neural networks0
Capturing the objects of vision with neural networks0
Cascade Region Proposal and Global Context for Deep Object Detection0
Catastrophic Child's Play: Easy to Perform, Hard to Defend Adversarial Attacks0
Categorical Mixture Models on VGGNet activations0
Categories and Functional Units: An Infinite Hierarchical Model for Brain Activations0
CausalX: Causal Explanations and Block Multilinear Factor Analysis0
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding0
Certifiable Artificial Intelligence Through Data Fusion0
Characterising representation dynamics in recurrent neural networks for object recognition0
ChartKG: A Knowledge-Graph-Based Representation for Chart Images0
ChoiceNet: CNN learning through choice of multiple feature map representations0
Chosen methods of improving small object recognition with weak recognizable features0
CIFAR10 to Compare Visual Recognition Performance between Deep Neural Networks and Humans0
Classification and Geometry of General Perceptual Manifolds0
Classifier-to-Generator Attack: Estimation of Training Data Distribution from Classifier0
Classifying Malware Images with Convolutional Neural Network Models0
Class incremental learning for video action classification0
Class-specific Anchoring Proposal for 3D Object Recognition in LIDAR and RGB Images0
CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation0
Cloud based Scalable Object Recognition from Video Streams using Orientation Fusion and Convolutional Neural Networks0
Show:102550
← PrevPage 35 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified