SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 10511100 of 2042 papers

TitleStatusHype
Ensemble learning in CNN augmented with fully connected subnetworksCode0
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset0
Towards a Framework for Visual Intelligence in Service Robotics: Epistemic Requirements and Gap Analysis0
Measurement-driven Analysis of an Edge-Assisted Object Recognition System0
The iCub multisensor datasets for robot and computer vision applications0
Hand-Priming in Object Localization for Assistive Egocentric Vision0
Triangle-Net: Towards Robustness in Point Cloud LearningCode0
Compositional Embeddings for Multi-Label One-Shot Learning0
Investigating the Importance of Shape Features, Color Constancy, Color Spaces and Similarity Measures in Open-Ended 3D Object Recognition0
Variable-Viewpoint Representations for 3D Object Recognition0
The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks0
Analyzing the Dependency of ConvNets on Spatial Information0
Learning Probabilistic Intersection Traffic Models for Trajectory Prediction0
Selective Segmentation Networks Using Top-Down Attention0
EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision0
AI-Powered GUI Attack and Its Defensive Methods0
Temporal Pulses Driven Spiking Neural Network for Fast Object Recognition in Autonomous Driving0
Deleting object selective units in a fully-connected layer of deep convolutional networks improves classification performance0
Block-wise Scrambled Image Recognition Using Adaptation Network0
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future0
A Transfer Learning Approach to Cross-Modal Object Recognition: From Visual Observation to Robotic Haptic Exploration0
Learning Transformation-Aware Embeddings for Image Forensics0
Multi-Scale Weight Sharing Network for Image Recognition0
Convolutional Networks with Dense Connectivity0
RMNv2: Reduced Mobilenet V2 for CIFAR100
Fractional order graph neural network0
UW-NET: AN INCEPTION-ATTENTION NETWORK FOR UNDERWATER IMAGE CLASSIFICATION0
KNN-Based Automatic Cropping for Improved Threat Object Recognition in X-Ray Security Images0
Experiments with mmWave Automotive Radar Test-bedCode0
Application of Deep Learning in Generating Desired Design Options: Experiments Using Synthetic Training Dataset0
Importance Filtered Cross-Domain Adaptation0
Interactive Open-Ended Learning for 3D Object Recognition0
Towards Contextual Learning in Few-shot Object Classification0
Deep-learning-based classification and retrieval of components of a process plant from segmented point clouds0
L3DOC: Lifelong 3D Object Classification0
Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos0
Associative Alignment for Few-shot Image ClassificationCode0
Detecting and Correcting Adversarial Images Using Image Processing Operations0
Object Recognition with Human in the Loop Intelligent Frameworks0
Improved Few-Shot Visual Classification0
300 GHz Radar Object Recognition based on Deep Neural Networks and Transfer Learning0
Video to Events: Recycling Video Datasets for Event CamerasCode0
Continual egocentric object recognitionCode0
A Neural Spiking Approach Compared to Deep Feedforward Networks on Stepwise Pixel Erasement0
ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models0
Deliberative Explanations: visualizing network insecuritiesCode0
Representation Learning on Unit Ball with 3D Roto-Translational Equivariance0
Motion Equivariance OF Event-based Camera Data with the Temporal Normalization Transform0
GBCNs: Genetic Binary Convolutional Networks for Enhancing the Performance of 1-bit DCNNs0
Fine-grained Attention and Feature-sharing Generative Adversarial Networks for Single Image Super-ResolutionCode0
Show:102550
← PrevPage 22 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified