SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 15011550 of 2042 papers

TitleStatusHype
Image classification using local tensor singular value decompositions0
A New Urban Objects Detection Framework Using Weakly Annotated Sets0
Controlled Tactile Exploration and Haptic Object Recognition0
Do Deep Neural Networks Suffer from Crowding?Code0
Deep Mixture of Diverse Experts for Large-Scale Visual Recognition0
Deep Transfer Learning: A new deep learning glitch classification method for advanced LIGO0
Comparing deep neural networks against humans: object recognition when the signal gets weakerCode0
Two-Stream Convolutional Networks for Dynamic Texture SynthesisCode0
Analysis of dropout learning regarded as ensemble learning0
Human-like Clustering with Deep Convolutional Neural NetworksCode0
Teaching Compositionality to CNNs0
Enriched Deep Recurrent Visual Attention Model for Multiple Object Recognition0
MirBot: A collaborative object recognition system for smartphones using convolutional neural networks0
CortexNet: a Generic Network Family for Robust Visual Temporal Representations0
Incorporating Network Built-in Priors in Weakly-supervised Semantic Segmentation0
A Random-Fern based Feature Approach for Image Matching0
Deep-Learning Convolutional Neural Networks for scattered shrub detection with Google Earth Imagery0
Non-Uniform Subset Selection for Active Learning in Structured DataCode0
Reflection Invariant and Symmetry Detection0
Multi-View Task-Driven Recognition in Visual Sensor Networks0
First-spike based visual categorization using reward-modulated STDP0
Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models0
Learning Robust Object Recognition Using Composed Scenes from Generative Models0
Forecasting Hands and Objects in Future Frames0
What do We Learn by Semantic Scene Understanding for Remote Sensing imagery in CNN framework?0
Learning Hard Alignments with Variational Inference0
View-Invariant Template Matching Using Homography Constraints0
Using Satellite Imagery for Good: Detecting Communities in Desert and Mapping Vaccination Activities0
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks0
Reducing Bias in Production Speech Models0
Collaborative Descriptors: Convolutional Maps for Preprocessing0
CORe50: a New Dataset and Benchmark for Continuous Object RecognitionCode0
Bridging between Computer and Robot Vision through Data Augmentation: a Case Study on Object Recognition0
DeepCorrect: Correcting DNN models against Image DistortionsCode0
From Zero-shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis0
Recurrent Soft Attention Model for Common Object RecognitionCode0
Inception Recurrent Convolutional Neural Network for Object RecognitionCode0
Towards Instance Segmentation with Object Priority: Prominent Object Detection and Recognition0
Residual Attention Network for Image ClassificationCode0
Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition0
Invariant recognition drives neural representations of action sequences0
BranchConnect: Large-Scale Visual Recognition with Learned Branch Connections0
Ranking to Learn: Feature Ranking and Selection via Eigenvector Centrality0
Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image ClassificationCode0
Deep Affordance-grounded Sensorimotor Object Recognition0
Generate To Adapt: Aligning Domains using Generative Adversarial NetworksCode0
Pragmatic descriptions of perceptual stimuli0
Multiple Instance Detection Network with Online Instance Classifier RefinementCode1
Learning with Privileged Information for Multi-Label Classification0
Object categorization in finer levels requires higher spatial frequencies, and therefore takes longer0
Show:102550
← PrevPage 31 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified