SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 19512000 of 2042 papers

TitleStatusHype
Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks0
Object Recognition System Design in Computer Vision: a Universal Approach0
DeCAF: A Deep Convolutional Activation Feature for Generic Visual RecognitionCode0
Image Description using Visual Dependency Representations0
Surface Registration Using Genetic Algorithm in Reduced Search Space0
Suspicious Object Recognition Method in Video Stream Based on Visual Attention0
MonoStream: A Minimal-Hardware High Accuracy Device-free WLAN Localization System0
VSEM: An open library for visual semantics representation0
Visual Features for Linguists: Basic image analysis techniques for multimodally-curious NLPers0
Domain-invariant Face Recognition using Learned Low-rank Transformation0
Attribute-Based Classification for Zero-Shot Visual Object Categorization0
Integration of 3D Object Recognition and Planning for Robotic Manipulation: A Preliminary Report0
A Novel Equation based Classifier for Detecting Human in Images0
Introducing Memory and Association Mechanism into a Biologically Inspired Visual Model0
iCub World: Friendly Robots Help Building Good Vision Data-Sets0
Feature Learning by Multidimensional Scaling and its Applications in Object RecognitionCode0
Matching objects across the textured-smooth continuum0
The Ripple Pond: Enabling Spiking Networks to See0
Large Margin Low Rank Tensor Analysis0
Annotation of Online Shopping Images without Labeled Training Examples0
More Words and Bigger Pictures0
SCaLE: Supervised and Cascaded Laplacian Eigenmaps for Visual Object Recognition Based on Nearest Neighbors0
Understanding Bayesian Rooms Using Composite 3D Object Models0
Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses0
Learning and Calibrating Per-Location Classifiers for Visual Place Recognition0
Weakly Supervised Learning of Mid-Level Features with Beta-Bernoulli Process Restricted Boltzmann Machines0
Mesh Based Semantic Modelling for Indoor and Outdoor Scenes0
Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images0
PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors0
Subspace Interpolation via Dictionary Learning for Unsupervised Domain Adaptation0
Sparse Output Coding for Large-Scale Visual Recognition0
Scene Parsing by Integrating Function, Geometry and Appearance Models0
Kernel Null Space Methods for Novelty Detection0
BFO Meets HOG: Feature Extraction Based on Histograms of Oriented p.d.f. Gradients for Image Classification0
Learning Collections of Part Models for Object Recognition0
Light Field Distortion Feature for Transparent Object Recognition0
Cartesian K-MeansCode0
SLAM++: Simultaneous Localisation and Mapping at the Level of Objects0
Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition0
Fully-Connected CRFs with Non-Parametric Pairwise Potential0
Adaptive Active Learning for Image Classification0
Image Optimization and Prediction0
An Adaptive Descriptor Design for Object Recognition in the Wild0
Reading Ancient Coin Legends: Object Recognition vs. OCR0
Image Retrieval based on Bag-of-Words model0
Rotational Projection Statistics for 3D Local Surface Description and Object Recognition0
Amplitude-Based Approach to Evidence Accumulation0
Multiple Kernel Sparse Representations for Supervised and Unsupervised Learning0
ImageNet Classification with Deep Convolutional Neural NetworksCode0
Large Scale Distributed Deep Networks0
Show:102550
← PrevPage 40 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified