SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 12011250 of 2042 papers

TitleStatusHype
Self-Supervised Multi-View Learning via Auto-Encoding 3D Transformations0
Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning0
Self-supervised Visual Attribute Learning for Fashion Compatibility0
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer0
Semantic Kernel Forests from Multiple Taxonomies0
Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need0
Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer0
Semantic Topic Analysis of Traffic Camera Images0
Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition0
Semi-supervised Training Data Generation for Multilingual Question Answering0
Semi-supervised Zero-Shot Learning by a Clustering-based Approach0
Sensitivity of sparse codes to image distortions0
Sensory Optimization: Neural Networks as a Model for Understanding and Creating Art0
Sequence-based Person Attribute Recognition with Joint CTC-Attention Model0
Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition0
ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids0
ShapeY: Measuring Shape Recognition Capacity Using Nearest Neighbor Matching0
Shift from Texture-bias to Shape-bias: Edge Deformation-based Augmentation for Robust Object Recognition0
Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network0
Signature of Geometric Centroids for 3D Local Shape Description and Partial Shape Matching0
Significance of feedforward architectural differences between the ventral visual stream and DenseNet0
Simulating reaction time for Eureka effect in visual object recognition using artificial neural network0
Simultaneous Multi-View Object Recognition and Grasping in Open-Ended Domains0
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition0
Simultaneous View and Feature Selection for Collaborative Multi-Robot Perception0
SISA: Securing Images by Selective Alteration0
SizeNet: Object Recognition via Object Real Size-based Convolutional Networks0
SLAM++: Simultaneous Localisation and Mapping at the Level of Objects0
Contour Sparse Representation with SDD Features for Object Recognition0
SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients0
Soft-margin classification of object manifolds0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications0
Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices0
SPARK: SPAcecraft Recognition leveraging Knowledge of Space Environment0
Sparse Depth Completion with Semantic Mesh Deformation Optimization0
Sparse distributed localized gradient fused features of objects0
Sparse Output Coding for Large-Scale Visual Recognition0
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models0
Spatial-Aware Graph Relation Network for Large-Scale Object Detection0
Spatio-Temporal Facial Expression Recognition Using Convolutional Neural Networks and Conditional Random Fields0
Spatiotemporal Feature Learning for Event-Based Vision0
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition0
SpecNet: Spectral Domain Convolutional Neural Network0
Spectral Dependency Parsing with Latent Variables0
Spectral Processing and Optimization of Static and Dynamic 3D Geometries0
Speech recognition in Alzheimer's disease with personal assistive robots0
Spherical Convolutional Neural Networks: Stability to Perturbations in SO(3)0
SpikeNAS: A Fast Memory-Aware Neural Architecture Search Framework for Spiking Neural Network-based Autonomous Agents0
SqueezeJet: High-level Synthesis Accelerator Design for Deep Convolutional Neural Networks0
SRTransGAN: Image Super-Resolution using Transformer based Generative Adversarial Network0
Show:102550
← PrevPage 25 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified