SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 11511200 of 2042 papers

TitleStatusHype
Weakly Supervised Learning of Mid-Level Features with Beta-Bernoulli Process Restricted Boltzmann Machines0
Weakly Supervised Localization using Deep Feature Maps0
Webly Supervised Semantic Embeddings for Large Scale Zero-Shot Learning0
Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network0
What a difference a pixel makes: An empirical examination of features used by CNNs for categorisation0
What are the visual features underlying human versus machine vision?0
What Can I Do Around Here? Deep Functional Scene Understanding for Cognitive Robots0
What can we learn about CNNs from a large scale controlled object dataset?0
What deep learning can tell us about higher cognitive functions like mindreading?0
What do We Learn by Semantic Scene Understanding for Remote Sensing imagery in CNN framework?0
What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures?0
What's in a Name? Beyond Class Indices for Image Recognition0
What's in my Room? Object Recognition on Indoor Panoramic Images0
What takes the brain so long: Object recognition at the level of minimal images develops for up to seconds of presentation time0
What you need to know about the state-of-the-art computational models of object-vision: A tour through the models0
When Regression Meets Manifold Learning for Object Recognition and Pose Estimation0
Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks0
Why The Brain Separates Face Recognition From Object Recognition0
Winograd Convolution for Deep Neural Networks: Efficient Point Selection0
X-model: Improving Data Efficiency in Deep Learning with A Minimax Model0
YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation0
YOLO and K-Means Based 3D Object Detection Method on Image and Point Cloud0
You Only Speak Once to See0
Your head is there to move you around: Goal-driven models of the primate dorsal pathway0
Zero-Aliasing Correlation Filters for Object Recognition0
Zero-Knowledge Zero-Shot Learning for Novel Visual Category Discovery0
Zero-shot counting with a dual-stream neural network model0
Zero Shot Hashing0
Zero-Shot Learning from Adversarial Feature Residual to Compact Visual Feature0
Zero-shot Learning with Deep Neural Networks for Object Recognition0
Zero-shot object prediction using semantic scene knowledge0
Zero-Shot Object Recognition by Semantic Manifold Distance0
Zero-Shot Object Recognition System based on Topic Model0
Zero-shot recognition with unreliable attributes0
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages?0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM0
Generating Clear Images From Images With Distortions Caused by Adverse Weather Using Generative Adversarial Networks0
Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines0
Generating Photo-realistic Images from LiDAR Point Clouds with Generative Adversarial Networks0
Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models0
Generic decoding of seen and imagined objects using hierarchical visual features0
Geo-locating Road Objects using Inverse Haversine Formula with NVIDIA Driveworks0
GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing0
Geometry Aware Constrained Optimization Techniques for Deep Learning0
GLCM-Based Feature Combination for Extraction Model Optimization in Object Detection Using Machine Learning0
Glitch Classification and Clustering for LIGO with Deep Transfer Learning0
Global Deconvolutional Networks for Semantic Segmentation0
Going Deeper into Action Recognition: A Survey0
Gradient-based Laplacian Feature Selection0
Gradients of Counterfactuals0
Show:102550
← PrevPage 24 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified