SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 251300 of 2042 papers

TitleStatusHype
Improving Annotation for 3D Pose Dataset of Fine-Grained Object CategoriesCode0
Bayesian and Neural Inference on LSTM-based Object Recognition from Tactile and Kinesthetic InformationCode0
Imagine2touch: Predictive Tactile Sensing for Robotic Manipulation using Efficient Low-Dimensional SignalsCode0
Image Style Transfer Using Convolutional Neural NetworksCode0
Towards Interpreting Recurrent Neural Networks through Probabilistic AbstractionCode0
Image Captioning using Deep Neural ArchitecturesCode0
Algorithms for Semantic Segmentation of Multispectral Remote Sensing Imagery using Deep LearningCode0
ImageNet Classification with Deep Convolutional Neural NetworksCode0
Image Privacy Prediction Using Deep Neural NetworksCode0
Investigating the Gestalt Principle of Closure in Deep Convolutional Neural NetworksCode0
A Dataset and Framework for Learning State-invariant Object RepresentationsCode0
Human-like Clustering with Deep Convolutional Neural NetworksCode0
Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial NoisesCode0
Human Pose Estimation for Real-World Crowded ScenariosCode0
Hierarchical Superpixel Segmentation via Structural Information TheoryCode0
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?Code0
A comparison between humans and AI at recognizing objects in unusual posesCode0
Associative Alignment for Few-shot Image ClassificationCode0
HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual RecognitionCode0
Handwritten Bangla Character Recognition Using The State-of-Art Deep Convolutional Neural NetworksCode0
Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language ModelsCode0
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes ProsthesisCode0
Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agentsCode0
Adapting Deep Network Features to Capture Psychological RepresentationsCode0
Global Second-order Pooling Convolutional NetworksCode0
Grid Cell Path Integration For Movement-Based Visual Object RecognitionCode0
Geometric and Textural Augmentation for Domain Gap ReductionCode0
Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop ObjectsCode0
Generate To Adapt: Aligning Domains using Generative Adversarial NetworksCode0
Genetic CNNCode0
Grounded Human-Object Interaction Hotspots from VideoCode0
Tuned Compositional Feature Replays for Efficient Stream LearningCode0
GAANet: Ghost Auto Anchor Network for Detecting Varying Size Drones in DarkCode0
Generalisation in humans and deep neural networksCode0
Foveation in the Era of Deep LearningCode0
Foveated Instance SegmentationCode0
FPNN: Field Probing Neural Networks for 3D DataCode0
Machine learning with neural networksCode0
Artificial Color Constancy via GoogLeNet with Angular Loss FunctionCode0
Fit to Measure: Reasoning about Sizes for Robust Object RecognitionCode0
Are we done with object recognition? The iCub robot's perspectiveCode0
Food Image Recognition by Using Convolutional Neural Networks (CNNs)Code0
Are Vision Transformers More Data Hungry Than Newborn Visual Systems?Code0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
FoodTracker: A Real-time Food Detection Mobile Application by Deep Convolutional Neural NetworksCode0
FewSOL: A Dataset for Few-Shot Object Learning in Robotic EnvironmentsCode0
Feature Pyramid GridsCode0
Finding Tiny FacesCode0
Feature Learning by Multidimensional Scaling and its Applications in Object RecognitionCode0
3D Object Recognition with Ensemble Learning --- A Study of Point Cloud-Based Deep Learning ModelsCode0
Show:102550
← PrevPage 6 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified