SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 14011450 of 2042 papers

TitleStatusHype
Learning to See by Moving0
Learning to Segment Moving Objects0
Learning Transferrable Representations for Unsupervised Domain Adaptation0
Learning Transformation-Aware Embeddings for Image Forensics0
Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images0
Learning visual biases from human imagination0
Learning what and where to attend with humans in the loop0
Teaching What You Should Teach: A Data-Based Distillation Method0
Learning with Privileged Information for Multi-Label Classification0
Learning with Recursive Perceptual Representations0
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision0
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection0
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection0
Lifelong Ensemble Learning based on Multiple Representations for Few-Shot Object Recognition0
Lifting Object Detection Datasets into 3D0
Lift-the-flap: what, where and when for context reasoning0
LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection0
Light Field Distortion Feature for Transparent Object Recognition0
Limited but consistent gains in adversarial robustness by co-training object recognition models with human EEG0
Linking Entities Across Images and Text0
LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition0
Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation0
Localized random projections challenge benchmarks for bio-plausible deep learning0
LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition0
Logical recognition method for solving the problem of identification in the Internet of Things0
Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction0
Look Further to Recognize Better: Learning Shared Topics and Category-Specific Dictionaries for Open-Ended 3D Object Recognition0
Look, Remember and Reason: Grounded reasoning in videos with language models0
Low-Energy Convolutional Neural Networks (CNNs) using Hadamard Method0
Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation0
LSD-Net: Look, Step and Detect for Joint Navigation and Multi-View Recognition with Deep Reinforcement Learning0
Machine Learning and Big Scientific Data0
Machine Learning Computer Vision Applications for Spatial AI Object Recognition in Orange County, California0
Machine learning with limited data0
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition0
Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve0
Matching objects across the textured-smooth continuum0
Material Classification Using Active Temperature Controllable Robotic Gripper0
MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection0
Maximum Likelihood Directed Enumeration Method in Piecewise-Regular Object Recognition0
mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets0
Measurement Bounds for Sparse Signal Reconstruction with Multiple Side Information0
Measurement-driven Analysis of an Edge-Assisted Object Recognition System0
Measuring and Understanding Sensory Representations within Deep Networks Using a Numerical Optimization Framework0
Measuring Human Perception to Improve Open Set Recognition0
Medical Deep Learning -- A systematic Meta-Review0
Merging SVMs with Linear Discriminant Analysis: A Combined Model0
Mesh Based Semantic Modelling for Indoor and Outdoor Scenes0
MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition0
Meta-forests: Domain generalization on random forests with meta-learning0
Show:102550
← PrevPage 29 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified