SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 110 of 2042 papers

TitleStatusHype
GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing0
Out-of-distribution detection in 3D applications: a review0
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point CloudsCode0
Continual Hyperbolic Learning of Instances and Classes0
DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects0
Aligning Text, Images, and 3D Structure Token-by-Token0
STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous DrivingCode1
Feature-Based Lie Group Transformer for Real-World Applications0
EV-Flying: an Event-based Dataset for In-The-Wild Recognition of Flying Objects0
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness0
Show:102550
← PrevPage 1 of 205Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified