SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 451500 of 2042 papers

TitleStatusHype
Coordinating Cross-modal Distillation for Molecular Property Prediction0
CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation0
Learning Dense Object Descriptors from Multiple Views for Low-shot Category GeneralizationCode1
Metric Learning as a Service with Covariance Embedding0
Learning to Learn: How to Continuously Teach Humans and Machines0
Roboflow 100: A Rich, Multi-Domain Object Detection BenchmarkCode2
Reason from Context with Self-supervised Learning0
The purpose of qualia: What if human thinking is not (only) information processing?0
R2-MLP: Round-Roll MLP for Multi-View 3D Object RecognitionCode0
Adversarial Detection by Approximation of Ensemble Boundary0
MultiCrossViT: Multimodal Vision Transformer for Schizophrenia Prediction using Structural MRI and Functional Network Connectivity Data0
Harmonizing the object recognition strategies of deep neural networks with humansCode1
Egocentric Audio-Visual Noise Suppression0
Generating Clear Images From Images With Distortions Caused by Adverse Weather Using Generative Adversarial Networks0
State-of-the-art Models for Object Detection in Various Fields of Application0
Subsidiary Prototype Alignment for Universal Domain Adaptation0
Object Segmentation of Cluttered Airborne LiDAR Point CloudsCode1
Text2Model: Text-based Model Induction for Zero-shot Image Classification0
Object recognition in atmospheric turbulence scenesCode0
Context-driven Visual Object Recognition based on Knowledge Graphs0
Object Recognition in Different Lighting Conditions at Various Angles by Deep Learning Method0
Psychophysical-Score: A Behavioral Measure for Assessing the Biological Plausibility of Visual Recognition Models0
Variant Parallelism: Lightweight Deep Convolutional Models for Distributed Inference on IoT Devices0
Hand Gestures Recognition in Videos Taken with Lensless Camera0
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNsCode1
Learning by Asking Questions for Knowledge-based Novel Object Recognition0
The Equalization Losses: Gradient-Driven Training for Long-tailed Object RecognitionCode2
GraspCaps: A Capsule Network Approach for Familiar 6DoF Object Grasping0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Early or Late Fusion Matters: Efficient RGB-D Fusion in Vision Transformers for 3D Object Recognition0
Multipod Convolutional Network0
Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022Code1
RECALL: Rehearsal-free Continual Learning for Object ClassificationCode0
Reconstruction-guided attention improves the robustness and shape processing of neural networksCode0
Fusion of Inverse Synthetic Aperture Radar and Camera Images for Automotive Target Tracking0
OBBStacking: An Ensemble Method for Remote Sensing Object DetectionCode1
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoCode1
Li-ion battery degradation modes diagnosis via Convolutional Neural NetworksCode1
Transformer-Based Microbubble Localization0
TANDEM3D: Active Tactile Exploration for 3D Object Recognition0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation0
Visual Recognition with Deep Nearest CentroidsCode1
Data Augmentation by Selecting Mixed Classes Considering Distance Between Classes0
Continual Learning for Pose-Agnostic Object Recognition in 3D Point Clouds0
Measuring Human Perception to Improve Open Set Recognition0
Low-Energy Convolutional Neural Networks (CNNs) using Hadamard Method0
Improving the Accuracy and Robustness of CNNs Using a Deep CCA Neural Data Regularizer0
Chosen methods of improving small object recognition with weak recognizable features0
A Simulation Method for MMW Radar Sensing in Traffic Intersection Based on BART Algorithm0
How good are deep models in understanding the generated images?0
Show:102550
← PrevPage 10 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified