SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 601650 of 2042 papers

TitleStatusHype
Reason from Context with Self-supervised Learning0
The purpose of qualia: What if human thinking is not (only) information processing?0
R2-MLP: Round-Roll MLP for Multi-View 3D Object RecognitionCode0
Adversarial Detection by Approximation of Ensemble Boundary0
MultiCrossViT: Multimodal Vision Transformer for Schizophrenia Prediction using Structural MRI and Functional Network Connectivity Data0
Egocentric Audio-Visual Noise Suppression0
Generating Clear Images From Images With Distortions Caused by Adverse Weather Using Generative Adversarial Networks0
State-of-the-art Models for Object Detection in Various Fields of Application0
Subsidiary Prototype Alignment for Universal Domain Adaptation0
Text2Model: Text-based Model Induction for Zero-shot Image Classification0
Object recognition in atmospheric turbulence scenesCode0
Context-driven Visual Object Recognition based on Knowledge Graphs0
Object Recognition in Different Lighting Conditions at Various Angles by Deep Learning Method0
Psychophysical-Score: A Behavioral Measure for Assessing the Biological Plausibility of Visual Recognition Models0
Variant Parallelism: Lightweight Deep Convolutional Models for Distributed Inference on IoT Devices0
Hand Gestures Recognition in Videos Taken with Lensless Camera0
Learning by Asking Questions for Knowledge-based Novel Object Recognition0
GraspCaps: A Capsule Network Approach for Familiar 6DoF Object Grasping0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Early or Late Fusion Matters: Efficient RGB-D Fusion in Vision Transformers for 3D Object Recognition0
Multipod Convolutional Network0
RECALL: Rehearsal-free Continual Learning for Object ClassificationCode0
Fusion of Inverse Synthetic Aperture Radar and Camera Images for Automotive Target Tracking0
Reconstruction-guided attention improves the robustness and shape processing of neural networksCode0
Transformer-Based Microbubble Localization0
TANDEM3D: Active Tactile Exploration for 3D Object Recognition0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation0
Data Augmentation by Selecting Mixed Classes Considering Distance Between Classes0
Continual Learning for Pose-Agnostic Object Recognition in 3D Point Clouds0
Measuring Human Perception to Improve Open Set Recognition0
Improving the Accuracy and Robustness of CNNs Using a Deep CCA Neural Data Regularizer0
Low-Energy Convolutional Neural Networks (CNNs) using Hadamard Method0
Chosen methods of improving small object recognition with weak recognizable features0
A Simulation Method for MMW Radar Sensing in Traffic Intersection Based on BART Algorithm0
How good are deep models in understanding the generated images?0
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding0
Noise-Adaptive Intelligent Programmable Meta-Imager0
Different Spectral Representations in Optimized Artificial Neural Networks and BrainsCode0
Background Invariance Testing According to Semantic Proximity0
Modeling biological face recognition with deep convolutional neural networks0
Automatic Ultrasound Image Segmentation of Supraclavicular Nerve Using Dilated U-Net Deep Learning Architecture0
A neuromorphic approach to image processing and machine vision0
A Proper Orthogonal Decomposition approach for parameters reduction of Single Shot Detector networks0
Easy Batch Normalization0
Semi-supervised Ranking for Object Image Blur AssessmentCode0
A Survey of Task-Based Machine Learning Content Extraction Services for VIDINT0
FewSOL: A Dataset for Few-Shot Object Learning in Robotic EnvironmentsCode0
Multi-area Target Individual Detection with Free Drawing on VideoCode0
SESS: Saliency Enhancing with Scaling and SlidingCode0
Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images0
Show:102550
← PrevPage 13 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified