SOTAVerified

Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Showing 701750 of 2042 papers

TitleStatusHype
Reducing Overconfidence Predictions for Autonomous Driving Perception0
BViT: Broad Attention based Vision TransformerCode0
Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection0
Positive-Unlabeled Domain Adaptation0
Comparative Study Between Distance Measures On Supervised Optimum-Path Forest ClassificationCode0
Are Transformers More Robust? Towards Exact Robustness Verification for Transformers0
Radar-based Materials Classification Using Deep Wavelet Scattering Transform: A Comparison of Centimeter vs. Millimeter Wave Units0
SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning0
A Probabilistic Framework for Dynamic Object Recognition in 3D Environment With A Novel Continuous Ground Estimation Method0
Dynamic Rectification Knowledge DistillationCode0
Winograd Convolution for Deep Neural Networks: Efficient Point Selection0
A Machine Learning Framework for Distributed Functional Compression over Wireless Channels in IoT0
Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations0
Task Independent Capsule-Based Agents for Deep Q-Learning0
Geometric and Textural Augmentation for Domain Gap ReductionCode0
AU Dataset for Visuo-Haptic Object Recognition for Robots0
Contrastive Object Detection Using Knowledge Graph Embeddings0
Generating Photo-realistic Images from LiDAR Point Clouds with Generative Adversarial Networks0
Object Recognition as Classification via Visual Properties0
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision0
Co-training Transformer with Videos and Images Improves Action Recognition0
Controlled-rearing studies of newborn chicks and deep neural networks0
On Adversarial Robustness of Point Cloud Semantic SegmentationCode0
A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition0
Sparse Depth Completion with Semantic Mesh Deformation Optimization0
Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for Event-Based Vision0
Panoptic-aware Image-to-Image Translation0
An online passive-aggressive algorithm for difference-of-squares classification0
Your head is there to move you around: Goal-driven models of the primate dorsal pathway0
Material Classification Using Active Temperature Controllable Robotic Gripper0
PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds0
Efficient Anomaly Detection Using Self-Supervised Multi-Cue Tasks0
Inducing Functions through Reinforcement Learning without Task Specification0
Object Recognition by a Minimally Pre-Trained System in the Process of Environment Exploration0
Adversarial Examples on Segmentation Models Can be Easy to Transfer0
The Neural Correlates of Image Texture in the Human Vision Using Magnetoencephalography0
ShapeY: Measuring Shape Recognition Capacity Using Nearest Neighbor Matching0
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge0
6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping0
Unsupervised Spiking Instance Segmentation on Event Data using STDP0
Grassmannian learning mutual subspace method for image set recognition0
Development of collective behavior in newborn artificial agents0
Certifiable Artificial Intelligence Through Data Fusion0
Resampling and super-resolution of hexagonally sampled images using deep learning0
Investigating Negation in Pre-trained Vision-and-language ModelsCode0
Latent Cognizance: What Machine Really Learns0
HR-RCNN: Hierarchical Relational Reasoning for Object Detection0
MVT: Multi-view Vision Transformer for 3D Object RecognitionCode0
On Label-Efficient Computer Vision: Building Fast and Effective Few-Shot Image Classifiers0
Knowledge-driven Active LearningCode0
Show:102550
← PrevPage 15 of 41Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Imagenshape bias98.7Unverified
2Stable Diffusionshape bias92.7Unverified
3Partishape bias91.7Unverified
4ViT-22B-384shape bias86.4Unverified
5ViT-22B-560shape bias83.8Unverified
6CLIP (ViT-B)shape bias79.9Unverified
7ViT-22B-224shape bias78Unverified
8ResNet-50 (L2 eps 5.0 adv trained)shape bias69.5Unverified
9ResNet-50 (with strong augmentations)shape bias62.2Unverified
10SWSL (ResNeXt-101)shape bias49.8Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.55Unverified
2SSNNAccuracy (% )78.57Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )85.62Unverified
2SSNNAccuracy (% )79.25Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy18.75Unverified
2yunTop 5 Accuracy14.75Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2DYTop 5 Accuracy0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ObjectNet-BaselineTop 5 Accuracy52.24Unverified
2AJ2021Top 5 Accuracy27.68Unverified
#ModelMetricClaimedVerifiedStatus
1SSNNAccuracy (% )94.91Unverified
#ModelMetricClaimedVerifiedStatus
1Faster-RCNNmAP30.39Unverified
#ModelMetricClaimedVerifiedStatus
1Spike-VGG11Accuracy (% )96Unverified