Object Recognition

Object recognition is a computer vision technique for detecting and classifying objects in images or videos. Because it combines object detection with image classification, state-of-the-art results are tracked separately for each component task.

(Image credit: TensorFlow Object Detection API)
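The description above frames recognition as detection followed by classification. A minimal sketch of that composition is shown here, assuming a detector that returns bounding boxes and a classifier that labels a cropped patch. The names `recognize`, `toy_detect`, and `toy_classify` are hypothetical stand-ins for illustration only, not any library's API, and the toy functions are not real models.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (row0, col0, row1, col1), end-exclusive
Image = List[List[int]]          # grayscale pixel rows, values 0..255


@dataclass(frozen=True)
class Recognition:
    box: Box
    label: str


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of `image`."""
    r0, c0, r1, c1 = box
    return [row[c0:c1] for row in image[r0:r1]]


def recognize(
    image: Image,
    detect: Callable[[Image], List[Box]],
    classify: Callable[[Image], str],
) -> List[Recognition]:
    """Detect candidate boxes, then classify the patch inside each box."""
    return [Recognition(box, classify(crop(image, box))) for box in detect(image)]


def toy_detect(image: Image) -> List[Box]:
    """Hypothetical detector: one tight box around all non-zero pixels."""
    coords = [(r, c) for r, row in enumerate(image) for c, v in enumerate(row) if v > 0]
    if not coords:
        return []
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return [(min(rows), min(cols), max(rows) + 1, max(cols) + 1)]


def toy_classify(patch: Image) -> str:
    """Hypothetical classifier: label a patch by its mean intensity."""
    pixels = [v for row in patch for v in row]
    return "bright" if sum(pixels) / len(pixels) > 127 else "dark"


scene = [
    [0, 0, 0, 0],
    [0, 200, 220, 0],
    [0, 210, 230, 0],
    [0, 0, 0, 0],
]
results = recognize(scene, toy_detect, toy_classify)
print(results)
```

In practice the two callables would be trained models. Keeping them as separate arguments mirrors the way the component tasks are evaluated separately.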

Papers


Showing 51–100 of 2042 papers

Title	Date	Tasks	Status	Hype
Comics Datasets Framework: Mix of Comics datasets for detection benchmarking	Jul 3, 2024	Benchmarking, Object	Code Available	1
The 3D-PC: a benchmark for visual perspective taking in humans and machines	Jun 6, 2024	Object Recognition	Code Available	1
Bilateral Event Mining and Complementary for Event Stream Super-Resolution	May 16, 2024	Object Recognition, Super-Resolution	Code Available	1
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models	Apr 17, 2024	Hallucination, Multimodal Reasoning	Code Available	1
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models	Apr 11, 2024	Attribute, Object	Code Available	1
EventRPG: Event Data Augmentation with Relevance Propagation Guidance	Mar 14, 2024	Action Recognition, Data Augmentation	Code Available	1
MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding	Mar 5, 2024	3D visual grounding, Decision Making	Code Available	1
CLoVe: Encoding Compositional Language in Contrastive Vision-Language Models	Feb 22, 2024	Object Recognition, Retrieval	Code Available	1
SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models	Feb 6, 2024	Attribute, Face Anti-Spoofing	Code Available	1
Self-supervised learning of video representations from a child's perspective	Feb 1, 2024	Object Recognition, Self-Supervised Learning	Code Available	1
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed Data	Dec 14, 2023	Contrastive Learning, Federated Learning	Code Available	1
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction	Dec 4, 2023	3D geometry, Autonomous Driving	Code Available	1
Object Recognition as Next Token Prediction	Dec 4, 2023	Decoder, Language Modeling	Code Available	1
E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning	Nov 30, 2023	Image Reconstruction, Object Recognition	Code Available	1
Lidar Annotation Is All You Need	Nov 8, 2023	All, Autonomous Driving	Code Available	1
Recognize Any Regions	Nov 2, 2023	object-detection, Object Detection	Code Available	1
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects	Oct 19, 2023	3D Object Recognition, 6D Pose Estimation	Code Available	1
Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-ends	Oct 16, 2023	Object Recognition	Code Available	1
Intriguing properties of generative classifiers	Sep 28, 2023	Object Recognition	Code Available	1
LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition	Sep 22, 2023	Object, Object Recognition	Code Available	1
Divergences in Color Perception between Deep Neural Networks and Humans	Sep 11, 2023	image-classification, Image Classification	Code Available	1
Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art	Sep 10, 2023	Object, object-detection	Code Available	1
Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN	Sep 4, 2023	object-detection, Object Detection	Code Available	1
Decoding Natural Images from EEG for Object Recognition	Aug 25, 2023	Contrastive Learning, EEG	Code Available	1
Label-Free Event-based Object Recognition via Joint Learning with Image Reconstruction from Events	Aug 18, 2023	Image Reconstruction, Object	Code Available	1
Scaling may be all you need for achieving human-level object recognition capacity with human-like visual experience	Aug 7, 2023	All, Object Recognition	Code Available	1
DesCo: Learning Object Recognition with Rich Language Descriptions	Jun 24, 2023	Language Modeling, Language Modelling	Code Available	1
EventCLIP: Adapting CLIP for Event-based Object Recognition	Jun 10, 2023	Few-Shot Learning, Object	Code Available	1
Paxion: Patching Action Knowledge in Video-Language Foundation Models	May 18, 2023	Action Understanding, Diagnostic	Code Available	1
Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery	May 10, 2023	Contrastive Learning, image-classification	Code Available	1
Discover and Cure: Concept-aware Mitigation of Spurious Correlation	May 1, 2023	Lesion Classification, Object Recognition	Code Available	1
From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection	Apr 26, 2023	Event-based vision, object-detection	Code Available	1
Explainable GeoAI: Can saliency maps help interpret artificial intelligence's learning process? An empirical study on natural feature detection	Mar 16, 2023	Deep Learning, Object Recognition	Code Available	1
Learning Efficient Coding of Natural Images with Maximum Manifold Capacity Representations	Mar 6, 2023	Contrastive Learning, Object Recognition	Code Available	1
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	Deblurring, Deep Learning	Code Available	1
Towards Local Visual Modeling for Image Captioning	Feb 13, 2023	Image Captioning, Object Recognition	Code Available	1
TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction	Jan 5, 2023	Object, Object Recognition	Code Available	1
TempSAL - Uncovering Temporal Information for Deep Saliency Prediction	Jan 1, 2023	Object, Object Recognition	Code Available	1
Part-guided Relational Transformers for Fine-grained Visual Recognition	Dec 28, 2022	Fine-Grained Image Classification, Fine-Grained Visual Recognition	Code Available	1
Doubly Right Object Recognition: A Why Prompt for Visual Rationales	Dec 12, 2022	Object Recognition	Code Available	1
PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization	Dec 2, 2022	Domain Generalization, object-detection	Code Available	1
Learning Dense Object Descriptors from Multiple Views for Low-shot Category Generalization	Nov 28, 2022	Novel View Synthesis, Object	Code Available	1
Harmonizing the object recognition strategies of deep neural networks with humans	Nov 8, 2022	Object, Object Recognition	Code Available	1
Object Segmentation of Cluttered Airborne LiDAR Point Clouds	Oct 28, 2022	Object, Object Recognition	Code Available	1
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNs	Oct 12, 2022	Object Recognition	Code Available	1
Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022	Oct 1, 2022	Few-Shot Image Classification, Object Recognition	Code Available	1
OBBStacking: An Ensemble Method for Remote Sensing Object Detection	Sep 27, 2022	Earth Observation, Object	Code Available	1
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video	Sep 25, 2022	Long-tail Video Object Segmentation, Multi-Object Tracking	Code Available	1
Li-ion battery degradation modes diagnosis via Convolutional Neural Networks	Sep 24, 2022	Battery diagnosis, Object Recognition	Code Available	1
Visual Recognition with Deep Nearest Centroids	Sep 15, 2022	Decision Making, image-classification	Code Available	1



Datasets: shape bias, CIFAR10-DVS, N-Caltech 101, ObjectNet (All classes), ObjectNet (ImageNet classes), ObjectNet (ImageNet classes, trained on ImageNet), DVS128 Gesture, MECCANO, N-CARS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Imagen	shape bias	98.7	—	Unverified
2	Stable Diffusion	shape bias	92.7	—	Unverified
3	Parti	shape bias	91.7	—	Unverified
4	ViT-22B-384	shape bias	86.4	—	Unverified
5	ViT-22B-560	shape bias	83.8	—	Unverified
6	CLIP (ViT-B)	shape bias	79.9	—	Unverified
7	ViT-22B-224	shape bias	78	—	Unverified
8	ResNet-50 (L2 eps 5.0 adv trained)	shape bias	69.5	—	Unverified
9	ResNet-50 (with strong augmentations)	shape bias	62.2	—	Unverified
10	SWSL (ResNeXt-101)	shape bias	49.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (%)	85.55	—	Unverified
2	SSNN	Accuracy (%)	78.57	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (%)	85.62	—	Unverified
2	SSNN	Accuracy (%)	79.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	18.75	—	Unverified
2	yun	Top 5 Accuracy	14.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	DY	Top 5 Accuracy	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	AJ2021	Top 5 Accuracy	27.68	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SSNN	Accuracy (%)	94.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Faster-RCNN	mAP	30.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (%)	96	—	Unverified