Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 176–200 of 2042 papers

Title	Date	Tasks	Status	Hype	Score
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	DeblurringDeep Learning	CodeCode Available	1	5
On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey	Aug 9, 2024	Object Recognition	CodeCode Available	1	5
Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning	May 25, 2016	Object RecognitionVideo Prediction	CodeCode Available	1	5
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects	Mar 27, 2018	General ClassificationObject	CodeCode Available	1	5
Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax	Jun 18, 2020	image-classificationImage Classification	CodeCode Available	1	5
Learning what and where to attend	May 22, 2018	DiagnosticImage Categorization	CodeCode Available	1	5
Deep Subdomain Adaptation Network for Image Classification	Jun 17, 2021	ClassificationDomain Adaptation	CodeCode Available	1	5
From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection	Apr 26, 2023	Event-based visionobject-detection	CodeCode Available	1	5
Rehearsal-Free Continual Learning over Small Non-I.I.D. Batches	Jul 8, 2019	class-incremental learningClass Incremental Learning	CodeCode Available	1	5
PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition	Jul 15, 2024	Adversarial RobustnessInductive Bias	CodeCode Available	1	5
Describing Textures in the Wild	Nov 14, 2013	Material RecognitionObject Recognition	CodeCode Available	1	5
Are Convolutional Neural Networks or Transformers more like human vision?	May 15, 2021	BIG-bench Machine LearningObject Recognition	CodeCode Available	1	5
DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection	Mar 17, 2022	object-detectionObject Detection	CodeCode Available	1	5
Causal Transportability for Visual Recognition	Apr 26, 2022	image-classificationImage Classification	CodeCode Available	1	5
Paxion: Patching Action Knowledge in Video-Language Foundation Models	May 18, 2023	Action UnderstandingDiagnostic	CodeCode Available	1	5
RAMP-CNN: A Novel Neural Network for Enhanced Automotive Radar Object Recognition	Nov 13, 2020	object-detectionObject Detection	CodeCode Available	1	5
Recognize Any Regions	Nov 2, 2023	object-detectionObject Detection	CodeCode Available	1	5
Divergences in Color Perception between Deep Neural Networks and Humans	Sep 11, 2023	image-classificationImage Classification	CodeCode Available	1	5
Distributed Deep Neural Networks over the Cloud, the Edge and End Devices	Sep 6, 2017	Distributed ComputingObject Recognition	CodeCode Available	1	5
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation	Aug 13, 2020	ClassificationFew-Shot Object Detection	CodeCode Available	1	5
DOCTOR: A Simple Method for Detecting Misclassification Errors	Jun 4, 2021	Object RecognitionSentiment Analysis	CodeCode Available	1	5
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects	Oct 19, 2023	3D Object Recognition6D Pose Estimation	CodeCode Available	1	5
Robust and efficient post-processing for video object detection	Sep 23, 2020	Autonomous DrivingObject	CodeCode Available	1	5
Going Deeper with Convolutions	Sep 17, 2014	General ClassificationImage Classification	CodeCode Available	1	5
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection	Jul 31, 2024	Language ModellingObject	CodeCode Available	1	5

Show:10 25 50

← PrevPage 8 of 82Next →

All datasets shape bias CIFAR10-DVS N-Caltech 101 ObjectNet (All classes)ObjectNet (ImageNet classes)ObjectNet (ImageNet classes, trained on ImageNet)DVS128 Gesture MECCANO N-CARS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Imagen	shape bias	98.7	—	Unverified
2	Stable Diffusion	shape bias	92.7	—	Unverified
3	Parti	shape bias	91.7	—	Unverified
4	ViT-22B-384	shape bias	86.4	—	Unverified
5	ViT-22B-560	shape bias	83.8	—	Unverified
6	CLIP (ViT-B)	shape bias	79.9	—	Unverified
7	ViT-22B-224	shape bias	78	—	Unverified
8	ResNet-50 (L2 eps 5.0 adv trained)	shape bias	69.5	—	Unverified
9	ResNet-50 (with strong augmentations)	shape bias	62.2	—	Unverified
10	SWSL (ResNeXt-101)	shape bias	49.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	85.55	—	Unverified
2	SSNN	Accuracy (% )	78.57	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	85.62	—	Unverified
2	SSNN	Accuracy (% )	79.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	18.75	—	Unverified
2	yun	Top 5 Accuracy	14.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	DY	Top 5 Accuracy	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	AJ2021	Top 5 Accuracy	27.68	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SSNN	Accuracy (% )	94.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Faster-RCNN	mAP	30.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	96	—	Unverified