Object Recognition

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 2042 papers

Title	Date	Tasks	Status	Hype	Score
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency	Apr 24, 2025	BenchmarkingMath	CodeCode Available	1	5
Computing the Testing Error Without a Testing Set	Jun 1, 2020	Object RecognitionSemantic Segmentation	CodeCode Available	1	5
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency	Jun 16, 2022	Domain AdaptationObject Recognition	CodeCode Available	1	5
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection	Dec 23, 2024	object-detectionObject Detection	CodeCode Available	1	5
Large-scale Remote Sensing Image Target Recognition and Automatic Annotation	Nov 12, 2024	Ensemble LearningObject	CodeCode Available	1	5
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics	Dec 17, 2024	Objectobject-detection	CodeCode Available	1	5
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects	Oct 19, 2023	3D Object Recognition6D Pose Estimation	CodeCode Available	1	5
Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022	Oct 1, 2022	Few-Shot Image ClassificationObject Recognition	CodeCode Available	1	5
Convolutional Neural Networks with Gated Recurrent Connections	Jun 5, 2021	object-detectionObject Detection	CodeCode Available	1	5
Adaptive Subspaces for Few-Shot Learning	Jun 1, 2020	Few-Shot Image ClassificationFew-Shot Learning	CodeCode Available	1	5
Rehearsal-Free Continual Learning over Small Non-I.I.D. Batches	Jul 8, 2019	class-incremental learningClass Incremental Learning	CodeCode Available	1	5
Intriguing properties of generative classifiers	Sep 28, 2023	Object Recognition	CodeCode Available	1	5
F-SIOL-310: A Robotic Dataset and Benchmark for Few-Shot Incremental Object Learning	Mar 23, 2021	Incremental LearningObject	CodeCode Available	1	5
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models	Apr 17, 2024	HallucinationMultimodal Reasoning	CodeCode Available	1	5
Explainable GeoAI: Can saliency maps help interpret artificial intelligence's learning process? An empirical study on natural feature detection	Mar 16, 2023	Deep LearningObject Recognition	CodeCode Available	1	5
Efficient Attention: Attention with Linear Complexities	Dec 4, 2018	Depth EstimationExtractive Text Summarization	CodeCode Available	1	5
Full-Glow: Fully conditional Glow for more realistic image generation	Dec 10, 2020	Image GenerationObject Recognition	CodeCode Available	1	5
Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition	Mar 23, 2022	Event-based visionObject Recognition	CodeCode Available	1	5
EventCLIP: Adapting CLIP for Event-based Object Recognition	Jun 10, 2023	Few-Shot LearningObject	CodeCode Available	1	5
Expanding Event Modality Applications through a Robust CLIP-Based Encoder	Dec 4, 2024	Few-Shot LearningObject Recognition	CodeCode Available	1	5
Equalization Loss for Long-Tailed Object Recognition	Mar 11, 2020	Long-tail LearningObject	CodeCode Available	1	5
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models	Mar 26, 2020	DiversityImage Captioning	CodeCode Available	1	5
Empirical Upper Bound, Error Diagnosis and Invariance Analysis of Modern Object Detectors	Apr 5, 2020	Objectobject-detection	CodeCode Available	1	5
EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge Distillation	Nov 24, 2021	Event-based Object SegmentationKnowledge Distillation	CodeCode Available	1	5
Explainability-Aware One Point Attack for Point Cloud Neural Networks	Oct 8, 2021	3D Object RecognitionAdversarial Robustness	CodeCode Available	1	5
Domain Generalization for Object Recognition with Multi-task Autoencoders	Aug 31, 2015	DenoisingDomain Generalization	CodeCode Available	1	5
Attribution in Scale and Space	Apr 3, 2020	AttributeObject Recognition	CodeCode Available	1	5
Doubly Right Object Recognition: A Why Prompt for Visual Rationales	Dec 12, 2022	Object Recognition	CodeCode Available	1	5
Do Adversarially Robust ImageNet Models Transfer Better?	Jul 16, 2020	Object RecognitionTransfer Learning	CodeCode Available	1	5
Billion-scale semi-supervised learning for image classification	May 2, 2019	ClassificationGeneral Classification	CodeCode Available	1	5
Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like?	Jan 2, 2020	Object Recognition	CodeCode Available	1	5
Enriching ImageNet with Human Similarity Judgments and Psychological Embeddings	Nov 22, 2020	Bayesian InferenceObject Recognition	CodeCode Available	1	5
Event-based Asynchronous Sparse Convolutional Networks	Mar 20, 2020	object-detectionObject Detection	CodeCode Available	1	5
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video	Sep 25, 2022	Long-tail Video Object SegmentationMulti-Object Tracking	CodeCode Available	1	5
EventRPG: Event Data Augmentation with Relevance Propagation Guidance	Mar 14, 2024	Action RecognitionData Augmentation	CodeCode Available	1	5
Evolving Deep Neural Networks	Mar 1, 2017	Deep LearningImage Captioning	CodeCode Available	1	5
CLoVe: Encoding Compositional Language in Contrastive Vision-Language Models	Feb 22, 2024	Object RecognitionRetrieval	CodeCode Available	1	5
Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification	Aug 15, 2024	image-classificationImage Classification	CodeCode Available	1	5
Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition	Mar 28, 2020	ObjectObject Recognition	CodeCode Available	1	5
Causal Transportability for Visual Recognition	Apr 26, 2022	image-classificationImage Classification	CodeCode Available	1	5
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed Data	Dec 14, 2023	Contrastive LearningFederated Learning	CodeCode Available	1	5
FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery	Mar 9, 2021	Deep LearningObject	CodeCode Available	1	5
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation	Aug 13, 2020	ClassificationFew-Shot Object Detection	CodeCode Available	1	5
From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection	Apr 26, 2023	Event-based visionobject-detection	CodeCode Available	1	5
Divergences in Color Perception between Deep Neural Networks and Humans	Sep 11, 2023	image-classificationImage Classification	CodeCode Available	1	5
CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Oct 2, 2024	AstronomyImage Quality Assessment	CodeCode Available	1	5
Compact Generalized Non-local Network	Oct 31, 2018	Object DetectionObject Recognition	CodeCode Available	1	5
Comics Datasets Framework: Mix of Comics datasets for detection benchmarking	Jul 3, 2024	BenchmarkingObject	CodeCode Available	1	5
DOCTOR: A Simple Method for Detecting Misclassification Errors	Jun 4, 2021	Object RecognitionSentiment Analysis	CodeCode Available	1	5
Dual-Hybrid Attention Network for Specular Highlight Removal	Jul 17, 2024	highlight removalObject Recognition	CodeCode Available	1	5

Show:10 25 50

← PrevPage 2 of 41Next →

All datasets shape bias CIFAR10-DVS N-Caltech 101 ObjectNet (All classes)ObjectNet (ImageNet classes)ObjectNet (ImageNet classes, trained on ImageNet)DVS128 Gesture MECCANO N-CARS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Imagen	shape bias	98.7	—	Unverified
2	Stable Diffusion	shape bias	92.7	—	Unverified
3	Parti	shape bias	91.7	—	Unverified
4	ViT-22B-384	shape bias	86.4	—	Unverified
5	ViT-22B-560	shape bias	83.8	—	Unverified
6	CLIP (ViT-B)	shape bias	79.9	—	Unverified
7	ViT-22B-224	shape bias	78	—	Unverified
8	ResNet-50 (L2 eps 5.0 adv trained)	shape bias	69.5	—	Unverified
9	ResNet-50 (with strong augmentations)	shape bias	62.2	—	Unverified
10	SWSL (ResNeXt-101)	shape bias	49.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	85.55	—	Unverified
2	SSNN	Accuracy (% )	78.57	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	85.62	—	Unverified
2	SSNN	Accuracy (% )	79.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	18.75	—	Unverified
2	yun	Top 5 Accuracy	14.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	DY	Top 5 Accuracy	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ObjectNet-Baseline	Top 5 Accuracy	52.24	—	Unverified
2	AJ2021	Top 5 Accuracy	27.68	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SSNN	Accuracy (% )	94.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Faster-RCNN	mAP	30.39	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Spike-VGG11	Accuracy (% )	96	—	Unverified