Object Counting

The goal of Object Counting task is to count the number of object instances in a single image or video sequence. It has many real-world applications such as traffic flow monitoring, crowdedness estimation, and product counting.

Source: Learning to Count Objects with Few Exemplar Annotations

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 158 papers

Title	Date	Tasks	Status	Hype
Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework	Jul 11, 2025	ClusteringCrowd Counting	CodeCode Available	0
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models	Jun 3, 2025	Object CountingSpatial Reasoning	—Unverified	0
Improving Contrastive Learning for Referring Expression Counting	May 28, 2025	Contrastive LearningObject Counting	CodeCode Available	0
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition	May 21, 2025	Earth ObservationObject	CodeCode Available	2
Expanding Zero-Shot Object Counting with Rich Prompts	May 21, 2025	ObjectObject Counting	—Unverified	0
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?	May 17, 2025	HallucinationObject Counting	—Unverified	0
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning	May 17, 2025	2D Object DetectionObject Counting	CodeCode Available	4
Learning What NOT to Count	Apr 16, 2025	Object CountingZero-Shot Counting	—Unverified	0
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment	Apr 10, 2025	AI AgentAttribute	—Unverified	0
MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams	Mar 26, 2025	Mathematical ReasoningObject Counting	—Unverified	0
A Causal Lens for Evaluating Faithfulness Metrics	Feb 26, 2025	Decision MakingFact Checking	—Unverified	0
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding	Feb 17, 2025	Arithmetic ReasoningChart Understanding	—Unverified	0
FocalCount: Towards Class-Count Imbalance in Class-Agnostic Counting	Feb 15, 2025	ObjectObject Counting	—Unverified	0
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting	Feb 10, 2025	Exemplar-Free CountingObject	CodeCode Available	1
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis	Feb 3, 2025	Object CountingScene Understanding	—Unverified	0
A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches	Jan 31, 2025	Object Counting	—Unverified	0
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model	Jan 12, 2025	MambaObject	—Unverified	0
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension	Jan 2, 2025	Generalized Referring Expression ComprehensionGeneralized Referring Expression Segmentation	—Unverified	0
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting	Jan 1, 2025	DenoisingObject Counting	CodeCode Available	1
Vision Transformers for Weakly-Supervised Microorganism Enumeration	Dec 3, 2024	Density EstimationInstance Segmentation	CodeCode Available	0
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks	Nov 28, 2024	BenchmarkingObject Counting	CodeCode Available	2
Counting Stacked Objects from Multi-View Images	Nov 28, 2024	3D geometryObject Counting	—Unverified	0
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark	Nov 20, 2024	Object CountingOptical Flow Estimation	—Unverified	0
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation	Nov 15, 2024	Image GenerationLayout-to-Image Generation	—Unverified	0
A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation	Sep 27, 2024	Exemplar-Free CountingFew-shot Object Counting and Detection	CodeCode Available	2
Mind the Prompt: A Novel Benchmark for Prompt-based Class-Agnostic Counting	Sep 24, 2024	ObjectObject Counting	CodeCode Available	1
GCA-SUNet: A Gated Context-Aware Swin-UNet for Exemplar-Free Counting	Sep 18, 2024	DecoderExemplar-Free	CodeCode Available	0
Dense Center-Direction Regression for Object Counting and Localization with Point Supervision	Aug 26, 2024	ObjectObject Counting	CodeCode Available	0
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models	Aug 21, 2024	DenoisingImage Generation	—Unverified	0
Mutually-Aware Feature Learning for Few-Shot Object Counting	Aug 19, 2024	Object Counting	—Unverified	0
Zero-shot Object Counting with Good Exemplars	Jul 6, 2024	Contrastive LearningObject	CodeCode Available	1
CountGD: Multi-Modal Open-World Counting	Jul 5, 2024	Object CountingOpen-vocabulary object counting	CodeCode Available	3
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent	Jun 11, 2024	AI AgentDescriptive	CodeCode Available	2
Learning Spatial Similarity Distribution for Few-shot Object Counting	May 20, 2024	Object Counting	CodeCode Available	0
Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models	May 5, 2024	Object Counting	—Unverified	0
DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting	Apr 25, 2024	Exemplar-Free CountingFew-shot Object Counting and Detection	CodeCode Available	2
ChatGPT and general-purpose AI count fruits in pictures surprisingly well	Apr 12, 2024	Deep LearningFew-Shot Learning	—Unverified	0
Counting Objects in a Robotic Hand	Apr 9, 2024	Contrastive LearningObject	—Unverified	0
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis	Mar 28, 2024	Change DetectionLanguage Modelling	CodeCode Available	2
Few-shot Object Localization	Mar 19, 2024	Model OptimizationObject	CodeCode Available	1
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring	Mar 14, 2024	ObjectObject Counting	—Unverified	0
TFCounter:Polishing Gems for Training-Free Object Counting	Mar 12, 2024	ManagementObject	—Unverified	0
OmniCount: Multi-label Object Counting with Semantic-Geometric Priors	Mar 8, 2024	ObjectObject Counting	—Unverified	0
AFreeCA: Annotation-Free Counting for All	Mar 7, 2024	AllObject	CodeCode Available	0
Effectiveness Assessment of Recent Large Vision-Language Models	Mar 7, 2024	Anomaly DetectionAttribute	—Unverified	0
A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video	Mar 6, 2024	BenchmarkingCrowd Counting	—Unverified	0
Enhancing Zero-shot Counting via Language-guided Exemplar Learning	Feb 8, 2024	Object CountingZero-Shot Counting	—Unverified	0
Do Object Detection Localization Errors Affect Human Performance and Trust?	Jan 31, 2024	ObjectObject Counting	—Unverified	0
Diffusion-based Data Augmentation for Object Counting Problems	Jan 25, 2024	Crowd CountingData Augmentation	—Unverified	0
NWPU-MOC: A Benchmark for Fine-grained Multi-category Object Counting in Aerial Images	Jan 19, 2024	ObjectObject Counting	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 4Next →

All datasets FSC147 CARPK Pascal VOC 2007 count-test COCO count-test TallyQA-Complex TallyQA-Simple HowMany-QA PASCAL VOC Omnicount-191 TRANCOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	FamNet	MAE(test)	22.08	—	Unverified
2	Omnicount (Open vocabulary, multi-label, without training)	MAE(test)	18.63	—	Unverified
3	RCC	MAE(test)	17.12	—	Unverified
4	Counting-DETR	MAE(test)	16.79	—	Unverified
5	CounTX (uses text descriptions instead of visual exemplars)	MAE(test)	15.88	—	Unverified
6	LaoNet	MAE(test)	15.78	—	Unverified
7	BMNet+	MAE(test)	14.62	—	Unverified
8	SAFECount	MAE(test)	14.32	—	Unverified
9	GCA-SUN	MAE(test)	14	—	Unverified
10	SPDCN	MAE(test)	13.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	YOLO (2016)	MAE	156	—	Unverified
2	YOLO9000opt (2017)	MAE	130.4	—	Unverified
3	Faster R-CNN (2015)	MAE	39.88	—	Unverified
4	RetinaNet (2018)	MAE	24.58	—	Unverified
5	LPN Counting (2017)	MAE	22.76	—	Unverified
6	One-Look Regression (2016)	MAE	21.88	—	Unverified
7	RetinaNet (2018)	MAE	16.62	—	Unverified
8	CounTX (uses arbitrary text input to specify object to count, used "the cars" for CARPK)	MAE	8.13	—	Unverified
9	Soft-IoU + EM-Merger unit	MAE	6.77	—	Unverified
10	VLCounter	MAE	6.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fast-RCNN	m-reIRMSE-nz	0.85	—	Unverified
2	glance-noft-2L	m-reIRMSE-nz	0.73	—	Unverified
3	LC-PSPNet	m-reIRMSE-nz	0.7	—	Unverified
4	Seq-sub-ft-3x3	m-reIRMSE-nz	0.68	—	Unverified
5	ens	m-reIRMSE-nz	0.65	—	Unverified
6	Supervised Density Map	m-reIRMSE-nz	0.61	—	Unverified
7	LC-ResFCN	m-reIRMSE-nz	0.61	—	Unverified
8	Omnicount	mRMSE	0	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Aso-sub-ft-3x3	m-reIRMSE	0.24	—	Unverified
2	glance-ft-2L	m-reIRMSE	0.23	—	Unverified
3	Fast-RCNN	m-reIRMSE	0.2	—	Unverified
4	LC-ResFCN	m-reIRMSE	0.19	—	Unverified
5	Seq-sub-ft-3x3	m-reIRMSE	0.18	—	Unverified
6	Supervised Density Map	m-reIRMSE	0.18	—	Unverified
7	ens	m-reIRMSE	0.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SMoLA-PaLI-X Specialist	Accuracy	77.1	—	Unverified
2	PaLI-X-VPD	Accuracy	76.6	—	Unverified
3	SMoLA-PaLI-X Generalist (0 shot)	Accuracy	70.7	—	Unverified
4	MoVie-ResNeXt	Accuracy	56.8	—	Unverified
5	RCN	Accuracy	56.2	—	Unverified
6	MoVie	Accuracy	54.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SMoLA-PaLI-X Specialist	Accuracy	86.3	—	Unverified
2	PaLI-X-VPD	Accuracy	86.2	—	Unverified
3	SMoLA-PaLI-X Generalist (0 shot)	Accuracy	83.3	—	Unverified
4	MoVie-ResNeXt	Accuracy	74.9	—	Unverified
5	RCN	Accuracy	71.8	—	Unverified
6	MoVie	Accuracy	70.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MoVie-ResNeXt	Accuracy	64	—	Unverified
2	MoVie	Accuracy	61.2	—	Unverified
3	RCN	Accuracy	60.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CEOES	mRMSE	0.42	—	Unverified
2	ILC	mRMSE	0.29	—	Unverified
3	TFOC	mRMSE	0.01	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Omnicount	mRMSE	0	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GauNet (ResNet-50)	MAE	2.1	—	Unverified