SOTAVerified

Object Counting

The goal of Object Counting task is to count the number of object instances in a single image or video sequence. It has many real-world applications such as traffic flow monitoring, crowdedness estimation, and product counting.

Source: Learning to Count Objects with Few Exemplar Annotations

Papers

Showing 101150 of 158 papers

TitleStatusHype
OmniCount: Multi-label Object Counting with Semantic-Geometric Priors0
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts0
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models0
On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications0
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark0
Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models0
People, Penguins and Petri Dishes: Adapting Object Counting Models To New Visual Domains And Object Types Without Forgetting0
Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map0
Effectiveness Assessment of Recent Large Vision-Language Models0
Dynamic Conditional Networks for Few-Shot Learning0
Drone-based Object Counting by Spatially Regularized Regional Proposal Network0
Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning0
Do Object Detection Localization Errors Affect Human Performance and Trust?0
Distortion-Adaptive Grape Bunch Counting for Omnidirectional Images0
Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting0
Diffusion-based Data Augmentation for Object Counting Problems0
Reverse Perspective Network for Perspective-Aware Object Counting0
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding0
Density Map Distillation for Incremental Object Counting0
DeepSetNet: Predicting Sets with Deep Neural Networks0
RSVQA: Visual Question Answering for Remote Sensing Data0
Deep-Learning-based Counting Methods, Datasets, and Applications in Agriculture -- A Review0
Counting the uncountable: deep semantic density estimation from Space0
Segmentação e contagem de troncos de madeira utilizando deep learning e processamento de imagens0
Counting Stacked Objects from Multi-View Images0
Counting Objects in a Robotic Hand0
Counting dense objects in remote sensing images0
SIMCO: SIMilarity-based object COunting0
Single Image Object Counting and Localizing using Active-Learning0
Counting and Locating High-Density Objects Using Convolutional Neural Network0
Contextual Hourglass Networks for Segmentation and Density Estimation0
CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection0
ChatGPT and general-purpose AI count fruits in pictures surprisingly well0
A Few-Shot Sequential Approach for Object Counting0
0-1 phase transitions in sparse spiked matrix estimation0
TFCounter:Polishing Gems for Training-Free Object Counting0
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation0
Towards Locally Consistent Object Counting with Constrained Multi-stage Convolutional Neural Networks0
A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video0
A-CCNN: adaptive ccnn for density estimation and crowd counting0
A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches0
T-Rex: Counting by Visual Prompting0
TS4Net: Two-Stage Sample Selective Strategy for Rotating Object Detection0
Understanding the Ability of Deep Neural Networks to Count Connected Components in Images0
A Statistical Method for Object Counting0
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?0
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis0
A Causal Lens for Evaluating Faithfulness Metrics0
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models0
An Image Processing based Object Counting Approach for Machine Vision Application0
Show:102550
← PrevPage 3 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FamNetMAE(test)22.08Unverified
2Omnicount (Open vocabulary, multi-label, without training)MAE(test)18.63Unverified
3RCCMAE(test)17.12Unverified
4Counting-DETRMAE(test)16.79Unverified
5CounTX (uses text descriptions instead of visual exemplars)MAE(test)15.88Unverified
6LaoNetMAE(test)15.78Unverified
7BMNet+MAE(test)14.62Unverified
8SAFECountMAE(test)14.32Unverified
9GCA-SUNMAE(test)14Unverified
10SPDCNMAE(test)13.51Unverified
#ModelMetricClaimedVerifiedStatus
1YOLO (2016)MAE156Unverified
2YOLO9000opt (2017)MAE130.4Unverified
3Faster R-CNN (2015)MAE39.88Unverified
4RetinaNet (2018)MAE24.58Unverified
5LPN Counting (2017)MAE22.76Unverified
6One-Look Regression (2016)MAE21.88Unverified
7RetinaNet (2018)MAE16.62Unverified
8CounTX (uses arbitrary text input to specify object to count, used "the cars" for CARPK)MAE8.13Unverified
9Soft-IoU + EM-Merger unitMAE6.77Unverified
10VLCounterMAE6.46Unverified
#ModelMetricClaimedVerifiedStatus
1Fast-RCNNm-reIRMSE-nz0.85Unverified
2glance-noft-2Lm-reIRMSE-nz0.73Unverified
3LC-PSPNetm-reIRMSE-nz0.7Unverified
4Seq-sub-ft-3x3m-reIRMSE-nz0.68Unverified
5ensm-reIRMSE-nz0.65Unverified
6Supervised Density Mapm-reIRMSE-nz0.61Unverified
7LC-ResFCNm-reIRMSE-nz0.61Unverified
8OmnicountmRMSE0Unverified
#ModelMetricClaimedVerifiedStatus
1Aso-sub-ft-3x3m-reIRMSE0.24Unverified
2glance-ft-2Lm-reIRMSE0.23Unverified
3Fast-RCNNm-reIRMSE0.2Unverified
4LC-ResFCNm-reIRMSE0.19Unverified
5Seq-sub-ft-3x3m-reIRMSE0.18Unverified
6Supervised Density Mapm-reIRMSE0.18Unverified
7ensm-reIRMSE0.18Unverified
#ModelMetricClaimedVerifiedStatus
1SMoLA-PaLI-X SpecialistAccuracy77.1Unverified
2PaLI-X-VPDAccuracy76.6Unverified
3SMoLA-PaLI-X Generalist (0 shot)Accuracy70.7Unverified
4MoVie-ResNeXtAccuracy56.8Unverified
5RCNAccuracy56.2Unverified
6MoVieAccuracy54.1Unverified
#ModelMetricClaimedVerifiedStatus
1SMoLA-PaLI-X SpecialistAccuracy86.3Unverified
2PaLI-X-VPDAccuracy86.2Unverified
3SMoLA-PaLI-X Generalist (0 shot)Accuracy83.3Unverified
4MoVie-ResNeXtAccuracy74.9Unverified
5RCNAccuracy71.8Unverified
6MoVieAccuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1MoVie-ResNeXtAccuracy64Unverified
2MoVieAccuracy61.2Unverified
3RCNAccuracy60.3Unverified
#ModelMetricClaimedVerifiedStatus
1CEOESmRMSE0.42Unverified
2ILCmRMSE0.29Unverified
3TFOCmRMSE0.01Unverified
#ModelMetricClaimedVerifiedStatus
1OmnicountmRMSE0Unverified
#ModelMetricClaimedVerifiedStatus
1GauNet (ResNet-50)MAE2.1Unverified