Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework | Jul 11, 2025 | Clustering, Crowd Counting | Code Available | 0
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models | Jun 3, 2025 | Object Counting, Spatial Reasoning | Unverified | 0
Improving Contrastive Learning for Referring Expression Counting | May 28, 2025 | Contrastive Learning, Object Counting | Code Available | 0
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | May 21, 2025 | Earth Observation, Object | Code Available | 2
Expanding Zero-Shot Object Counting with Rich Prompts | May 21, 2025 | Object, Object Counting | Unverified | 0
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | May 17, 2025 | Hallucination, Object Counting | Unverified | 0
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning | May 17, 2025 | 2D Object Detection, Object Counting | Code Available | 4
Learning What NOT to Count | Apr 16, 2025 | Object Counting, Zero-Shot Counting | Unverified | 0
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment | Apr 10, 2025 | AI Agent, Attribute | Unverified | 0
MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams | Mar 26, 2025 | Mathematical Reasoning, Object Counting | Unverified | 0
A Causal Lens for Evaluating Faithfulness Metrics | Feb 26, 2025 | Decision Making, Fact Checking | Unverified | 0
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding | Feb 17, 2025 | Arithmetic Reasoning, Chart Understanding | Unverified | 0
FocalCount: Towards Class-Count Imbalance in Class-Agnostic Counting | Feb 15, 2025 | Object, Object Counting | Unverified | 0
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting | Feb 10, 2025 | Exemplar-Free Counting, Object | Code Available | 1
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis | Feb 3, 2025 | Object Counting, Scene Understanding | Unverified | 0
A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches | Jan 31, 2025 | Object Counting | Unverified | 0
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model | Jan 12, 2025 | Mamba, Object | Unverified | 0
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension | Jan 2, 2025 | Generalized Referring Expression Comprehension, Generalized Referring Expression Segmentation | Unverified | 0
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting | Jan 1, 2025 | Denoising, Object Counting | Code Available | 1
Vision Transformers for Weakly-Supervised Microorganism Enumeration | Dec 3, 2024 | Density Estimation, Instance Segmentation | Code Available | 0
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks | Nov 28, 2024 | Benchmarking, Object Counting | Code Available | 2
Counting Stacked Objects from Multi-View Images | Nov 28, 2024 | 3D geometry, Object Counting | Unverified | 0
Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark | Nov 20, 2024 | Object Counting, Optical Flow Estimation | Unverified | 0
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation | Nov 15, 2024 | Image Generation, Layout-to-Image Generation | Unverified | 0
A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Sep 27, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2
Mind the Prompt: A Novel Benchmark for Prompt-based Class-Agnostic Counting | Sep 24, 2024 | Object, Object Counting | Code Available | 1
GCA-SUNet: A Gated Context-Aware Swin-UNet for Exemplar-Free Counting | Sep 18, 2024 | Decoder, Exemplar-Free | Code Available | 0
Dense Center-Direction Regression for Object Counting and Localization with Point Supervision | Aug 26, 2024 | Object, Object Counting | Code Available | 0
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models | Aug 21, 2024 | Denoising, Image Generation | Unverified | 0
Mutually-Aware Feature Learning for Few-Shot Object Counting | Aug 19, 2024 | Object Counting | Unverified | 0
Zero-shot Object Counting with Good Exemplars | Jul 6, 2024 | Contrastive Learning, Object | Code Available | 1
CountGD: Multi-Modal Open-World Counting | Jul 5, 2024 | Object Counting, Open-vocabulary object counting | Code Available | 3
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI Agent, Descriptive | Code Available | 2
Learning Spatial Similarity Distribution for Few-shot Object Counting | May 20, 2024 | Object Counting | Code Available | 0
Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models | May 5, 2024 | Object Counting | Unverified | 0
DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2
ChatGPT and general-purpose AI count fruits in pictures surprisingly well | Apr 12, 2024 | Deep Learning, Few-Shot Learning | Unverified | 0
Counting Objects in a Robotic Hand | Apr 9, 2024 | Contrastive Learning, Object | Unverified | 0
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis | Mar 28, 2024 | Change Detection, Language Modelling | Code Available | 2
Few-shot Object Localization | Mar 19, 2024 | Model Optimization, Object | Code Available | 1
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring | Mar 14, 2024 | Object, Object Counting | Code Available | 0
TFCounter: Polishing Gems for Training-Free Object Counting | Mar 12, 2024 | Management, Object | Unverified | 0
OmniCount: Multi-label Object Counting with Semantic-Geometric Priors | Mar 8, 2024 | Object, Object Counting | Unverified | 0
AFreeCA: Annotation-Free Counting for All | Mar 7, 2024 | All, Object | Code Available | 0
Effectiveness Assessment of Recent Large Vision-Language Models | Mar 7, 2024 | Anomaly Detection, Attribute | Unverified | 0
A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video | Mar 6, 2024 | Benchmarking, Crowd Counting | Unverified | 0
Enhancing Zero-shot Counting via Language-guided Exemplar Learning | Feb 8, 2024 | Object Counting, Zero-Shot Counting | Unverified | 0
Do Object Detection Localization Errors Affect Human Performance and Trust? | Jan 31, 2024 | Object, Object Counting | Unverified | 0
Diffusion-based Data Augmentation for Object Counting Problems | Jan 25, 2024 | Crowd Counting, Data Augmentation | Unverified | 0
NWPU-MOC: A Benchmark for Fine-grained Multi-category Object Counting in Aerial Images | Jan 19, 2024 | Object, Object Counting | Code Available | 1