Title | Date | Tags | Status
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization | Dec 21, 2024 | Image Captioning Multimodal Reasoning | Code Available
Demystifying the Potential of ChatGPT-4 Vision for Construction Progress Monitoring | Dec 20, 2024 | Object Localization | Unverified
SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Dec 13, 2024 | GPU Object Localization | Unverified
3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning 3D visual grounding | Unverified
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Dec 5, 2024 | 3D visual grounding Object Localization | Unverified
GraPix: Exploring Graph Modularity Optimization for Unsupervised Pixel Clustering | Dec 4, 2024 | Attribute Clustering | Code Available
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations | Dec 2, 2024 | Object Localization | Unverified
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Nov 29, 2024 | 3D Multi-Object Tracking 3D Object Detection | Code Available
ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos | Nov 28, 2024 | Object Object Localization | Unverified
GloFinder: AI-empowered QuPath Plugin for WSI-level Glomerular Detection, Visualization, and Curation | Nov 27, 2024 | Object Localization whole slide images | Unverified
Probing the Mid-level Vision Capabilities of Self-Supervised Learning | Nov 25, 2024 | Object Localization Self-Supervised Learning | Unverified
Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot | Nov 22, 2024 | Object Localization Task Planning | Unverified
FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting | Nov 20, 2024 | Dimensionality Reduction GPU | Unverified
YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object Detection Autonomous Driving | Code Available
Text-guided Zero-Shot Object Localization | Nov 18, 2024 | Object Object Localization | Unverified
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Nov 15, 2024 | Descriptive Object | Unverified
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes | Oct 18, 2024 | 3D geometry object-detection | Unverified
Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification | Oct 17, 2024 | Object Localization Sketch Recognition | Code Available
Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation | Oct 16, 2024 | Multi-Task Learning Object Localization | Unverified
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Oct 8, 2024 | Instance Segmentation Object | Unverified
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation | Sep 24, 2024 | Contrastive Learning Object Localization | Unverified
QUB-PHEO: A Visual-Based Dyadic Multi-View Dataset for Intention Inference in Collaborative Assembly | Sep 23, 2024 | Object Localization | Code Available
PMR-Net: Parallel Multi-Resolution Encoder-Decoder Network Framework for Medical Image Segmentation | Sep 19, 2024 | Decoder Image Segmentation | Unverified
Do Pre-trained Vision-Language Models Encode Object States? | Sep 16, 2024 | Language Modeling Language Modelling | Code Available
Top-GAP: Integrating Size Priors in CNNs for more Interpretability, Robustness, and Bias Mitigation | Sep 7, 2024 | Object Localization | Unverified
Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift | Sep 5, 2024 | Autonomous Driving Benchmarking | Unverified
Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems | Sep 3, 2024 | image-classification Image Classification | Code Available
Multi-scale Multi-instance Visual Sound Localization and Segmentation | Aug 31, 2024 | Object Localization | Unverified
Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging | Aug 30, 2024 | Diagnostic Image Segmentation | Unverified
Optimal Weight Scheme for Fusion-Assisted Cooperative Multi-Monostatic Object Localization in 6G Networks | Aug 29, 2024 | Object Localization | Unverified
Multi-Beam Object-Localization for Millimeter-Wave ISAC-Aided Connected Autonomous Vehicles | Aug 26, 2024 | Autonomous Vehicles Integrated sensing and communication | Unverified
Stimulating Imagination: Towards General-purpose Object Rearrangement | Aug 3, 2024 | Object Object Localization | Unverified
Categorical Knowledge Fused Recognition: Fusing Hierarchical Knowledge with Image Classification through Aligning and Deep Metric Learning | Jul 30, 2024 | Classification image-classification | Unverified
A Model Generalization Study in Localizing Indoor Cows with COw LOcalization (COLO) dataset | Jul 29, 2024 | Data Augmentation Object Localization | Unverified
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments | Jul 25, 2024 | Object Localization | Unverified
PEEKABOO: Hiding parts of an image for unsupervised object localization | Jul 24, 2024 | Object object-detection | Code Available
DenseTrack: Drone-based Crowd Tracking via Density-aware Motion-appearance Synergy | Jul 24, 2024 | Crowd Counting Language Modeling | Code Available
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Jul 18, 2024 | Hallucination object-detection | Unverified
Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained Videos | Jul 8, 2024 | Object Localization Weakly-Supervised Object Localization | Code Available
ALINA: Advanced Line Identification and Notation Algorithm | Jun 13, 2024 | Lane Labeling Object Localization | Code Available
FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal Sensors | Jun 10, 2024 | Object Localization | Code Available
Leveraging Activations for Superpixel Explanations | Jun 7, 2024 | Object Localization Superpixels | Unverified
Explaining Multi-modal Large Language Models by Analyzing their Vision Perception | May 23, 2024 | Object Localization | Code Available
Concept Visualization: Explaining the CLIP Multi-modal Embedding Using WordNet | May 23, 2024 | Object Localization Out-of-Distribution Detection | Code Available
Masked Multi-Query Slot Attention for Unsupervised Object Discovery | Apr 30, 2024 | Object object-detection | Code Available
Source-Free Domain Adaptation of Weakly-Supervised Object Localization Models for Histology | Apr 29, 2024 | Contrastive Learning Domain Adaptation | Code Available
Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Apr 17, 2024 | 3D Object Detection Object | Unverified
A Realistic Protocol for Evaluation of Weakly Supervised Object Localization | Apr 15, 2024 | Model Selection Object | Code Available
Real-world Instance-specific Image Goal Navigation: Bridging Domain Gaps via Contrastive Learning | Apr 15, 2024 | Contrastive Learning Deblurring | Unverified
Improving Weakly-Supervised Object Localization Using Adversarial Erasing and Pseudo Label | Apr 15, 2024 | Object Object Localization | Unverified