MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection Mar 25, 2025 3DGS object-detection
— Unverified 0Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving Mar 24, 2025 Autonomous Driving Knowledge Graphs
— Unverified 0Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models Mar 21, 2025 Diagnostic Object Recognition
— Unverified 0TULIP: Towards Unified Language-Image Pretraining Mar 19, 2025 Contrastive Learning Data Augmentation
— Unverified 0Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation Mar 14, 2025 Object Recognition
— Unverified 0OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Mar 13, 2025 Object Recognition Semantic Segmentation
— Unverified 0Seeing What's Not There: Spurious Correlation in Multimodal LLMs Mar 11, 2025 Hallucination Object
— Unverified 0Object-Centric World Model for Language-Guided Manipulation Mar 8, 2025 Autonomous Driving model
— Unverified 0Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation Mar 5, 2025 Object Object Recognition
— Unverified 0Identity documents recognition and detection using semantic segmentation with convolutional neural network Mar 3, 2025 Object Recognition Semantic Segmentation
— Unverified 0Deep learning based infrared small object segmentation: Challenges and future directions Feb 20, 2025 Autonomous Vehicles Object Recognition
— Unverified 0RAPTOR: Refined Approach for Product Table Object Recognition Feb 19, 2025 Object Object Recognition
— Unverified 0Revealing Bias Formation in Deep Neural Networks Through the Geometric Mechanisms of Human Visual Decoupling Feb 17, 2025 Object Object Recognition
— Unverified 0"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models Feb 17, 2025 Object Recognition Question Answering
— Unverified 0Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition Feb 15, 2025 3D Object Recognition Object Recognition
— Unverified 0Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models Feb 12, 2025 Attribute Diagnostic
Code Code Available 1DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability Feb 8, 2025 Data Augmentation Object Recognition
— Unverified 0Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics Jan 26, 2025 Object Recognition Scene Understanding
— Unverified 0Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities Jan 25, 2025 Hallucination Object
— Unverified 0NUDT4MSTAR: A Large Dataset and Benchmark Towards Remote Sensing Object Recognition in the Wild Jan 23, 2025 Earth Observation Object Recognition
Code Code Available 2Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement Jan 22, 2025 Object Recognition speech-recognition
— Unverified 0RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Jan 21, 2025 Autonomous Driving Object Recognition
— Unverified 0AI-Powered Assistive Technologies for Visual Impairment Jan 14, 2025 Object Recognition text-to-speech
— Unverified 0Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Jan 14, 2025 Object Recognition Text Generation
— Unverified 0Guided SAM: Label-Efficient Part Segmentation Jan 13, 2025 Object Object Recognition
— Unverified 0Hierarchical Superpixel Segmentation via Structural Information Theory Jan 13, 2025 graph construction graph partitioning
Code Code Available 0Perceptual Inductive Bias Is What You Need Before Contrastive Learning Jan 1, 2025 Contrastive Learning Depth Estimation
— Unverified 0Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models Jan 1, 2025 Attribute Diagnostic
— Unverified 0Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering Dec 30, 2024 Image Captioning Object Recognition
— Unverified 0Sample Correlation for Fingerprinting Deep Face Recognition Dec 30, 2024 Adversarial Defense Emotion Recognition
Code Code Available 0AI-based Wearable Vision Assistance System for the Visually Impaired: Integrating Real-Time Object Recognition and Contextual Understanding Using Large Vision-Language Models Dec 28, 2024 Object Recognition Raspberry Pi 4
— Unverified 0The same but different: impact of animal facility sanitary status on a transgenic mouse model of Alzheimer's disease Dec 24, 2024 Object Recognition
— Unverified 0Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Dec 23, 2024 object-detection Object Detection
Code Code Available 1SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization Dec 21, 2024 Image Captioning Multimodal Reasoning
Code Code Available 0Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition Dec 18, 2024 Attribute Descriptive
Code Code Available 0Targeted View-Invariant Adversarial Perturbations for 3D Object Recognition Dec 17, 2024 3D Object Recognition Adversarial Robustness
Code Code Available 0Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images Dec 17, 2024 Computational Efficiency Object
— Unverified 0CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics Dec 17, 2024 Object object-detection
Code Code Available 1WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model Dec 13, 2024 Autonomous Driving Decision Making
Code Code Available 1CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs Dec 11, 2024 Large Language Model Object
— Unverified 0Proactive Adversarial Defense: Harnessing Prompt Tuning in Vision-Language Models to Detect Unseen Backdoored Images Dec 11, 2024 Adversarial Defense backdoor defense
— Unverified 0Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis Dec 10, 2024 2D Object Detection 3D Object Detection
— Unverified 0Can foundation models actively gather information in interactive environments to test hypotheses? Dec 9, 2024 Object Recognition
— Unverified 0Expanding Event Modality Applications through a Robust CLIP-Based Encoder Dec 4, 2024 Few-Shot Learning Object Recognition
Code Code Available 1Optimized CNNs for Rapid 3D Point Cloud Object Recognition Dec 3, 2024 Computational Efficiency object-detection
— Unverified 0LVLM-COUNT: Enhancing the Counting Ability of Large Vision-Language Models Dec 1, 2024 Object Recognition
Code Code Available 0Textured As-Is BIM via GIS-informed Point Cloud Segmentation Nov 28, 2024 Object Recognition Point Cloud Segmentation
— Unverified 0Verbalized Representation Learning for Interpretable Few-Shot Generalization Nov 27, 2024 Language Modeling Language Modelling
Code Code Available 0Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents Nov 27, 2024 Autonomous Navigation Object Recognition
Code Code Available 0NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects? Nov 26, 2024 Attribute Multiple-choice
— Unverified 0