LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition (Apr 27, 2025). Tags: 3D Object Recognition, Object.
Disaggregated Deep Learning via In-Physics Computing at Radio Frequency (Apr 24, 2025). Tags: Autonomous Navigation, Deep Learning.
V^2R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations (Apr 23, 2025). Tags: Dataset Generation, Object Recognition.
Naturally Computed Scale Invariance in the Residual Stream of ResNet18 (Apr 22, 2025). Tags: Object Recognition. Code available.
Quantum Doubly Stochastic Transformers (Apr 22, 2025). Tags: Inductive Bias, Object Recognition.
DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment (Apr 16, 2025). Tags: Language Modeling, Language Modelling.
Visual Language Models show widespread visual deficits on neuropsychological tests (Apr 15, 2025). Tags: Object Recognition, Visual Reasoning.
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track (Apr 14, 2025). Tags: Data Augmentation, Object. Code available.
Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review (Apr 11, 2025). Tags: Object Recognition, Optical Flow Estimation.
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition (Apr 8, 2025). Tags: Image Generation, Object.
Advancing Egocentric Video Question Answering with Multimodal Large Language Models (Apr 6, 2025). Tags: Object Recognition, Question Answering.
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users (Mar 28, 2025). Tags: Object Recognition, Reading Comprehension.
ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection (Mar 28, 2025). Tags: Action Recognition, Human-Object Interaction Detection.
Foveated Instance Segmentation (Mar 27, 2025). Tags: Instance Segmentation, Object Recognition. Code available.
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset (Mar 27, 2025). Tags: Knowledge Distillation, Object Recognition.
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection (Mar 26, 2025). Tags: Object Recognition, Symmetry Detection.
MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection (Mar 25, 2025). Tags: 3DGS, object-detection.
Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving (Mar 24, 2025). Tags: Autonomous Driving, Knowledge Graphs.
Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models (Mar 21, 2025). Tags: Diagnostic, Object Recognition.
TULIP: Towards Unified Language-Image Pretraining (Mar 19, 2025). Tags: Contrastive Learning, Data Augmentation.
Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation (Mar 14, 2025). Tags: Object Recognition.
OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions (Mar 13, 2025). Tags: Object Recognition, Semantic Segmentation.
Seeing What's Not There: Spurious Correlation in Multimodal LLMs (Mar 11, 2025). Tags: Hallucination, Object.
Object-Centric World Model for Language-Guided Manipulation (Mar 8, 2025). Tags: Autonomous Driving, model.
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation (Mar 5, 2025). Tags: Object, Object Recognition.
Identity documents recognition and detection using semantic segmentation with convolutional neural network (Mar 3, 2025). Tags: Object Recognition, Semantic Segmentation.
Deep learning based infrared small object segmentation: Challenges and future directions (Feb 20, 2025). Tags: Autonomous Vehicles, Object Recognition.
RAPTOR: Refined Approach for Product Table Object Recognition (Feb 19, 2025). Tags: Object, Object Recognition.
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models (Feb 17, 2025). Tags: Object Recognition, Question Answering.
Revealing Bias Formation in Deep Neural Networks Through the Geometric Mechanisms of Human Visual Decoupling (Feb 17, 2025). Tags: Object, Object Recognition.
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition (Feb 15, 2025). Tags: 3D Object Recognition, Object Recognition.
DCENWCNet: A Deep CNN Ensemble Network for White Blood Cell Classification with LIME-Based Explainability (Feb 8, 2025). Tags: Data Augmentation, Object Recognition.
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics (Jan 26, 2025). Tags: Object Recognition, Scene Understanding.
Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities (Jan 25, 2025). Tags: Hallucination, Object.
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement (Jan 22, 2025). Tags: Object Recognition, speech-recognition.
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression (Jan 21, 2025). Tags: Autonomous Driving, Object Recognition.
AI-Powered Assistive Technologies for Visual Impairment (Jan 14, 2025). Tags: Object Recognition, text-to-speech.
Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time (Jan 14, 2025). Tags: Object Recognition, Text Generation.
Guided SAM: Label-Efficient Part Segmentation (Jan 13, 2025). Tags: Object, Object Recognition.
Hierarchical Superpixel Segmentation via Structural Information Theory (Jan 13, 2025). Tags: graph construction, graph partitioning. Code available.
Perceptual Inductive Bias Is What You Need Before Contrastive Learning (Jan 1, 2025). Tags: Contrastive Learning, Depth Estimation.
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models (Jan 1, 2025). Tags: Attribute, Diagnostic.
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering (Dec 30, 2024). Tags: Image Captioning, Object Recognition.
Sample Correlation for Fingerprinting Deep Face Recognition (Dec 30, 2024). Tags: Adversarial Defense, Emotion Recognition. Code available.
AI-based Wearable Vision Assistance System for the Visually Impaired: Integrating Real-Time Object Recognition and Contextual Understanding Using Large Vision-Language Models (Dec 28, 2024). Tags: Object Recognition, Raspberry Pi 4.
The same but different: impact of animal facility sanitary status on a transgenic mouse model of Alzheimer's disease (Dec 24, 2024). Tags: Object Recognition.
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization (Dec 21, 2024). Tags: Image Captioning, Multimodal Reasoning. Code available.
Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition (Dec 18, 2024). Tags: Attribute, Descriptive. Code available.
Targeted View-Invariant Adversarial Perturbations for 3D Object Recognition (Dec 17, 2024). Tags: 3D Object Recognition, Adversarial Robustness. Code available.
Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images (Dec 17, 2024). Tags: Computational Efficiency, Object.