Robust Multi-Modal Image Stitching for Improved Scene Understanding | Dec 28, 2023 | Image Stitching, Scene Understanding | Unverified
Cloud-Device Collaborative Learning for Multimodal Large Language Models | Dec 26, 2023 | Device-Cloud Collaboration, Knowledge Distillation | Unverified
BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions | Dec 21, 2023 | Decoder, Multi-Task Learning | Unverified
Object Attribute Matters in Visual Question Answering | Dec 20, 2023 | Attribute, Graph Neural Network | Code Available
AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model | Dec 20, 2023 | Autonomous Driving, Scene Understanding | Unverified
Language-Assisted 3D Scene Understanding | Dec 18, 2023 | 3D Object Detection, 3D Semantic Segmentation | Unverified
Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment | Dec 15, 2023 | 3D visual grounding, Natural Language Queries | Unverified
Dietary Assessment with Multimodal ChatGPT: A Systematic Analysis | Dec 14, 2023 | Image Captioning, Scene Understanding | Unverified
Zoom in on the Plant: Fine-grained Analysis of Leaf, Stem and Vein Instances | Dec 14, 2023 | Scene Understanding | Code Available
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding | Dec 14, 2023 | Scene Understanding, Transfer Learning | Unverified
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer | Dec 12, 2023 | Action Recognition, Action Segmentation | Code Available
Spatiotemporal Event Graphs for Dynamic Scene Understanding | Dec 11, 2023 | Action Detection, Activity Detection | Unverified
Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection | Dec 11, 2023 | Benchmarking, Domain Adaptation | Unverified
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding | Dec 11, 2023 | Diversity, Scene Understanding | Unverified
Prospective Role of Foundation Models in Advancing Autonomous Vehicles | Dec 8, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified
IGFNet: Illumination-Guided Fusion Network for Semantic Scene Understanding using RGB-Thermal Images | Dec 4, 2023 | Autonomous Driving, Scene Understanding | Code Available
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | Dec 3, 2023 | Active Learning, Instance Segmentation | Unverified
Segment Any 3D Gaussians | Dec 1, 2023 | Interactive Segmentation, Scene Understanding | Unverified
HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos | Nov 28, 2023 | Graph Generation, Scene Graph Generation | Unverified
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | Nov 28, 2023 | Clustering, Diversity | Unverified
REACT: Recognize Every Action Everywhere All At Once | Nov 27, 2023 | Action Recognition, Activity Recognition | Unverified
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding | Nov 27, 2023 | Continual Learning, Continual Semantic Segmentation | Unverified
Multi-task Planar Reconstruction with Feature Warping Guidance | Nov 25, 2023 | 3D Reconstruction, Instance Segmentation | Code Available
GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction | Nov 24, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding | Nov 20, 2023 | Instance Segmentation, NeRF | Unverified
SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles | Nov 20, 2023 | Change Detection, Motion Planning | Unverified
Two Stream Scene Understanding on Graph Embedding | Nov 12, 2023 | Graph Attention, Graph Embedding | Unverified
Continual Learning of Unsupervised Monocular Depth from Videos | Nov 4, 2023 | Autonomous Driving, Continual Learning | Code Available
Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation | Nov 3, 2023 | 3D Semantic Segmentation, Point Cloud Segmentation | Unverified
Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture | Nov 1, 2023 | 3D Object Reconstruction, 3D Reconstruction | Unverified
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation | Oct 24, 2023 | Autonomous Driving, Scene Understanding | Unverified
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation | Oct 23, 2023 | Autonomous Driving, Decoder | Code Available
Panoptic Out-of-Distribution Segmentation | Oct 18, 2023 | Data Augmentation, Instance Segmentation | Unverified
S4C: Self-Supervised Semantic Scene Completion with Neural Fields | Oct 11, 2023 | Image Segmentation, Navigate | Unverified
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models | Oct 10, 2023 | Object, Object Tracking | Unverified
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions | Oct 10, 2023 | Graph Generation, Panoptic Scene Graph Generation | Unverified
DualMLP: a two-stream fusion model for 3D point cloud classification | Oct 10, 2023 | 3D Point Cloud Classification, Point Cloud Classification | Code Available
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Oct 2, 2023 | Benchmarking, Continual Learning | Code Available
Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception | Oct 2, 2023 | Autonomous Driving, Image Segmentation | Unverified
Logical Bias Learning for Object Relation Prediction | Oct 1, 2023 | Causal Inference, Decision Making | Unverified
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction | Sep 27, 2023 | Graph Learning, Prediction | Unverified
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding | Sep 26, 2023 | Scene Understanding, Simultaneous Localization and Mapping | Unverified
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset | Sep 21, 2023 | Autonomous Vehicles, Depth Estimation | Unverified
LLMR: Real-time Prompting of Interactive Worlds using Large Language Models | Sep 21, 2023 | Language Modeling, Language Modelling | Unverified
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | Sep 21, 2023 | Action Localization, Action Recognition | Unverified
Shape Anchor Guided Holistic Indoor Scene Understanding | Sep 20, 2023 | 3D Object Detection, object-detection | Code Available
PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding | Sep 18, 2023 | Data Augmentation, Diversity | Unverified
So you think you can track? | Sep 13, 2023 | Benchmarking, Object | Unverified
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning | Sep 12, 2023 | Autonomous Vehicles, Question Answering | Unverified
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving | Sep 12, 2023 | Autonomous Driving, Benchmarking | Unverified