Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge Nov 21, 2023 Large Language Model Multimodal Deep Learning
Code Code Available 15 Collaborative Transformers for Grounded Situation Recognition Mar 30, 2022 Grounded Situation Recognition Image Classification
Code Code Available 15 Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments Jul 10, 2022 Instance Segmentation Panoptic Segmentation
Code Code Available 15 Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis Mar 9, 2021 3d scene graph generation graph construction
Code Code Available 15 Event-aided Semantic Scene Completion Feb 4, 2025 Autonomous Driving Scene Understanding
Code Code Available 15 Event-based Motion Segmentation with Spatio-Temporal Graph Cuts Dec 16, 2020 Motion Segmentation Scene Understanding
Code Code Available 15 A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion Jun 14, 2024 3D Reconstruction Autonomous Driving
Code Code Available 15 Context Prior for Scene Segmentation Apr 3, 2020 Scene Segmentation Scene Understanding
Code Code Available 15 Dual-Hybrid Attention Network for Specular Highlight Removal Jul 17, 2024 highlight removal Object Recognition
Code Code Available 15 Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Code Code Available 15 Dynamic Graph Message Passing Networks Aug 19, 2019 Image Classification object-detection
Code Code Available 15 Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 15 NODIS: Neural Ordinary Differential Scene Understanding Jan 14, 2020 All Graph Generation
Code Code Available 15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Jul 15, 2024 All Image Retrieval
Code Code Available 15 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments May 23, 2024 Pose Estimation Scene Understanding
Code Code Available 15 A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 15 Estimating Generic 3D Room Structures from 2D Annotations Jun 15, 2023 Scene Understanding
Code Code Available 15 Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms Sep 27, 2023 object-detection Object Detection
Code Code Available 15 ODAM: Object Detection, Association, and Mapping using Posed RGB Video Aug 23, 2021 3D Object Detection Graph Neural Network
Code Code Available 15 MSeg: A Composite Dataset for Multi-domain Semantic Segmentation Dec 27, 2021 Computational Efficiency Instance Segmentation
Code Code Available 15 AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans Mar 24, 2024 3D Instance Segmentation Instance Segmentation
Code Code Available 15 Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding Mar 20, 2025 Scene Understanding
Code Code Available 15 A Survey on Deep Learning Technique for Video Segmentation Jul 2, 2021 Autonomous Driving Deep Learning
Code Code Available 15 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Jul 2, 2024 Boundary Detection Human Parsing
Code Code Available 15 Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups Jan 12, 2021 Scene Understanding
Code Code Available 15 4D Panoptic LiDAR Segmentation Feb 24, 2021 4D Panoptic Segmentation Benchmarking
Code Code Available 15 DPF: Learning Dense Prediction Fields with Weak Supervision Mar 29, 2023 Intrinsic Image Decomposition Prediction
Code Code Available 15 Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding Jan 5, 2019 Domain Adaptation Scene Understanding
Code Code Available 15 MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based Decoders Aug 27, 2024 Decoder Mamba
Code Code Available 15 From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Code Code Available 15 All-Day Multi-Camera Multi-Target Tracking Jan 1, 2025 All Mamba
Code Code Available 15 CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery Jul 11, 2023 Question Answering Scene Understanding
Code Code Available 15 A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence Jun 22, 2020 Deep Learning Scene Understanding
Code Code Available 15 DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion Sep 18, 2024 Infrared And Visible Image Fusion Scene Understanding
Code Code Available 15 MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud Jul 28, 2022 Scene Understanding
Code Code Available 15 A Survey of World Models for Autonomous Driving Jan 20, 2025 Anomaly Detection Autonomous Driving
Code Code Available 15 Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality Mar 11, 2021 Scene Understanding Time Series
Code Code Available 15 Monte Carlo Scene Search for 3D Scene Understanding Mar 14, 2021 Scene Understanding
Code Code Available 15 Distilled Semantics for Comprehensive Scene Understanding from Videos Mar 31, 2020 Depth Estimation Knowledge Distillation
Code Code Available 15 DIP: Unsupervised Dense In-Context Post-training of Visual Representations Jun 23, 2025 GPU Meta-Learning
Code Code Available 15 AeroRIT: A New Scene for Hyperspectral Image Analysis Dec 17, 2019 Hyperspectral image analysis Image Super-Resolution
Code Code Available 15 General Geometry-aware Weakly Supervised 3D Object Detection Jul 18, 2024 3D Object Detection Object
Code Code Available 15 DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection Dec 25, 2023 3D Object Detection object-detection
Code Code Available 15 Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model Oct 25, 2020 Depth Estimation Depth Prediction
Code Code Available 15 Digging Into Self-Supervised Monocular Depth Estimation Jun 4, 2018 Camera Pose Estimation Depth Estimation
Code Code Available 15 GFF: Gated Fully Fusion for Semantic Segmentation Apr 3, 2019 Scene Parsing Scene Understanding
Code Code Available 15 Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding Jan 28, 2022 Graph Attention Knowledge Distillation
Code Code Available 15 DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction May 9, 2024 Contrastive Learning Scene Understanding
Code Code Available 15 CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP Jan 12, 2023 3D Semantic Segmentation Contrastive Learning
Code Code Available 15 Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation Dec 22, 2021 Common Sense Reasoning Question Answering
Code Code Available 15