Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Mar 27, 2025 3D Semantic Segmentation Autonomous Driving
Code Code Available 25 Towards Open Vocabulary Learning: A Survey Jun 28, 2023 Open Set Learning Out-of-Distribution Detection
Code Code Available 25 Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding Nov 4, 2020 Multi-Task Learning Scene Understanding
Code Code Available 25 VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment Jan 3, 2025 Computational Efficiency Scene Understanding
Code Code Available 25 Volumetric Environment Representation for Vision-Language Navigation Mar 21, 2024 3D geometry Multi-Task Learning
Code Code Available 25 AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Dec 19, 2024 Autonomous Driving Benchmarking
Code Code Available 25 HAKE: A Knowledge Engine Foundation for Human Activity Understanding Feb 14, 2022 Action Recognition Human-Object Interaction Detection
Code Code Available 25 Grounded 3D-LLM with Referent Tokens May 16, 2024 Dense Captioning Diversity
Code Code Available 25 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Oct 17, 2024 3D Semantic Segmentation Image Generation
Code Code Available 25 Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion Jul 8, 2025 3D geometry Domain Generalization
Code Code Available 25 GroupViT: Semantic Segmentation Emerges from Text Supervision Feb 22, 2022 Object Detection Scene Understanding
Code Code Available 25 Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning Mar 1, 2025 Scene Understanding
Code Code Available 25 NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM Feb 16, 2025 Navigate RAG
Code Code Available 25 RelationField: Relate Anything in Radiance Fields Dec 18, 2024 3d scene graph generation Graph Generation
Code Code Available 25 Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding Apr 14, 2023 3D Object Detection Scene Understanding
Code Code Available 25 Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting Sep 19, 2024 Scene Understanding Semantic Segmentation
Code Code Available 25 Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks Feb 17, 2023 Deblurring Deep Learning
Code Code Available 15 Generating Visual Spatial Description via Holistic 3D Scene Understanding May 19, 2023 Scene Understanding Text Generation
Code Code Available 15 GFF: Gated Fully Fusion for Semantic Segmentation Apr 3, 2019 Scene Parsing Scene Understanding
Code Code Available 15 3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding Jan 6, 2024 Scene Understanding Visual Question Answering (VQA)
Code Code Available 15 A Review of Panoptic Segmentation for Mobile Mapping Point Clouds Apr 27, 2023 Instance Segmentation Panoptic Segmentation
Code Code Available 15 Advances in Deep Concealed Scene Understanding Apr 21, 2023 Scene Understanding Semantic Segmentation
Code Code Available 15 DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization Aug 24, 2021 Diversity Graph Neural Network
Code Code Available 15 General Geometry-aware Weakly Supervised 3D Object Detection Jul 18, 2024 3D Object Detection Object
Code Code Available 15 Global Aggregation then Local Distribution in Fully Convolutional Networks Sep 16, 2019 Instance Segmentation object-detection
Code Code Available 15 Deep learning for radar data exploitation of autonomous vehicle Mar 15, 2022 Autonomous Driving Deep Learning
Code Code Available 15 DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Apr 16, 2025 Few-Shot Learning Interactive Segmentation
Code Code Available 15 F-ViTA: Foundation Model Guided Visible to Thermal Translation Apr 3, 2025 Scene Understanding Style Transfer
Code Code Available 15 DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion Sep 18, 2024 Infrared And Visible Image Fusion Scene Understanding
Code Code Available 15 Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset Jul 27, 2021 Scene Text Recognition Scene Understanding
Code Code Available 15 From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Code Code Available 15 CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes Jul 1, 2024 Autonomous Vehicles Image Segmentation
Code Code Available 15 FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions Oct 4, 2022 Depth Estimation Monocular Depth Estimation
Code Code Available 15 DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames Nov 1, 2019 Autonomous Navigation GPU
Code Code Available 15 Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding Jan 5, 2019 Domain Adaptation Scene Understanding
Code Code Available 15 DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects Mar 27, 2018 General Classification Object
Code Code Available 15 From General to Specific: Informative Scene Graph Generation via Balance Adjustment Aug 30, 2021 Blocking Graph Generation
Code Code Available 15 Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding Jan 28, 2022 Graph Attention Knowledge Distillation
Code Code Available 15 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments May 23, 2024 Pose Estimation Scene Understanding
Code Code Available 15 Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild Jul 23, 2020 Few-Shot Object Detection Meta-Learning
Code Code Available 15 CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation Jul 19, 2023 Representation Learning Scene Understanding
Code Code Available 15 OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge May 31, 2019 object-detection Object Detection
Code Code Available 15 A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 15 FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 15 Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Code Code Available 15 A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Mar 10, 2025 Object Scene Understanding
Code Code Available 15 Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 15 Context Prior for Scene Segmentation Apr 3, 2020 Scene Segmentation Scene Understanding
Code Code Available 15 3DRM:Pair-wise relation module for 3D object detection Feb 20, 2022 3D Object Detection Object
Code Code Available 15 Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 15