One-Shot Object Affordance Detection in the Wild Aug 8, 2021 Action Recognition Affordance Detection
Code Code Available 15 Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments Mar 27, 2023 Autonomous Navigation Real-Time Semantic Segmentation
Code Code Available 15 You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding Mar 26, 2023 3D Instance Segmentation Instance Segmentation
Code Code Available 15 Online 3D reconstruction and dense tracking in endoscopic videos Sep 9, 2024 3D Reconstruction 3D Scene Reconstruction
Code Code Available 15 ReorientBot: Learning Object Reorientation for Specific-Posed Placement Feb 22, 2022 Motion Generation Motion Planning
Code Code Available 15 CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks Mar 28, 2020 3D Medical Imaging Segmentation Action Recognition
Code Code Available 15 Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 15 Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance Dec 17, 2023 3D Instance Segmentation 3D Open-Vocabulary Instance Segmentation
Code Code Available 15 Egocentric Scene Understanding via Multimodal Spatial Rectifier Jul 14, 2022 Scene Understanding Surface Normal Estimation
Code Code Available 15 Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding Apr 16, 2020 Human Part Segmentation Panoptic Segmentation
Code Code Available 15 FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 15 Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis Mar 9, 2021 3d scene graph generation graph construction
Code Code Available 15 Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Code Code Available 15 Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 15 Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments Jul 15, 2023 Decoder Grounded Situation Recognition
Code Code Available 15 ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data Nov 17, 2021 3D Object Detection object-detection
Code Code Available 15 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding Jan 14, 2025 Language Modeling Language Modelling
Code Code Available 15 OvarNet: Towards Open-vocabulary Object Attribute Recognition Jan 23, 2023 Attribute Knowledge Distillation
Code Code Available 15 Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene Aug 11, 2020 Instance Segmentation Point Cloud Segmentation
Code Code Available 15 Who2com: Collaborative Perception via Learnable Handshake Communication Mar 21, 2020 Multi-agent Reinforcement Learning Reinforcement Learning
Code Code Available 15 Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation Dec 24, 2021 Depth Estimation Depth Prediction
Code Code Available 15 Explainable Object-induced Action Decision for Autonomous Vehicles Mar 20, 2020 Autonomous Driving Autonomous Vehicles
Code Code Available 15 PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction Apr 16, 2024 3D Reconstruction 3D Shape Reconstruction
Code Code Available 15 Panoptic 3D Scene Reconstruction From a Single RGB Image Nov 3, 2021 2D Panoptic Segmentation 3D Instance Segmentation
Code Code Available 15 RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving Jan 14, 2024 Autonomous Driving Benchmarking
Code Code Available 15 Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation Aug 9, 2022 3D Instance Segmentation 3D Part Segmentation
Code Code Available 15 Panoptic Video Scene Graph Generation Nov 28, 2023 Graph Generation Panoptic Scene Graph Generation
Code Code Available 15 Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning Jun 21, 2022 Contrastive Learning Domain Generalization
Code Code Available 15 Towards Efficient Scene Understanding via Squeeze Reasoning Nov 6, 2020 Instance Segmentation object-detection
Code Code Available 15 Predicting Deeper into the Future of Semantic Segmentation Mar 22, 2017 Attribute Autonomous Driving
Code Code Available 05 Category-level Neural Field for Reconstruction of Partially Observed Objects in Indoor Environment Jun 12, 2024 3D Reconstruction Scene Understanding
Code Code Available 05 Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding Apr 20, 2025 Autonomous Driving Image Captioning
Code Code Available 05 Pose-aware Multi-level Feature Network for Human Object Interaction Detection Sep 18, 2019 Human-Object Interaction Detection Object
Code Code Available 05 Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models Apr 6, 2025 Computational Efficiency General Knowledge
Code Code Available 05 Evaluating Compositional Scene Understanding in Multimodal Generative Models Mar 29, 2025 Scene Understanding
Code Code Available 05 A Review on Deep Learning Techniques Applied to Semantic Segmentation Apr 22, 2017 Autonomous Driving Deep Learning
Code Code Available 05 ERFNet: Efficient Residual Factorized ConvNet for Real-time Semantic Segmentation Oct 9, 2017 GPU Real-Time Semantic Segmentation
Code Code Available 05 Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video May 27, 2019 Inductive Bias Model Predictive Control
Code Code Available 05 PENet: A Joint Panoptic Edge Detection Network Mar 15, 2023 Edge Detection Multi-Task Learning
Code Code Available 05 CARL-D: A vision benchmark suite and large scale dataset for vehicle detection and scene segmentation Feb 17, 2022 2D Object Detection Autonomous Driving
Code Code Available 05 Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding Oct 19, 2024 Autonomous Driving object-detection
Code Code Available 05 Parsing Geometry Using Structure-Aware Shape Templates Aug 3, 2018 Object Object Recognition
Code Code Available 05 Parsing Natural Scenes and Natural Language with Recursive Neural Networks Jun 1, 2011 General Classification Scene Classification
Code Code Available 05 Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes Aug 18, 2021 Camera Calibration Depth Estimation
Code Code Available 05 PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video Jan 1, 2024 3D Panoptic Segmentation 3D Reconstruction
Code Code Available 05 OVeNet: Offset Vector Network for Semantic Segmentation Mar 25, 2023 Optical Character Recognition (OCR) Scene Understanding
Code Code Available 05 OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Dec 31, 2024 3DGS 3D Semantic Segmentation
Code Code Available 05 OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Jul 10, 2025 Scene Understanding Spatial Reasoning
Code Code Available 05 P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Oct 23, 2023 Autonomous Driving Decoder
Code Code Available 05 Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing Dec 24, 2024 Autonomous Driving Autonomous Racing
Code Code Available 05