Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild Jul 23, 2020 Few-Shot Object Detection Meta-Learning
Code Code Available 1IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving Jun 1, 2020 3D Object Detection Autonomous Driving
Code Code Available 1Image Segmentation Using Deep Learning: A Survey Jan 15, 2020 Decoder Deep Learning
Code Code Available 1Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping Apr 1, 2024 image-classification Image Classification
Code Code Available 1BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments Sep 22, 2020 Domain Adaptation Scene Understanding
Code Code Available 1Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation Dec 24, 2021 Depth Estimation Depth Prediction
Code Code Available 1FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 1Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Nov 29, 2024 3D geometry 3DGS
Code Code Available 1AVSegFormer: Audio-Visual Segmentation with Transformer Jul 3, 2023 Decoder Scene Understanding
Code Code Available 1FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 1A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Mar 10, 2025 Object Scene Understanding
Code Code Available 1Boundary-induced and scene-aggregated network for monocular depth prediction Feb 26, 2021 Depth Estimation Depth Prediction
Code Code Available 1KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D Sep 28, 2021 Multiple Object Tracking Novel View Synthesis
Code Code Available 1Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation Apr 22, 2023 Autonomous Driving Knowledge Distillation
Code Code Available 1OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge May 31, 2019 object-detection Object Detection
Code Code Available 1Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding Nov 30, 2023 GPU Inductive Bias
Code Code Available 1Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 1Learning and Reasoning with the Graph Structure Representation in Robotic Surgery Jul 7, 2020 Edge Classification Graph Generation
Code Code Available 1Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Code Code Available 1Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Code Code Available 1Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views Nov 13, 2021 Object Scene Understanding
Code Code Available 1Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection Dec 19, 2022 3D Object Detection Knowledge Distillation
Code Code Available 1LED: Light Enhanced Depth Estimation at Night Sep 12, 2024 Autonomous Driving Decoder
Code Code Available 1Leveraging Large (Visual) Language Models for Robot 3D Scene Understanding Sep 12, 2022 Common Sense Reasoning Scene Classification
Code Code Available 1Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 1Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering Jun 4, 2021 Meta-Learning Scene Understanding
Code Code Available 1CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks Mar 28, 2020 3D Medical Imaging Segmentation Action Recognition
Code Code Available 1Explainable Object-induced Action Decision for Autonomous Vehicles Mar 20, 2020 Autonomous Driving Autonomous Vehicles
Code Code Available 1LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Apr 30, 2025 In-Context Learning Object
Code Code Available 1CamContextI2V: Context-aware Controllable Video Generation Apr 8, 2025 Diversity Scene Understanding
Code Code Available 1Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis Mar 9, 2021 3d scene graph generation graph construction
Code Code Available 1A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 1FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation Mar 1, 2021 3D Semantic Segmentation Decoder
Code Code Available 1Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene Aug 11, 2020 Instance Segmentation Point Cloud Segmentation
Code Code Available 1Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding Apr 9, 2025 Scene Understanding Self-Supervised Learning
Code Code Available 1MassMIND: Massachusetts Maritime INfrared Dataset Sep 9, 2022 Instance Segmentation Scene Understanding
Code Code Available 1MGNet: Monocular Geometric Scene Understanding for Autonomous Driving Jun 27, 2022 Autonomous Driving Depth Estimation
Code Code Available 1Microsoft COCO: Common Objects in Context May 1, 2014 Instance Segmentation Object
Code Code Available 1Advances in Deep Concealed Scene Understanding Apr 21, 2023 Scene Understanding Semantic Segmentation
Code Code Available 1CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery Jul 11, 2023 Question Answering Scene Understanding
Code Code Available 1All-Day Multi-Camera Multi-Target Tracking Jan 1, 2025 All Mamba
Code Code Available 1Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model Oct 25, 2020 Depth Estimation Depth Prediction
Code Code Available 1Estimating Generic 3D Room Structures from 2D Annotations Jun 15, 2023 Scene Understanding
Code Code Available 1Monte Carlo Scene Search for 3D Scene Understanding Mar 14, 2021 Scene Understanding
Code Code Available 13DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding Jan 6, 2024 Scene Understanding Visual Question Answering (VQA)
Code Code Available 1MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Jul 2, 2024 Boundary Detection Human Parsing
Code Code Available 1Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation Oct 30, 2020 Instance Segmentation Panoptic Segmentation
Code Code Available 1Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation Sep 20, 2021 Decoder Prediction
Code Code Available 1Event-aided Semantic Scene Completion Feb 4, 2025 Autonomous Driving Scene Understanding
Code Code Available 1Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups Jan 12, 2021 Scene Understanding
Code Code Available 1