Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation Aug 9, 2022 3D Instance Segmentation 3D Part Segmentation
Code Code Available 1TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation Aug 3, 2022 Answer Generation Question-Answer-Generation
Code Code Available 1AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy Aug 3, 2022 Anatomy motion prediction
— Unverified 0Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer Jul 28, 2022 Autonomous Driving Autonomous Vehicles
Code Code Available 2MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud Jul 28, 2022 Scene Understanding
Code Code Available 1CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving Jul 26, 2022 3D Semantic Segmentation Autonomous Driving
Code Code Available 1CompNVS: Novel View Synthesis with Scene Completion Jul 23, 2022 Novel View Synthesis Scene Understanding
— Unverified 0Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models Jul 23, 2022 Scene Understanding
Code Code Available 1Panoptic Scene Graph Generation Jul 22, 2022 Benchmarking Panoptic Scene Graph Generation
Code Code Available 2Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization Jul 22, 2022 3D Instance Segmentation 3D Object Detection
Code Code Available 1Neural Groundplans: Persistent Neural Scene Representations from a Single Image Jul 22, 2022 Disentanglement Instance Segmentation
— Unverified 0SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany Jul 19, 2022 Image Retrieval Retrieval
— Unverified 0Egocentric Scene Understanding via Multimodal Spatial Rectifier Jul 14, 2022 Scene Understanding Surface Normal Estimation
Code Code Available 1Adversarial Attacks on Monocular Pose Estimation Jul 14, 2022 Depth Estimation Monocular Depth Estimation
Code Code Available 0Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments Jul 10, 2022 Instance Segmentation Panoptic Segmentation
Code Code Available 1BlindSpotNet: Seeing Where We Cannot See Jul 8, 2022 Depth Estimation Monocular Depth Estimation
— Unverified 0MCTS with Refinement for Proposals Selection Games in Scene Understanding Jul 7, 2022 Scene Understanding
Code Code Available 1Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases Jul 5, 2022 Object Representation Learning
— Unverified 0Distance Matters in Human-Object Interaction Detection Jul 5, 2022 Human-Object Interaction Detection Object
Code Code Available 0Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation Jul 5, 2022 Dialogue Generation Dialogue Understanding
— Unverified 0Uncertainty-aware Panoptic Segmentation Jun 29, 2022 Panoptic Segmentation Scene Understanding
Code Code Available 1MGNet: Monocular Geometric Scene Understanding for Autonomous Driving Jun 27, 2022 Autonomous Driving Depth Estimation
Code Code Available 1IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Jun 27, 2022 Autonomous Vehicles Scene Segmentation
Code Code Available 1Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings Jun 24, 2022 Scene Understanding Semantic Segmentation
Code Code Available 0Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning Jun 21, 2022 Contrastive Learning Domain Generalization
Code Code Available 1SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding Jun 21, 2022 Clustering Object Discovery
Code Code Available 0A Dynamic Data Driven Approach for Explainable Scene Understanding Jun 18, 2022 Autonomous Driving Scene Understanding
— Unverified 0On Efficient Real-Time Semantic Segmentation: A Survey Jun 17, 2022 GPU object-detection
— Unverified 0Waymo Open Dataset: Panoramic Video Panoptic Segmentation Jun 15, 2022 3D Multi-Object Tracking Autonomous Driving
— Unverified 0A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth Jun 13, 2022 Object object-detection
— Unverified 0Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion Jun 10, 2022 Autonomous Driving Domain Adaptation
Code Code Available 0Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields Jun 9, 2022 Data Augmentation Edge Detection
— Unverified 0Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding Jun 9, 2022 Common Sense Reasoning Scene Understanding
— Unverified 0Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans Jun 6, 2022 Scene Understanding
— Unverified 0A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX Jun 5, 2022 Scene Understanding
— Unverified 0Towards Improving the Generation Quality of Autoregressive Slot VAEs Jun 3, 2022 Image Generation Object
Code Code Available 0SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment Jun 1, 2022 Motion Planning Question Answering
— Unverified 0Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 1Facing the Void: Overcoming Missing Data in Multi-View Imagery May 21, 2022 Classification image-classification
Code Code Available 0Review on Panoramic Imaging and Its Applications in Scene Understanding May 11, 2022 Autonomous Driving Depth Estimation
— Unverified 0Unsupervised Discovery and Composition of Object Light Fields May 8, 2022 Novel View Synthesis Object
— Unverified 0Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects May 5, 2022 Data Augmentation Neural Rendering
— Unverified 0RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds May 2, 2022 Autonomous Driving Decoder
— Unverified 0BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery Apr 27, 2022 object-detection Object Detection
— Unverified 0SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text Apr 25, 2022 Image Retrieval Retrieval
— Unverified 0Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection Apr 25, 2022 3D Object Detection Graph structure learning
— Unverified 0Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation? Apr 23, 2022 Robot Manipulation Scene Understanding
— Unverified 0Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds Apr 22, 2022 3D dense captioning 3D Object Detection
Code Code Available 1SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints Apr 20, 2022 Autonomous Driving Scene Understanding
— Unverified 0Attention Mechanism based Cognition-level Scene Understanding Apr 17, 2022 Question Answering Scene Understanding
— Unverified 0