TransAVS: End-To-End Audio-Visual Segmentation With Transformer | May 12, 2023 | Scene Understanding, Segmentation | Unverified | 0
Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond | May 11, 2023 | Scene Understanding | Code Available | 1
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs | May 10, 2023 | Scene Understanding, Visual Reasoning | Unverified | 0
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding | May 8, 2023 | Prediction, Scene Understanding | Unverified | 0
Living in a Material World: Learning Material Properties from Full-Waveform Flash Lidar Data for Semantic Segmentation | May 7, 2023 | Scene Understanding, Semantic Segmentation | Unverified | 0
Learning-based Relational Object Matching Across Views | May 3, 2023 | Graph Neural Network, Image Retrieval | Unverified | 0
ArK: Augmented Reality with Knowledge Interactive Emergent Ability | May 1, 2023 | AI Agent, Mixed Reality | Unverified | 0
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding | May 1, 2023 | 3D Object Detection, Monocular Depth Estimation | Code Available | 2
DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization | Apr 30, 2023 | Decoder, NeRF | Code Available | 1
Neural Implicit Dense Semantic SLAM | Apr 27, 2023 | 3D geometry, Scene Understanding | Unverified | 0
A Review of Panoptic Segmentation for Mobile Mapping Point Clouds | Apr 27, 2023 | Instance Segmentation, Panoptic Segmentation | Code Available | 1
Compositional 3D Human-Object Neural Animation | Apr 27, 2023 | Human-Object Interaction Detection, NeRF | Unverified | 0
ZRG: A Dataset for Multimodal 3D Residential Rooftop Understanding | Apr 26, 2023 | Scene Understanding | Unverified | 0
RGB-D Indiscernible Object Counting in Underwater Scenes | Apr 23, 2023 | Benchmarking, Depth Estimation | Code Available | 1
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation | Apr 22, 2023 | Autonomous Driving, Knowledge Distillation | Code Available | 1
Advances in Deep Concealed Scene Understanding | Apr 21, 2023 | Scene Understanding, Semantic Segmentation | Code Available | 1
Factored Neural Representation for Scene Understanding | Apr 21, 2023 | Novel View Synthesis, Object | Unverified | 0
RS2G: Data-Driven Scene-Graph Extraction and Embedding for Robust Autonomous Perception and Scenario Understanding | Apr 17, 2023 | Autonomous Vehicles, Graph Learning | Code Available | 1
360° High-Resolution Depth Estimation via Uncertainty-aware Structural Knowledge Transfer | Apr 17, 2023 | Depth Estimation, Monocular Depth Estimation | Unverified | 0
Learning How To Robustly Estimate Camera Pose in Endoscopic Videos | Apr 17, 2023 | 3D Reconstruction, Camera Pose Estimation | Code Available | 1
STRAP: Structured Object Affordance Segmentation with Point Supervision | Apr 17, 2023 | Object, Scene Understanding | Code Available | 1
ViPLO: Vision Transformer based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection | Apr 17, 2023 | Human-Object Interaction Detection, Quantization | Code Available | 1
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding | Apr 14, 2023 | 3D Object Detection, Scene Understanding | Code Available | 2
iDisc: Internal Discretization for Monocular Depth Estimation | Apr 13, 2023 | Autonomous Driving, Depth Estimation | Code Available | 3
Graph-based Topology Reasoning for Driving Scenes | Apr 11, 2023 | 3D Lane Detection, Autonomous Driving | Code Available | 2
Semantic Segmentation with High Inference Speed in Off-Road Environments | Apr 10, 2023 | 2D Semantic Segmentation, Autonomous Vehicles | Code Available | 0
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation | Apr 10, 2023 | Panoptic Segmentation, Scene Understanding | Unverified | 0
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Apr 4, 2023 | Autonomous Driving, Domain Adaptation | Code Available | 0
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding | Apr 3, 2023 | Contrastive Learning, Instance Segmentation | Code Available | 2
Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings | Mar 30, 2023 | Object, Scene Understanding | Unverified | 0
Complementary Random Masking for RGB-Thermal Semantic Segmentation | Mar 30, 2023 | Scene Understanding, Semantic Segmentation | Code Available | 1
DPF: Learning Dense Prediction Fields with Weak Supervision | Mar 29, 2023 | Intrinsic Image Decomposition, Prediction | Code Available | 1
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation | Mar 28, 2023 | Panoptic Scene Graph Generation, Scene Graph Generation | Code Available | 1
Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments | Mar 27, 2023 | Autonomous Navigation, Real-Time Semantic Segmentation | Code Available | 1
You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding | Mar 26, 2023 | 3D Instance Segmentation, Instance Segmentation | Code Available | 1
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Mar 25, 2023 | Domain Adaptation, ERP | Unverified | 0
Viewpoint Equivariance for Multi-View 3D Object Detection | Mar 25, 2023 | 3D Object Detection, Object | Code Available | 1
OVeNet: Offset Vector Network for Semantic Segmentation | Mar 25, 2023 | Optical Character Recognition (OCR), Scene Understanding | Code Available | 0
Self-distillation for surgical action recognition | Mar 22, 2023 | Action Recognition, Medical Image Analysis | Code Available | 1
Uni-Fusion: Universal Continuous Mapping | Mar 22, 2023 | Scene Understanding | Unverified | 0
Semantic segmentation of surgical hyperspectral images under geometric domain shifts | Mar 20, 2023 | Organ Segmentation, Scene Segmentation | Unverified | 0
Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor Localization | Mar 20, 2023 | 3D Object Detection, Indoor Localization | Code Available | 1
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition | Mar 20, 2023 | Retrieval, Scene Understanding | Code Available | 2
Content Adaptive Front End For Audio Classification | Mar 18, 2023 | Audio Classification, Audio Signal Processing | Unverified | 0
Efficient Computation Sharing for Multi-Task Visual Scene Understanding | Mar 16, 2023 | Multi-Task Learning, Scene Understanding | Code Available | 0
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery | Mar 16, 2023 | Scene Understanding | Unverified | 0
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving | Mar 16, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 3
PENet: A Joint Panoptic Edge Detection Network | Mar 15, 2023 | Edge Detection, Multi-Task Learning | Code Available | 0
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Mar 14, 2023 | 3D Object Detection, Decoder | Code Available | 1
Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast | Mar 11, 2023 | 3D Semantic Segmentation, Contrastive Learning | Unverified | 0