Simple Image-level Classification Improves Open-vocabulary Object Detection Dec 16, 2023 Knowledge Distillation Object
Code Code Available 1Transformers in Unsupervised Structure-from-Motion Dec 16, 2023 Decision Making image-classification
Code Code Available 1Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments Dec 14, 2023 3D Reconstruction Decoder
Code Code Available 1Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection Dec 5, 2023 3D Object Detection Denoising
Code Code Available 1Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding Nov 30, 2023 GPU Inductive Bias
Code Code Available 1SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation Nov 29, 2023 Scene Segmentation Scene Understanding
Code Code Available 1Panoptic Video Scene Graph Generation Nov 28, 2023 Graph Generation Panoptic Scene Graph Generation
Code Code Available 1Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge Nov 21, 2023 Large Language Model Multimodal Deep Learning
Code Code Available 1TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding Nov 6, 2023 Boundary Detection Depth Estimation
Code Code Available 1NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment Nov 5, 2023 Caption Generation Common Sense Reasoning
Code Code Available 1TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain Nov 1, 2023 Contrastive Learning Image-to-Image Translation
Code Code Available 1Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving Oct 3, 2023 Autonomous Driving Decision Making
Code Code Available 1TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation Oct 3, 2023 Autonomous Driving Scene Understanding
Code Code Available 1Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms Sep 27, 2023 object-detection Object Detection
Code Code Available 1PanopticNDT: Efficient and Robust Panoptic Mapping Sep 24, 2023 2D Panoptic Segmentation 3D Panoptic Segmentation
Code Code Available 1LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data Sep 19, 2023 Anomaly Detection Autonomous Driving
Code Code Available 1Mask4D: End-to-End Mask-Based 4D Panoptic Segmentation for LiDAR Sequences Sep 18, 2023 3D Panoptic Segmentation 4D Panoptic Segmentation
Code Code Available 1HOC-Search: Efficient CAD Model and Pose Retrieval from RGB-D Scans Sep 12, 2023 3D Object Retrieval 3D Scene Reconstruction
Code Code Available 1Multi3DRefer: Grounding Text Description to Multiple 3D Objects Sep 11, 2023 3D visual grounding Contrastive Learning
Code Code Available 1Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning Sep 6, 2023 3D dense captioning Caption Generation
Code Code Available 1Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition Aug 23, 2023 Gesture Recognition Scene Understanding
Code Code Available 1Understanding Dark Scenes by Contrasting Multi-Modal Observations Aug 23, 2023 Contrastive Learning Scene Understanding
Code Code Available 1SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets Aug 23, 2023 Autonomous Navigation Pseudo Label
Code Code Available 1Vision Relation Transformer for Unbiased Scene Graph Generation Aug 18, 2023 Decoder Graph Generation
Code Code Available 1FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 1OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation Jul 28, 2023 Autonomous Driving Scene Understanding
Code Code Available 1Human-centric Scene Understanding for 3D Large-scale Scenarios Jul 26, 2023 Action Recognition Scene Understanding
Code Code Available 1CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation Jul 19, 2023 Representation Learning Scene Understanding
Code Code Available 1Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments Jul 15, 2023 Decoder Grounded Situation Recognition
Code Code Available 1The IMPTC Dataset: An Infrastructural Multi-Person Trajectory and Context Dataset Jul 12, 2023 Scene Understanding
Code Code Available 1CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery Jul 11, 2023 Question Answering Scene Understanding
Code Code Available 1Towards accurate instance segmentation in large-scale LiDAR point clouds Jul 6, 2023 Clustering Instance Segmentation
Code Code Available 1AVSegFormer: Audio-Visual Segmentation with Transformer Jul 3, 2023 Decoder Scene Understanding
Code Code Available 1SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion Jun 27, 2023 Autonomous Driving Scene Understanding
Code Code Available 1Multi-view 3D Object Reconstruction and Uncertainty Modelling with Neural Shape Prior Jun 17, 2023 3D Object Reconstruction Object
Code Code Available 1PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation Jun 16, 2023 3D Panoptic Segmentation Autonomous Driving
Code Code Available 1Estimating Generic 3D Room Structures from 2D Annotations Jun 15, 2023 Scene Understanding
Code Code Available 1SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding Jun 8, 2023 Scene Understanding
Code Code Available 1Towards Label-free Scene Understanding by Vision Foundation Models Jun 6, 2023 image-classification Image Classification
Code Code Available 1Towards In-context Scene Understanding Jun 2, 2023 Depth Estimation In-Context Learning
Code Code Available 1Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast May 31, 2023 3D Instance Segmentation 3D Object Detection
Code Code Available 1Multi-Scale Attention for Audio Question Answering May 29, 2023 Audio Question Answering Question Answering
Code Code Available 1Generating Visual Spatial Description via Holistic 3D Scene Understanding May 19, 2023 Scene Understanding Text Generation
Code Code Available 1Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Code Code Available 1Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond May 11, 2023 Scene Understanding
Code Code Available 1DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization Apr 30, 2023 Decoder NeRF
Code Code Available 1A Review of Panoptic Segmentation for Mobile Mapping Point Clouds Apr 27, 2023 Instance Segmentation Panoptic Segmentation
Code Code Available 1RGB-D Indiscernible Object Counting in Underwater Scenes Apr 23, 2023 Benchmarking Depth Estimation
Code Code Available 1Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation Apr 22, 2023 Autonomous Driving Knowledge Distillation
Code Code Available 1Advances in Deep Concealed Scene Understanding Apr 21, 2023 Scene Understanding Semantic Segmentation
Code Code Available 1