Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration Dec 17, 2024 audio-visual event localization audio-visual learning
ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Apr 16, 2024 3D Semantic Segmentation Management
Egocentric Scene Understanding via Multimodal Spatial Rectifier Jul 14, 2022 Scene Understanding Surface Normal Estimation
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering Jun 4, 2021 Meta-Learning Scene Understanding
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments Sep 22, 2020 Domain Adaptation Scene Understanding
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments Jul 10, 2022 Instance Segmentation Panoptic Segmentation
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model Mar 30, 2025 Depth Estimation Monocular Depth Estimation
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Nov 29, 2024 3D geometry 3DGS
3DP3: 3D Scene Perception via Probabilistic Programming Oct 30, 2021 Object Pose Estimation
Digging Into Self-Supervised Monocular Depth Estimation Jun 4, 2018 Camera Pose Estimation Depth Estimation
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Mar 10, 2025 Object Scene Understanding
Boundary-induced and scene-aggregated network for monocular depth prediction Feb 26, 2021 Depth Estimation Depth Prediction
Learning and Reasoning with the Graph Structure Representation in Robotic Surgery Jul 7, 2020 Edge Classification Graph Generation
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation Sep 20, 2021 Decoder Prediction
LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data Sep 19, 2023 Anomaly Detection Autonomous Driving
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization Aug 24, 2021 Diversity Graph Neural Network
Event-aided Semantic Scene Completion Feb 4, 2025 Autonomous Driving Scene Understanding
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving Dec 7, 2022 Autonomous Driving Instance Segmentation
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis Mar 9, 2021 3d scene graph generation graph construction
Explainable Object-induced Action Decision for Autonomous Vehicles Mar 20, 2020 Autonomous Driving Autonomous Vehicles
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
F-ViTA: Foundation Model Guided Visible to Thermal Translation Apr 3, 2025 Scene Understanding Style Transfer
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding Oct 1, 2020 Deep Learning image-classification
Joint 2D-3D-Semantic Data for Indoor Scene Understanding Feb 3, 2017 Scene Understanding
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks Mar 28, 2020 3D Medical Imaging Segmentation Action Recognition
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D Sep 28, 2021 Multiple Object Tracking Novel View Synthesis
AVSegFormer: Audio-Visual Segmentation with Transformer Jul 3, 2023 Decoder Scene Understanding
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects Mar 27, 2018 General Classification Object
IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation Dec 20, 2019 Disparity Estimation Scene Understanding
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation Apr 22, 2023 Autonomous Driving Knowledge Distillation
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene Aug 11, 2020 Instance Segmentation Point Cloud Segmentation
Monte Carlo Scene Search for 3D Scene Understanding Mar 14, 2021 Scene Understanding
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions Oct 4, 2022 Depth Estimation Monocular Depth Estimation
From General to Specific: Informative Scene Graph Generation via Balance Adjustment Aug 30, 2021 Blocking Graph Generation
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks Feb 17, 2023 Deblurring Deep Learning
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks Aug 17, 2021 3D Instance Segmentation Instance Segmentation
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering Mar 17, 2022 Implicit Relations Question Answering
Deep learning for radar data exploitation of autonomous vehicle Mar 15, 2022 Autonomous Driving Deep Learning
Instance-wise Occlusion and Depth Orders in Natural Scenes Nov 29, 2021 Depth Estimation Depth Prediction
Lane Graph Estimation for Scene Understanding in Urban Driving May 1, 2021 Autonomous Driving Autonomous Vehicles
3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding Jan 6, 2024 Scene Understanding Visual Question Answering (VQA)
All-Day Multi-Camera Multi-Target Tracking Jan 1, 2025 All Mamba
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames Nov 1, 2019 Autonomous Navigation GPU
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation Oct 30, 2020 Instance Segmentation Panoptic Segmentation
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields Apr 1, 2024 Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Apr 16, 2025 Few-Shot Learning Interactive Segmentation