Matterport3D: Learning from RGB-D Data in Indoor Environments Sep 18, 2017 General Classification Scene Understanding
Code Code Available 0From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection Feb 15, 2022 Generalized Zero-Shot Object Detection Scene Understanding
Code Code Available 0CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation Jan 16, 2025 Novel View Synthesis Scene Understanding
Code Code Available 0Structured Label Inference for Visual Understanding Feb 18, 2018 Action Detection General Classification
Code Code Available 0AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding Feb 27, 2024 3D Object Detection 3D Part Segmentation
Code Code Available 0From Feature Importance to Natural Language Explanations Using LLMs with RAG Jul 30, 2024 counterfactual Counterfactual Reasoning
Code Code Available 0m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks Aug 23, 2020 Anatomy Data Augmentation
Code Code Available 0Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation Oct 31, 2018 3D Object Detection Camera Pose Estimation
Code Code Available 0Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans Dec 1, 2021 4D Panoptic Segmentation Autonomous Navigation
Code Code Available 0FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding Apr 4, 2023 Autonomous Driving Domain Adaptation
Code Code Available 0LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics Apr 16, 2018 Navigate Scene Understanding
Code Code Available 0Loss Switching Fusion with Similarity Search for Video Classification Jun 27, 2019 Classification Clustering
Code Code Available 0Loss Distillation via Gradient Matching for Point Cloud Completion with Weighted Chamfer Distance Sep 10, 2024 Bilevel Optimization Point Cloud Completion
Code Code Available 0AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning Jan 1, 2025 Audio-visual Question Answering Continual Learning
Code Code Available 0Continual Learning of Unsupervised Monocular Depth from Videos Nov 4, 2023 Autonomous Driving Continual Learning
Code Code Available 0FlowGrad: Using Motion for Visual Sound Source Localization Nov 15, 2022 Optical Flow Estimation Scene Understanding
Code Code Available 0An Information-Theoretic Metric of Transferability for Task Transfer Learning May 1, 2019 General Classification Scene Understanding
Code Code Available 0SceneAware: Scene-Constrained Pedestrian Trajectory Prediction with LLM-Guided Walkability Jun 17, 2025 Pedestrian Trajectory Prediction Scene Understanding
Code Code Available 0LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition Nov 27, 2024 Action Recognition Graph Attention
Code Code Available 0Lightweight integration of 3D features to improve 2D image segmentation Dec 16, 2022 Image Segmentation Scene Understanding
Code Code Available 0Surgical Scene Segmentation by Transformer With Asymmetric Feature Enhancement Oct 23, 2024 Anatomy Scene Segmentation
Code Code Available 0Constructing a Visual Relationship Authenticity Dataset Oct 11, 2020 Relationship Detection Scene Understanding
Code Code Available 0Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding Dec 22, 2022 Multi-Label Classification MUlTI-LABEL-ClASSIFICATION
Code Code Available 0Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond Aberrations Nov 21, 2022 Domain Adaptation Scene Understanding
Code Code Available 0Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding Apr 18, 2025 Deep Learning Point Cloud Completion
Code Code Available 0Flow-based GAN for 3D Point Cloud Generation from a Single Image Oct 8, 2022 Point Cloud Generation Scene Understanding
Code Code Available 0Scene Graph Generation from Objects, Phrases and Region Captions Jul 31, 2017 Graph Generation object-detection
Code Code Available 0Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation May 30, 2023 Graph Generation Image Generation
Code Code Available 0Auxiliary Tasks in Multi-task Learning May 16, 2018 Depth Estimation Multi-Task Learning
Code Code Available 0Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis Mar 27, 2019 Generative Adversarial Network Image Generation
Code Code Available 0Implicit Background Estimation for Semantic Segmentation May 23, 2019 Scene Understanding Segmentation
Code Code Available 0SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth Dec 15, 2016 3D Reconstruction Camera Pose Estimation
Code Code Available 0Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings Jun 24, 2022 Scene Understanding Semantic Segmentation
Code Code Available 0SceneNet: Understanding Real World Indoor Scenes With Synthetic Data Nov 22, 2015 Scene Understanding
Code Code Available 0Fast Scene Understanding for Autonomous Driving Aug 8, 2017 Autonomous Driving Decoder
Code Code Available 0Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search Jul 10, 2024 Few-Shot Learning GPU
Code Code Available 0Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory Jul 4, 2021 Question Answering Scene Understanding
Code Code Available 0Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning Aug 1, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 0Attend, Infer, Repeat: Fast Scene Understanding with Generative Models Mar 28, 2016 Scene Understanding
Code Code Available 0A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection Inference Jul 19, 2024 Autonomous Vehicles object-detection
Code Code Available 0False Negative Reduction in Video Instance Segmentation using Uncertainty Estimates Jun 28, 2021 Depth Estimation Instance Segmentation
Code Code Available 03D Semantic Segmentation of Modular Furniture using rjMCMC May 15, 2017 3D Semantic Segmentation furniture segmentation
Code Code Available 0Uncertainty-aware LiDAR Panoptic Segmentation Oct 10, 2022 Autonomous Driving Panoptic Segmentation
Code Code Available 0Facing the Void: Overcoming Missing Data in Multi-View Imagery May 21, 2022 Classification image-classification
Code Code Available 0COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images Jan 26, 2016 Diversity General Classification
Code Code Available 0CNN-based Lidar Point Cloud De-Noising in Adverse Weather Dec 9, 2019 Autonomous Vehicles Scene Understanding
Code Code Available 0AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding Aug 30, 2024 Language Modelling Large Language Model
Code Code Available 0An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions Feb 20, 2019 Autonomous Driving Scene Understanding
Code Code Available 0SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding Jun 21, 2022 Clustering Object Discovery
Code Code Available 0Extremely Fine-Grained Visual Classification over Resembling Glyphs in the Wild Aug 25, 2024 Contrastive Learning Fine-Grained Image Classification
Code Code Available 0