Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding Dec 22, 2022 Multi-Label Classification MUlTI-LABEL-ClASSIFICATION
Code Code Available 05 MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification Jul 25, 2019 Autonomous Vehicles Classification
Code Code Available 05 Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond Aberrations Nov 21, 2022 Domain Adaptation Scene Understanding
Code Code Available 05 General-Purpose Deep Point Cloud Feature Extractor Mar 12, 2018 3D Object Classification 3D Point Cloud Classification
Code Code Available 05 Attend, Infer, Repeat: Fast Scene Understanding with Generative Models Mar 28, 2016 Scene Understanding
Code Code Available 05 MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization Nov 26, 2018 2D Object Detection 3D Object Detection
Code Code Available 05 Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis Jun 28, 2023 Scene Understanding
Code Code Available 05 Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud Mar 23, 2019 3D Object Detection Depth Estimation
Code Code Available 05 MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking Apr 9, 2025 Autonomous Driving Language Modeling
Code Code Available 05 Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation Mar 3, 2021 Autonomous Driving Depth Estimation
Code Code Available 05 Model-based inexact graph matching on top of CNNs for semantic scene understanding Jan 18, 2023 Brain Segmentation Deep Learning
Code Code Available 05 Modeling Expectation Violation in Intuitive Physics with Coarse Probabilistic Object Representations Dec 1, 2019 Scene Understanding
Code Code Available 05 Gated Driver Attention Predictor Aug 1, 2023 Driver Attention Monitoring Prediction
Code Code Available 05 Gated2Depth: Real-time Dense Lidar from Gated Images Feb 13, 2019 Scene Understanding
Code Code Available 05 MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities Aug 14, 2020 Representation Learning Scene Understanding
Code Code Available 05 MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios Dec 27, 2024 Autonomous Driving Language Modeling
Code Code Available 05 GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation Oct 2, 2022 Scene Understanding Segmentation
Code Code Available 05 MGNiceNet: Unified Monocular Geometric Scene Understanding Nov 18, 2024 Autonomous Driving Autonomous Vehicles
Code Code Available 05 Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange Apr 11, 2024 Object Scene Understanding
Code Code Available 05 METEOR Guided Divergence for Video Captioning Dec 20, 2022 Hierarchical Reinforcement Learning Scene Understanding
Code Code Available 05 Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory Jul 4, 2021 Question Answering Scene Understanding
Code Code Available 05 MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Nov 16, 2024 Depth Estimation Monocular Depth Estimation
Code Code Available 05 On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption Sep 2, 2020 Scene Understanding Segmentation
Code Code Available 05 FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild Jan 8, 2024 Language Modelling Large Language Model
Code Code Available 05 m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks Aug 23, 2020 Anatomy Data Augmentation
Code Code Available 05 COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images Jan 26, 2016 Diversity General Classification
Code Code Available 05 From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection Feb 15, 2022 Generalized Zero-Shot Object Detection Scene Understanding
Code Code Available 05 Loss Distillation via Gradient Matching for Point Cloud Completion with Weighted Chamfer Distance Sep 10, 2024 Bilevel Optimization Point Cloud Completion
Code Code Available 05 From Feature Importance to Natural Language Explanations Using LLMs with RAG Jul 30, 2024 counterfactual Counterfactual Reasoning
Code Code Available 05 CNN-based Lidar Point Cloud De-Noising in Adverse Weather Dec 9, 2019 Autonomous Vehicles Scene Understanding
Code Code Available 05 Loss Switching Fusion with Similarity Search for Video Classification Jun 27, 2019 Classification Clustering
Code Code Available 05 LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition Nov 27, 2024 Action Recognition Graph Attention
Code Code Available 05 LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics Apr 16, 2018 Navigate Scene Understanding
Code Code Available 05 FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding Apr 4, 2023 Autonomous Driving Domain Adaptation
Code Code Available 05 Lightweight integration of 3D features to improve 2D image segmentation Dec 16, 2022 Image Segmentation Scene Understanding
Code Code Available 05 Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video May 27, 2019 Inductive Bias Model Predictive Control
Code Code Available 05 Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning Aug 1, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 05 Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding Apr 18, 2025 Deep Learning Point Cloud Completion
Code Code Available 05 FlowGrad: Using Motion for Visual Sound Source Localization Nov 15, 2022 Optical Flow Estimation Scene Understanding
Code Code Available 05 Flow-based GAN for 3D Point Cloud Generation from a Single Image Oct 8, 2022 Point Cloud Generation Scene Understanding
Code Code Available 05 Aerial Scene Understanding in The Wild: Multi-Scene Recognition via Prototype-based Memory Networks Apr 22, 2021 Retrieval Scene Recognition
Code Code Available 05 Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation Apr 12, 2018 Optical Flow Estimation Scene Flow Estimation
Code Code Available 05 Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation May 30, 2023 Graph Generation Image Generation
Code Code Available 05 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Jul 28, 2024 Contrastive Learning Intention-oriented Segmentation
Code Code Available 05 Matterport3D: Learning from RGB-D Data in Indoor Environments Sep 18, 2017 General Classification Scene Understanding
Code Code Available 05 Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings Jun 24, 2022 Scene Understanding Semantic Segmentation
Code Code Available 05 Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors May 30, 2025 3D geometry Large Language Model
Code Code Available 05 Learning Monocular Depth by Distilling Cross-domain Stereo Networks Aug 20, 2018 Autonomous Driving Depth Estimation
Code Code Available 05 Learning Panoptic Segmentation from Instance Contours Oct 16, 2020 Clustering Instance Segmentation
Code Code Available 05 Language-based Colorization of Scene Sketches Nov 17, 2019 Colorization Image Generation
Code Code Available 05