Image Segmentation Using Deep Learning: A Survey | Jan 15, 2020 | Decoder Deep Learning | Code Available | 1
NODIS: Neural Ordinary Differential Scene Understanding | Jan 14, 2020 | All Graph Generation | Code Available | 1
Visual-Semantic Graph Attention Networks for Human-Object Interaction Detection | Jan 7, 2020 | Graph Attention Human-Object Interaction Detection | Code Available | 1
IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation | Dec 20, 2019 | Disparity Estimation Scene Understanding | Code Available | 1
AeroRIT: A New Scene for Hyperspectral Image Analysis | Dec 17, 2019 | Hyperspectral image analysis Image Super-Resolution | Code Available | 1
TextSLAM: Visual SLAM with Planar Text Features | Nov 26, 2019 | Object SLAM Scene Understanding | Code Available | 1
Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN | Nov 20, 2019 | 2k Generative Adversarial Network | Code Available | 1
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames | Nov 1, 2019 | Autonomous Navigation GPU | Code Available | 1
Underwater Image Super-Resolution using Deep Residual Multipliers | Sep 20, 2019 | Image Super-Resolution Scene Understanding | Code Available | 1
Global Aggregation then Local Distribution in Fully Convolutional Networks | Sep 16, 2019 | Instance Segmentation object-detection | Code Available | 1
Dynamic Graph Message Passing Networks | Aug 19, 2019 | Image Classification object-detection | Code Available | 1
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering | Aug 14, 2019 | Embodied Question Answering Question Answering | Code Available | 1
M3D-RPN: Monocular 3D Region Proposal Network for Object Detection | Jul 13, 2019 | 3D Object Detection 3D Object Detection From Monocular Images | Code Available | 1
From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network | Jul 8, 2019 | 3D Object Detection Object | Code Available | 1
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge | May 31, 2019 | object-detection Object Detection | Code Available | 1
GFF: Gated Fully Fusion for Semantic Segmentation | Apr 3, 2019 | Scene Parsing Scene Understanding | Code Available | 1
Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding | Jan 5, 2019 | Domain Adaptation Scene Understanding | Code Available | 1
Unified Perceptual Parsing for Scene Understanding | Jul 26, 2018 | 2D Semantic Segmentation Scene Understanding | Code Available | 1
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning | Jul 16, 2018 | 3d scene graph generation Graph Generation | Code Available | 1
Digging Into Self-Supervised Monocular Depth Estimation | Jun 4, 2018 | Camera Pose Estimation Depth Estimation | Code Available | 1
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects | Mar 27, 2018 | General Classification Object | Code Available | 1
Semantic Line Detection and Its Applications | Oct 1, 2017 | Classification General Classification | Code Available | 1
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation | Jun 14, 2017 | GPU Scene Understanding | Code Available | 1
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes | Feb 14, 2017 | 3D Object Classification General Classification | Code Available | 1
Joint 2D-3D-Semantic Data for Indoor Scene Understanding | Feb 3, 2017 | Scene Understanding | Code Available | 1
The Cityscapes Dataset for Semantic Urban Scene Understanding | Apr 6, 2016 | object-detection Object Detection | Code Available | 1
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding | Nov 9, 2015 | Decision Making Decoder | Code Available | 1
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation | Nov 2, 2015 | Crowd Counting Decoder | Code Available | 1
Microsoft COCO: Common Objects in Context | May 1, 2014 | Instance Segmentation Object | Code Available | 1
Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models | Jul 17, 2025 | 3D Point Cloud Reconstruction Point cloud reconstruction | Unverified | 0
Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection | Jul 17, 2025 | Scene Understanding | Unverified | 0
City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning | Jul 17, 2025 | Question Answering Scene Understanding | Unverified | 0
Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander | Jul 15, 2025 | Language Modeling Language Modelling | Unverified | 0
Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis | Jul 15, 2025 | Marketing Optical Character Recognition | Unverified | 0
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments | Jul 14, 2025 | Scene Understanding Spatial Reasoning | Unverified | 0
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding | Jul 10, 2025 | Scene Understanding Spatial Reasoning | Code Available | 0
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | Jul 10, 2025 | NeRF Object | Unverified | 0
What Demands Attention in Urban Street Scenes? From Scene Understanding towards Road Safety: A Survey of Vision-driven Datasets and Studies | Jul 9, 2025 | Scene Understanding Survey | Unverified | 0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding | Jun 28, 2025 | 3DGS Instance Segmentation | Unverified | 0
CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations | Jun 26, 2025 | Graph Generation Relation | Unverified | 0
Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios | Jun 25, 2025 | Autonomous Driving Decision Making | Unverified | 0
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation | Jun 25, 2025 | Novel View Synthesis Object | Unverified | 0
IPFormer: Visual 3D Panoptic Scene Completion with Context-Adaptive Instance Proposals | Jun 25, 2025 | Scene Understanding | Unverified | 0
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions | Jun 24, 2025 | Graph Generation Human-Object Interaction Detection | Unverified | 0
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations | Jun 21, 2025 | Question Answering Scene Understanding | Unverified | 0
SceneAware: Scene-Constrained Pedestrian Trajectory Prediction with LLM-Guided Walkability | Jun 17, 2025 | Pedestrian Trajectory Prediction Scene Understanding | Code Available | 0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | Jun 17, 2025 | Autonomous Driving Instance Segmentation | Unverified | 0
Unified Representation Space for 3D Visual Grounding | Jun 17, 2025 | 3D visual grounding Contrastive Learning | Unverified | 0
Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems | Jun 17, 2025 | Autonomous Driving Image Segmentation | Unverified | 0
FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding | Jun 16, 2025 | Form Graph Generation | Unverified | 0