Distance Matters in Human-Object Interaction Detection Jul 5, 2022 Human-Object Interaction Detection Object
An Information-Theoretic Metric of Transferability for Task Transfer Learning May 1, 2019 General Classification Scene Understanding
Parsing Natural Scenes and Natural Language with Recursive Neural Networks Jun 1, 2011 General Classification Scene Classification
BlitzNet: A Real-Time Deep Network for Scene Understanding Aug 9, 2017 Autonomous Driving Object
PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video Jan 1, 2024 3D Panoptic Segmentation 3D Reconstruction
Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes Aug 18, 2021 Camera Calibration Depth Estimation
Dirty Pixels: Towards End-to-End Image Processing and Perception Jan 23, 2017 Autonomous Driving Deblurring
Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing Dec 24, 2024 Autonomous Driving Autonomous Racing
PENet: A Joint Panoptic Edge Detection Network Mar 15, 2023 Edge Detection Multi-Task Learning
Dilated Residual Networks May 28, 2017 Classification General Classification
A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection Inference Jul 19, 2024 Autonomous Vehicles object-detection
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Dec 31, 2024 3DGS 3D Semantic Segmentation
Adapting Deep Network Features to Capture Psychological Representations Aug 6, 2016 Object Recognition Scene Understanding
OVeNet: Offset Vector Network for Semantic Segmentation Mar 25, 2023 Optical Character Recognition (OCR) Scene Understanding
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Oct 23, 2023 Autonomous Driving Decoder
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Jul 10, 2025 Scene Understanding Spatial Reasoning
Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery Feb 5, 2021 Earth Observation Scene Understanding
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation Mar 18, 2024 3D Reconstruction 3D Scene Reconstruction
On the iterative refinement of densely connected representation levels for semantic segmentation Apr 30, 2018 Image Segmentation Scene Understanding
Depth-Induced Multi-Scale Recurrent Attention Network for Saliency Detection Oct 1, 2019 RGB-D Salient Object Detection Saliency Detection
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption Sep 2, 2020 Scene Understanding Segmentation
One model to use them all: Training a segmentation model with complementary datasets Feb 29, 2024 All Anatomy
Beyond Human Perception: Understanding Multi-Object World from Monocular View Jan 1, 2025 3D visual grounding Denoising
An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions Feb 20, 2019 Autonomous Driving Scene Understanding
DenseASPP for Semantic Segmentation in Street Scenes Jun 1, 2018 Autonomous Driving Image Segmentation
Deep Video Deblurring for Hand-Held Cameras Jul 1, 2017 Deblurring Image Deblurring
Deep Video Deblurring Nov 25, 2016 Deblurring Image Deblurring
Deep Surface Normal Estimation with Hierarchical RGB-D Fusion Apr 6, 2019 Scene Understanding Surface Normal Estimation
A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio Oct 1, 2024 Scene Understanding Sound Source Localization
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields Mar 17, 2024 3D Reconstruction NeRF
Object Attribute Matters in Visual Question Answering Dec 20, 2023 Attribute Graph Neural Network
Object-aware Sound Source Localization via Audio-Visual Scene Understanding Jan 1, 2025 Scene Understanding Sound Source Localization
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation May 4, 2025 Benchmarking Feature Upsampling
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer Apr 3, 2019 Deep Reinforcement Learning Reinforcement Learning
Non-central panorama indoor dataset Jan 30, 2024 Scene Understanding
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data Jan 8, 2025 Autonomous Driving Instance Segmentation
Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection Feb 15, 2019 Relationship Detection Scene Understanding
Neural Radiance Field Codebooks Jan 10, 2023 Object Representation Learning
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting Jul 6, 2021 3D Object Detection Autonomous Driving
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera Jan 9, 2019 3D Reconstruction 3D Scene Reconstruction
Deep Learning based Switching Filter for Impulsive Noise Removal in Color Images Dec 3, 2019 Denoising Image Denoising
BACS: Background Aware Continual Semantic Segmentation Apr 19, 2024 Autonomous Driving Continual Learning
Deep Learning--Based Scene Simplification for Bionic Vision Jan 30, 2021 Deep Learning Depth Estimation
DeepIPCv2: LiDAR-powered Robust Environmental Perception and Navigational Control for Autonomous Vehicle Jul 13, 2023 Autonomous Driving Scene Understanding
AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding Feb 27, 2024 3D Object Detection 3D Part Segmentation
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty May 2, 2018 Scene Understanding Sensor Fusion
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images Nov 4, 2024 Multi-Task Learning Scene Understanding
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks? Sep 5, 2018 3D Reconstruction Depth Estimation
AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning Jan 1, 2025 Audio-visual Question Answering Continual Learning
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation Mar 3, 2021 Autonomous Driving Depth Estimation