Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video May 27, 2019 Inductive Bias Model Predictive Control
PENet: A Joint Panoptic Edge Detection Network Mar 15, 2023 Edge Detection Multi-Task Learning
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation Mar 19, 2024 Domain Adaptation Object
Parsing Natural Scenes and Natural Language with Recursive Neural Networks Jun 1, 2011 General Classification Scene Classification
Efficient ConvNet for Real-time Semantic Segmentation Jun 1, 2017 GPU Real-Time Semantic Segmentation
Efficient Computation Sharing for Multi-Task Visual Scene Understanding Mar 16, 2023 Multi-Task Learning Scene Understanding
Parsing Geometry Using Structure-Aware Shape Templates Aug 3, 2018 Object Object Recognition
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding Oct 19, 2024 Autonomous Driving object-detection
PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video Jan 1, 2024 3D Panoptic Segmentation 3D Reconstruction
Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing Dec 24, 2024 Autonomous Driving Autonomous Racing
3D Semantic Segmentation of Modular Furniture using rjMCMC May 15, 2017 3D Semantic Segmentation furniture segmentation
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models Apr 6, 2025 Computational Efficiency General Knowledge
Real-time 3D Traffic Cone Detection for Autonomous Driving Feb 6, 2019 3D Object Detection Autonomous Driving
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences Apr 2, 2019 3D Semantic Segmentation Scene Understanding
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Oct 23, 2023 Autonomous Driving Decoder
Bridging Stereo Matching and Optical Flow via Spatiotemporal Correspondence May 22, 2019 Optical Flow Estimation Scene Understanding
AP-MTL: Attention Pruned Multi-task Learning Model for Real-time Instrument Detection and Segmentation in Robot-assisted Surgery Mar 10, 2020 Multi-Task Learning Scene Understanding
DualMLP: a two-stream fusion model for 3D point cloud classification Oct 10, 2023 3D Point Cloud Classification Point Cloud Classification
OVeNet: Offset Vector Network for Semantic Segmentation Mar 25, 2023 Optical Character Recognition (OCR) Scene Understanding
Dual-Glance Model for Deciphering Social Relationships Aug 2, 2017 model object-detection
A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap Jul 31, 2024 Human-Object Interaction Detection Image Reconstruction
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Dec 31, 2024 3DGS 3D Semantic Segmentation
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Nov 26, 2024 Object object-detection
APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds May 15, 2025 Point Cloud Segmentation Scene Understanding
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Jul 10, 2025 Scene Understanding Spatial Reasoning
DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection May 14, 2025 object-detection Object Detection
Boundary-Seeking Generative Adversarial Networks Feb 27, 2017 Scene Understanding Text Generation
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation Mar 18, 2024 3D Reconstruction 3D Scene Reconstruction
Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse Weather Nov 21, 2022 Autonomous Driving GPU
On the iterative refinement of densely connected representation levels for semantic segmentation Apr 30, 2018 Image Segmentation Scene Understanding
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding Aug 30, 2024 Language Modelling Large Language Model
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption Sep 2, 2020 Scene Understanding Segmentation
DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding Mar 25, 2024 Decoder Object
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation Sep 29, 2021 Depth Estimation Monocular Depth Estimation
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation Oct 2, 2023 Benchmarking Continual Learning
BOLD5000: A public fMRI dataset of 5000 images Sep 5, 2018 Diversity Scene Understanding
Distance Matters in Human-Object Interaction Detection Jul 5, 2022 Human-Object Interaction Detection Object
An Information-Theoretic Metric of Transferability for Task Transfer Learning May 1, 2019 General Classification Scene Understanding
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields Mar 17, 2024 3D Reconstruction NeRF
BlitzNet: A Real-Time Deep Network for Scene Understanding Aug 9, 2017 Autonomous Driving Object
Non-central panorama indoor dataset Jan 30, 2024 Scene Understanding
Dirty Pixels: Towards End-to-End Image Processing and Perception Jan 23, 2017 Autonomous Driving Deblurring
Object Attribute Matters in Visual Question Answering Dec 20, 2023 Attribute Graph Neural Network
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera Jan 9, 2019 3D Reconstruction 3D Scene Reconstruction
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data Jan 8, 2025 Autonomous Driving Instance Segmentation
Neural Radiance Field Codebooks Jan 10, 2023 Object Representation Learning
Object-aware Sound Source Localization via Audio-Visual Scene Understanding Jan 1, 2025 Scene Understanding Sound Source Localization
Dilated Residual Networks May 28, 2017 Classification General Classification
A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection Inference Jul 19, 2024 Autonomous Vehicles object-detection
Adapting Deep Network Features to Capture Psychological Representations Aug 6, 2016 Object Recognition Scene Understanding