Robust Visual Localization via Semantic-Guided Multi-Scale Transformer Jun 10, 2025 regression Scene Understanding
— Unverified 0Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms Dec 10, 2021 3D Reconstruction Autonomous Navigation
— Unverified 0Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness Apr 2, 2025 Scene Understanding
— Unverified 0RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model Apr 7, 2025 Image Captioning image-classification
— Unverified 0S^3M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving Jan 21, 2024 Autonomous Driving Scene Understanding
— Unverified 0S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation Nov 4, 2020 Autonomous Driving Edge-computing
— Unverified 0S4C: Self-Supervised Semantic Scene Completion with Neural Fields Oct 11, 2023 Image Segmentation Navigate
— Unverified 0Safety Assessment for Autonomous Systems' Perception Capabilities Aug 17, 2022 Decision Making Scene Understanding
— Unverified 0SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data May 18, 2021 object-detection Object Detection
— Unverified 0SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes Jun 2, 2025 Scene Understanding
— Unverified 0SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation May 30, 2024 Instruction Following parameter-efficient fine-tuning
— Unverified 0SAM-Guided Masked Token Prediction for 3D Scene Understanding Oct 16, 2024 3D Object Detection Knowledge Distillation
— Unverified 0SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment Jun 1, 2022 Motion Planning Question Answering
— Unverified 0Scale-aware Neural Network for Semantic Segmentation of Multi-resolution Remote Sensing Images Mar 14, 2021 Scene Understanding Segmentation
— Unverified 0SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset Sep 21, 2023 Autonomous Vehicles Depth Estimation
— Unverified 0Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans Jun 6, 2022 Scene Understanding
— Unverified 0Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning Feb 19, 2025 Autonomous Driving Bench2Drive
— Unverified 0Scenarios: A New Representation for Complex Scene Understanding Feb 16, 2018 Image Retrieval Object Recognition
— Unverified 0Scene-aware Human Pose Generation using Transformer Aug 4, 2023 Knowledge Distillation Scene Understanding
— Unverified 0Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation Jul 5, 2022 Dialogue Generation Dialogue Understanding
— Unverified 0SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis Jun 12, 2025 Novel View Synthesis Scene Understanding
— Unverified 0Counterfactual Critic Multi-Agent Training for Scene Graph Generation Dec 6, 2018 counterfactual Graph Generation
— Unverified 0Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models Apr 6, 2025 Computational Efficiency General Knowledge
Code Code Available 0Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video May 27, 2019 Inductive Bias Model Predictive Control
Code Code Available 0PENet: A Joint Panoptic Edge Detection Network Mar 15, 2023 Edge Detection Multi-Task Learning
Code Code Available 0Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding Oct 19, 2024 Autonomous Driving object-detection
Code Code Available 0Parsing Natural Scenes and Natural Language with Recursive Neural Networks Jun 1, 2011 General Classification Scene Classification
Code Code Available 0Parsing Geometry Using Structure-Aware Shape Templates Aug 3, 2018 Object Object Recognition
Code Code Available 0Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing Dec 24, 2024 Autonomous Driving Autonomous Racing
Code Code Available 0Sequential Cross Attention Based Multi-task Learning Sep 6, 2022 Multi-Task Learning Scene Understanding
Code Code Available 0PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video Jan 1, 2024 3D Panoptic Segmentation 3D Reconstruction
Code Code Available 0Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes Aug 18, 2021 Camera Calibration Depth Estimation
Code Code Available 0P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Oct 23, 2023 Autonomous Driving Decoder
Code Code Available 0SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation Nov 30, 2022 Graph Generation Image Generation
Code Code Available 0Pose-aware Multi-level Feature Network for Human Object Interaction Detection Sep 18, 2019 Human-Object Interaction Detection Object
Code Code Available 0OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Dec 31, 2024 3DGS 3D Semantic Segmentation
Code Code Available 0Dilated Residual Networks May 28, 2017 Classification General Classification
Code Code Available 0Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation Sep 24, 2018 Autonomous Driving Real-Time Semantic Segmentation
Code Code Available 0OVeNet: Offset Vector Network for Semantic Segmentation Mar 25, 2023 Optical Character Recognition (OCR) Scene Understanding
Code Code Available 0Unsupervised Domain Adaptation using Generative Adversarial Networks for Semantic Segmentation of Aerial Images May 8, 2019 Domain Adaptation Management
Code Code Available 0Predicting Deeper into the Future of Semantic Segmentation Mar 22, 2017 Attribute Autonomous Driving
Code Code Available 0Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Jul 14, 2024 3D Object Detection 3D Semantic Segmentation
Code Code Available 0Shape Anchor Guided Holistic Indoor Scene Understanding Sep 20, 2023 3D Object Detection object-detection
Code Code Available 0Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion Jun 10, 2022 Autonomous Driving Domain Adaptation
Code Code Available 0Improving Object Detection for Time-Lapse Imagery Using Temporal Features in Wildlife Monitoring Dec 20, 2024 Object object-detection
Code Code Available 0OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Jul 10, 2025 Scene Understanding Spatial Reasoning
Code Code Available 0OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation Mar 18, 2024 3D Reconstruction 3D Scene Reconstruction
Code Code Available 0Impact of Ground Truth Annotation Quality on Performance of Semantic Image Segmentation of Traffic Conditions Dec 30, 2018 Autonomous Driving Image Segmentation
Code Code Available 0On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption Sep 2, 2020 Scene Understanding Segmentation
Code Code Available 0Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation Mar 19, 2024 Domain Adaptation Object
Code Code Available 0