Dynamic Scene Understanding from Vision-Language Representations Jan 20, 2025 Grounded Situation Recognition Human-Human Interaction Recognition
— Unverified 00 MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents Dec 11, 2024 object-detection Object Detection
— Unverified 00 Making Large Language Models Better Planners with Reasoning-Decision Alignment Aug 25, 2024 Autonomous Driving Decision Making
— Unverified 00 Manhattan Scene Understanding via XSlit Imaging Jun 1, 2013 3D geometry Scene Understanding
— Unverified 00 Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving Sep 30, 2019 Autonomous Driving Decision Making
— Unverified 00 Mapping High-level Semantic Regions in Indoor Environments without Object Recognition Mar 11, 2024 Graph Generation Language Modeling
— Unverified 00 MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report Jun 14, 2024 Autonomous Driving Scene Understanding
— Unverified 00 Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors Feb 28, 2023 Contrastive Learning Instance Segmentation
— Unverified 00 Dynamic Clustering Transformer Network for Point Cloud Segmentation May 30, 2023 Clustering Decoder
— Unverified 00 MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation Mar 11, 2025 Image Segmentation Panoptic Segmentation
— Unverified 00 Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Apr 28, 2025 3D Semantic Segmentation Contrastive Learning
— Unverified 00 DublinCity: Annotated LiDAR Point Cloud and its Applications Sep 6, 2019 3D Reconstruction Scene Understanding
— Unverified 00 DSNet: An Efficient CNN for Road Scene Segmentation Apr 10, 2019 Autonomous Driving GPU
— Unverified 00 Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement May 26, 2025 Image Enhancement object-detection
— Unverified 00 Adapting to Length Shift: FlexiLength Network for Trajectory Prediction Mar 31, 2024 Autonomous Driving Prediction
— Unverified 00 DSM: Building A Diverse Semantic Map for 3D Visual Grounding Apr 11, 2025 3D visual grounding Scene Understanding
— Unverified 00 Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry Nov 17, 2024 Question Answering Scene Understanding
— Unverified 00 Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation Sep 28, 2019 Form Meta-Learning
— Unverified 00 MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning May 13, 2023 Deep Learning Depth Estimation
— Unverified 00 Active Scene Understanding via Online Semantic Reconstruction Jun 18, 2019 Scene Parsing Scene Understanding
— Unverified 00 A Continuous Occlusion Model for Road Scene Understanding Jun 1, 2016 model Motion Segmentation
— Unverified 00 Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Feb 23, 2025 3DGS 3D Semantic Segmentation
— Unverified 00 Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs Mar 3, 2023 Depth-aware Video Panoptic Segmentation Panoptic Segmentation
— Unverified 00 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving May 7, 2024 3D Object Detection Autonomous Driving
— Unverified 00 Minimal Adversarial Examples for Deep Learning on 3D Point Clouds Aug 27, 2020 3D Object Recognition Deep Learning
— Unverified 00 Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection Jul 19, 2023 Human-Object Interaction Detection Object
— Unverified 00 Content Adaptive Front End For Audio Classification Mar 18, 2023 Audio Classification Audio Signal Processing
— Unverified 00 DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models Feb 19, 2024 Autonomous Driving Scene Understanding
— Unverified 00 MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation Mar 23, 2025 Language Modeling Language Modelling
— Unverified 00 Unified Representation Space for 3D Visual Grounding Jun 17, 2025 3D visual grounding Contrastive Learning
— Unverified 00 Unified Scene Representation and Reconstruction for 3D Large Language Models Apr 19, 2024 3D Reconstruction Scene Understanding
— Unverified 00 DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder Nov 5, 2021 Autonomous Vehicles Image Segmentation
— Unverified 00 MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements Apr 1, 2024 3DGS Scene Understanding
— Unverified 00 MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency Dec 20, 2022 object-detection Object Detection
— Unverified 00 MNEW: Multi-domain Neighborhood Embedding and Weighting for Sparse Point Clouds Segmentation Apr 5, 2020 Autonomous Driving Scene Understanding
— Unverified 00 DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Aug 29, 2024 Autonomous Driving Denoising
— Unverified 00 Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding Aug 3, 2018 Scene Understanding Semantic Segmentation
— Unverified 00 DreamAnywhere: Object-Centric Panoramic 3D Scene Generation Jun 25, 2025 Novel View Synthesis Object
— Unverified 00 Uni-Fusion: Universal Continuous Mapping Mar 22, 2023 Scene Understanding
— Unverified 00 UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations Nov 22, 2024 Autonomous Driving Scene Understanding
— Unverified 00 Modeling human intuitions about liquid flow with particle-based simulation Sep 5, 2018 Scene Understanding
— Unverified 00 Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting Nov 4, 2024 Scene Understanding Uncertainty Quantification
— Unverified 00 DORSal: Diffusion for Object-centric Representations of Scenes et al Jun 13, 2023 Neural Rendering Object
— Unverified 00 DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation May 28, 2025 Autonomous Navigation RAG
— Unverified 00 A Comprehensive Review of Modern Object Segmentation Approaches Jan 13, 2023 Image Segmentation Object
— Unverified 00 Monocular BEV Perception of Road Scenes via Front-to-Top View Projection Nov 15, 2022 Autonomous Driving GPU
— Unverified 00 Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs Jun 5, 2025 cross-modal alignment Dense Captioning
— Unverified 00 Monocular Depth Estimation with Sharp Boundary Oct 12, 2021 Decoder Depth Estimation
— Unverified 00 Does CLIP perceive art the same way we do? May 8, 2025 Image Generation Scene Understanding
— Unverified 00 MonoGRNet: A General Framework for Monocular 3D Object Detection Apr 18, 2021 2D Object Detection 3D Object Detection
— Unverified 00