SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Mar 21, 2024 | 3D Scene Reconstruction Autonomous Driving | Unverified
Exosense: A Vision-Based Scene Understanding System For Exoskeletons | Mar 21, 2024 | Language Modelling Motion Planning | Unverified
Geometric Constraints in Deep Learning Frameworks: A Survey | Mar 19, 2024 | Deep Learning Depth Estimation | Unverified
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation | Mar 19, 2024 | Domain Adaptation Object | Code Available
M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Mar 19, 2024 | Autonomous Driving Autonomous Vehicles | Unverified
HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting | Mar 19, 2024 | Novel View Synthesis Scene Understanding | Unverified
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation | Mar 18, 2024 | Common Sense Reasoning Efficient Exploration | Code Available
Agent3D-Zero: An Agent for Zero-shot 3D Understanding | Mar 18, 2024 | Language Modelling Scene Understanding | Unverified
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Mar 18, 2024 | 3D Reconstruction 3D Scene Reconstruction | Code Available
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding | Mar 18, 2024 | Object Relation Prediction | Unverified
Urban Scene Diffusion through Semantic Occupancy Map | Mar 18, 2024 | Image Generation Scene Understanding | Unverified
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields | Mar 17, 2024 | 3D Reconstruction NeRF | Code Available
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation | Mar 16, 2024 | Instance Segmentation Object | Unverified
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | Mar 16, 2024 | Scene Understanding | Unverified
Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Mar 15, 2024 | Autonomous Driving Human-Object Interaction Detection | Unverified
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition | Mar 11, 2024 | Graph Generation Language Modeling | Unverified
Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes | Mar 7, 2024 | Motion Segmentation Optical Flow Estimation | Unverified
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Mar 6, 2024 | NeRF Scene Understanding | Unverified
HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes | Mar 5, 2024 | Scene Understanding | Unverified
PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds | Feb 29, 2024 | Depth Estimation Depth Prediction | Unverified
One model to use them all: Training a segmentation model with complementary datasets | Feb 29, 2024 | All Anatomy | Code Available
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment | Feb 27, 2024 | Scene Understanding | Unverified
AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding | Feb 27, 2024 | 3D Object Detection 3D Part Segmentation | Code Available
OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding | Feb 23, 2024 | Scene Understanding | Unverified
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Feb 19, 2024 | Autonomous Driving Scene Understanding | Unverified
Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation | Feb 14, 2024 | Decoder Object | Unverified
InCoRo: In-Context Learning for Robotics Control with Feedback Loops | Feb 7, 2024 | In-Context Learning Scene Understanding | Unverified
Neural Language of Thought Models | Feb 2, 2024 | Image Generation Object | Unverified
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | Benchmarking Change Detection | Code Available
Non-central panorama indoor dataset | Jan 30, 2024 | Scene Understanding | Code Available
Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems | Jan 23, 2024 | Scene Classification Scene Recognition | Unverified
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents | Jan 23, 2024 | Instruction Following Scene Understanding | Unverified
S^3M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving | Jan 21, 2024 | Autonomous Driving Scene Understanding | Unverified
ICGNet: A Unified Approach for Instance-Centric Grasping | Jan 18, 2024 | Object Object Reconstruction | Code Available
BPDO: Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection | Jan 18, 2024 | Diversity Scene Text Detection | Unverified
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | Jan 17, 2024 | 3D visual grounding Scene Understanding | Unverified
Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization | Jan 13, 2024 | Pseudo Label Representation Learning | Unverified
Learning Segmented 3D Gaussians via Efficient Feature Unprojection for Zero-shot Neural Scene Segmentation | Jan 11, 2024 | Decoder Panoptic Segmentation | Unverified
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection | Jan 11, 2024 | Human-Object Interaction Detection Knowledge Distillation | Unverified
VLP: Vision Language Planning for Autonomous Driving | Jan 10, 2024 | Autonomous Driving Motion Planning | Unverified
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild | Jan 8, 2024 | Language Modelling Large Language Model | Code Available
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding | Jan 3, 2024 | object-detection Object Detection | Unverified
Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models | Jan 1, 2024 | Scene Understanding | Unverified
Unsupervised 3D Structure Inference from Category-Specific Image Collections | Jan 1, 2024 | Graph Matching Object | Unverified
When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach | Jan 1, 2024 | Scene Understanding Visual Grounding | Unverified
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes | Jan 1, 2024 | Instance Segmentation Motion Estimation | Unverified
PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video | Jan 1, 2024 | 3D Panoptic Segmentation 3D Reconstruction | Code Available
Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness | Jan 1, 2024 | Human-Object Interaction Detection object-detection | Unverified
Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency | Jan 1, 2024 | 3D visual grounding Relation | Code Available
Omni-Q: Omni-Directional Scene Understanding for Unsupervised Visual Grounding | Jan 1, 2024 | Scene Understanding Visual Grounding | Unverified