Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Mar 24, 2024 | NeRF object-detection | Unverified 0
AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans | Mar 24, 2024 | 3D Instance Segmentation Instance Segmentation | Code Available 1
Multi-Task Learning with Multi-Task Optimization | Mar 24, 2024 | Automated Theorem Proving image-classification | Unverified 0
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting | Mar 22, 2024 | Instance Segmentation Object Localization | Unverified 0
DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Mar 22, 2024 | Denoising Scene Understanding | Unverified 0
Exosense: A Vision-Based Scene Understanding System For Exoskeletons | Mar 21, 2024 | Language Modelling Motion Planning | Unverified 0
SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field | Mar 21, 2024 | 3D Scene Reconstruction Autonomous Driving | Unverified 0
3D Object Detection from Point Cloud via Voting Step Diffusion | Mar 21, 2024 | 3D Object Detection Object | Code Available 0
Volumetric Environment Representation for Vision-Language Navigation | Mar 21, 2024 | 3D geometry Multi-Task Learning | Code Available 2
What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models | Mar 20, 2024 | counterfactual Hallucination | Code Available 1
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation | Mar 19, 2024 | Domain Adaptation Object | Code Available 0
Geometric Constraints in Deep Learning Frameworks: A Survey | Mar 19, 2024 | Deep Learning Depth Estimation | Unverified 0
HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting | Mar 19, 2024 | Novel View Synthesis Scene Understanding | Unverified 0
M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving | Mar 19, 2024 | Autonomous Driving Autonomous Vehicles | Unverified 0
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding | Mar 18, 2024 | Object Relation Prediction | Unverified 0
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation | Mar 18, 2024 | 3D Reconstruction 3D Scene Reconstruction | Code Available 0
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation | Mar 18, 2024 | Common Sense Reasoning Efficient Exploration | Code Available 0
Agent3D-Zero: An Agent for Zero-shot 3D Understanding | Mar 18, 2024 | Language Modelling Scene Understanding | Unverified 0
Urban Scene Diffusion through Semantic Occupancy Map | Mar 18, 2024 | Image Generation Scene Understanding | Unverified 0
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields | Mar 17, 2024 | 3D Reconstruction NeRF | Code Available 0
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | Mar 16, 2024 | Scene Understanding | Unverified 0
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation | Mar 16, 2024 | Instance Segmentation Object | Unverified 0
Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning | Mar 15, 2024 | Autonomous Driving Human-Object Interaction Detection | Unverified 0
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding | Mar 14, 2024 | Contrastive Learning Representation Learning | Code Available 1
MoAI: Mixture of All Intelligence for Large Language and Vision Models | Mar 12, 2024 | All Mixture-of-Experts | Code Available 3
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition | Mar 11, 2024 | Graph Generation Language Modeling | Unverified 0
Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer | Mar 11, 2024 | Anatomy Disentanglement | Code Available 1
Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation | Mar 8, 2024 | Depth Estimation Monocular Depth Estimation | Code Available 1
Embodied Understanding of Driving Scenarios | Mar 7, 2024 | Autonomous Driving Language Modeling | Code Available 3
Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes | Mar 7, 2024 | Motion Segmentation Optical Flow Estimation | Unverified 0
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Mar 6, 2024 | NeRF Scene Understanding | Unverified 0
HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes | Mar 5, 2024 | Scene Understanding | Unverified 0
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object Reconstruction Instance Segmentation | Code Available 2
WHU-Synthetic: A Synthetic Perception Dataset for 3-D Multitask Model Research | Feb 29, 2024 | 3D Reconstruction Attribute | Code Available 1
One model to use them all: Training a segmentation model with complementary datasets | Feb 29, 2024 | All Anatomy | Code Available 0
PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds | Feb 29, 2024 | Depth Estimation Depth Prediction | Unverified 0
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment | Feb 27, 2024 | Scene Understanding | Unverified 0
AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding | Feb 27, 2024 | 3D Object Detection 3D Part Segmentation | Code Available 0
OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding | Feb 23, 2024 | Scene Understanding | Unverified 0
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding | Feb 22, 2024 | Diversity Scene Understanding | Code Available 3
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Feb 19, 2024 | Autonomous Driving Scene Understanding | Unverified 0
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Feb 17, 2024 | Panoptic Segmentation Scene Segmentation | Code Available 1
Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation | Feb 14, 2024 | Decoder Object | Unverified 0
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Feb 12, 2024 | Hallucination Object Localization | Code Available 4
InCoRo: In-Context Learning for Robotics Control with Feedback Loops | Feb 7, 2024 | In-Context Learning Scene Understanding | Unverified 0
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives | Feb 5, 2024 | Continual Learning Multi-Task Learning | Code Available 2
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Feb 5, 2024 | 3D Semantic Segmentation Camera Pose Estimation | Code Available 3
Neural Language of Thought Models | Feb 2, 2024 | Image Generation Object | Unverified 0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | Benchmarking Change Detection | Code Available 0
Non-central panorama indoor dataset | Jan 30, 2024 | Scene Understanding | Code Available 0