FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping Jun 4, 2024 3DGS Scene Understanding
— Unverified 0Object Aware Egocentric Online Action Detection Jun 3, 2024 Action Detection Object
— Unverified 0EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Jun 3, 2024 Domain Adaptation Open Vocabulary Semantic Segmentation
— Unverified 0CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos Jun 3, 2024 Graph Generation Scene Graph Generation
— Unverified 0Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Jun 2, 2024 Scene Parsing Scene Understanding
— Unverified 0Learning 3D Robotics Perception using Inductive Priors May 30, 2024 3D Reconstruction Image Generation
— Unverified 0SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation May 30, 2024 Instruction Following parameter-efficient fine-tuning
— Unverified 0Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding May 29, 2024 Scene Understanding Segmentation
— Unverified 0GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane May 27, 2024 3DGS feature selection
— Unverified 0Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding May 24, 2024 Scene Understanding Zero Shot Segmentation
— Unverified 0Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis May 23, 2024 Novel View Synthesis Scene Understanding
— Unverified 0Transformers for Image-Goal Navigation May 23, 2024 Navigate Scene Understanding
— Unverified 0GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games May 22, 2024 Code Generation Decision Making
— Unverified 0TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System May 22, 2024 3D Object Detection 3D Semantic Segmentation
— Unverified 0Anticipating Object State Changes in Long Procedural Videos May 21, 2024 Object Object State Change Classification
— Unverified 0A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance May 16, 2024 LIDAR Semantic Segmentation Scene Understanding
— Unverified 0BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation May 15, 2024 Dataset Generation Scene Understanding
— Unverified 03D Shape Augmentation with Content-Aware Shape Resizing May 15, 2024 3D Generation Scene Understanding
— Unverified 0DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving May 7, 2024 3D Object Detection Autonomous Driving
— Unverified 0Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM Apr 29, 2024 Phrase Grounding Scene Understanding
— Unverified 0Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer Apr 24, 2024 Grounded Situation Recognition Scene Understanding
— Unverified 0CloudFort: Enhancing Robustness of 3D Point Cloud Classification Against Backdoor Attacks via Spatial Partitioning and Ensemble Prediction Apr 22, 2024 3D Point Cloud Classification Autonomous Vehicles
— Unverified 0On Support Relations Inference and Scene Hierarchy Graph Construction from Point Cloud in Clustered Environments Apr 22, 2024 Combinatorial Optimization graph construction
— Unverified 0Unified Scene Representation and Reconstruction for 3D Large Language Models Apr 19, 2024 3D Reconstruction Scene Understanding
— Unverified 0BACS: Background Aware Continual Semantic Segmentation Apr 19, 2024 Autonomous Driving Continual Learning
Code Code Available 0AccidentBlip: Agent of Accident Warning based on MA-former Apr 18, 2024 Language Modelling Large Language Model
— Unverified 0Multimodal 3D Object Detection on Unseen Domains Apr 17, 2024 3D Object Detection Autonomous Driving
— Unverified 0PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network Apr 16, 2024 Autonomous Driving Feature Engineering
— Unverified 0Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange Apr 11, 2024 Object Scene Understanding
Code Code Available 0Depth Estimation using Weighted-loss and Transfer Learning Apr 11, 2024 Autonomous Vehicles Decoder
— Unverified 0Gaga: Group Any Gaussians via 3D-aware Memory Bank Apr 11, 2024 Contrastive Learning Object Tracking
— Unverified 0Incorporating Explanations into Human-Machine Interfaces for Trust and Situation Awareness in Autonomous Vehicles Apr 10, 2024 Autonomous Vehicles Scene Understanding
— Unverified 0O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Apr 10, 2024 Image Segmentation Object
— Unverified 0DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning Apr 9, 2024 BEV Segmentation Scene Understanding
— Unverified 0QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Apr 9, 2024 Scene Understanding Segmentation
— Unverified 0Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation Apr 6, 2024 Image Captioning Instance Segmentation
— Unverified 0You Only Scan Once: A Dynamic Scene Reconstruction Pipeline for 6-DoF Robotic Grasping of Novel Objects Apr 4, 2024 Object Pose Tracking
— Unverified 0MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements Apr 1, 2024 3DGS Scene Understanding
— Unverified 0360+x: A Panoptic Multi-modal Scene Understanding Dataset Apr 1, 2024 Scene Understanding
— Unverified 0Adapting to Length Shift: FlexiLength Network for Trajectory Prediction Mar 31, 2024 Autonomous Driving Prediction
— Unverified 0Neural Radiance Field-based Visual Rendering: A Comprehensive Review Mar 31, 2024 NeRF Scene Understanding
— Unverified 0HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes Mar 29, 2024 3DGS Autonomous Vehicles
— Unverified 0Efficient 3D Instance Mapping and Localization with Neural Fields Mar 28, 2024 3D Instance Segmentation Image Segmentation
— Unverified 0DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding Mar 25, 2024 Decoder Object
Code Code Available 0Towards Trustworthy Automated Driving through Qualitative Scene Understanding and Explanations Mar 25, 2024 Scene Understanding
— Unverified 0Semantic Is Enough: Only Semantic Information For NeRF Reconstruction Mar 24, 2024 NeRF object-detection
— Unverified 0Multi-Task Learning with Multi-Task Optimization Mar 24, 2024 Automated Theorem Proving image-classification
— Unverified 0DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Mar 22, 2024 Denoising Scene Understanding
— Unverified 0Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting Mar 22, 2024 Instance Segmentation Object Localization
— Unverified 03D Object Detection from Point Cloud via Voting Step Diffusion Mar 21, 2024 3D Object Detection Object
Code Code Available 0