Prospective Role of Foundation Models in Advancing Autonomous Vehicles | Dec 8, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Dec 5, 2023 | 3D Object Detection, Denoising | Code Available | 1
IGFNet: Illumination-Guided Fusion Network for Semantic Scene Understanding using RGB-Thermal Images | Dec 4, 2023 | Autonomous Driving, Scene Understanding | Code Available | 0
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM | Dec 4, 2023 | Camera Pose Estimation, Novel View Synthesis | Code Available | 4
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Dec 4, 2023 | Depth Estimation, GPU | Code Available | 4
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | Dec 3, 2023 | Active Learning, Instance Segmentation | Unverified | 0
Segment Any 3D Gaussians | Dec 1, 2023 | Interactive Segmentation, Scene Understanding | Unverified | 0
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Dec 1, 2023 | Contrastive Learning, Few-Shot Learning | Code Available | 3
Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Dec 1, 2023 | Colorization, NeRF | Code Available | 2
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Nov 30, 2023 | GPU, Inductive Bias | Code Available | 1
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation | Nov 29, 2023 | Scene Segmentation, Scene Understanding | Code Available | 1
HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos | Nov 28, 2023 | Graph Generation, Scene Graph Generation | Unverified | 0
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | Nov 28, 2023 | Clustering, Diversity | Unverified | 0
Panoptic Video Scene Graph Generation | Nov 28, 2023 | Graph Generation, Panoptic Scene Graph Generation | Code Available | 1
REACT: Recognize Every Action Everywhere All At Once | Nov 27, 2023 | Action Recognition, Activity Recognition | Unverified | 0
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding | Nov 27, 2023 | Continual Learning, Continual Semantic Segmentation | Unverified | 0
Multi-task Planar Reconstruction with Feature Warping Guidance | Nov 25, 2023 | 3D Reconstruction, Instance Segmentation | Code Available | 0
GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction | Nov 24, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge | Nov 21, 2023 | Large Language Model, Multimodal Deep Learning | Code Available | 1
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding | Nov 20, 2023 | Instance Segmentation, NeRF | Unverified | 0
SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles | Nov 20, 2023 | Change Detection, Motion Planning | Unverified | 0
SpectralGPT: Spectral Remote Sensing Foundation Model | Nov 13, 2023 | Change Detection, model | Code Available | 2
Two Stream Scene Understanding on Graph Embedding | Nov 12, 2023 | Graph Attention, Graph Embedding | Unverified | 0
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models | Nov 11, 2023 | Image Captioning, MMR total | Code Available | 3
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous Driving, Common Sense Reasoning | Code Available | 2
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding | Nov 6, 2023 | Boundary Detection, Depth Estimation | Code Available | 1
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment | Nov 5, 2023 | Caption Generation, Common Sense Reasoning | Code Available | 1
Continual Learning of Unsupervised Monocular Depth from Videos | Nov 4, 2023 | Autonomous Driving, Continual Learning | Code Available | 0
Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation | Nov 3, 2023 | 3D Semantic Segmentation, Point Cloud Segmentation | Unverified | 0
Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture | Nov 1, 2023 | 3D Object Reconstruction, 3D Reconstruction | Unverified | 0
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain | Nov 1, 2023 | Contrastive Learning, Image-to-Image Translation | Code Available | 1
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation | Oct 24, 2023 | Autonomous Driving, Scene Understanding | Unverified | 0
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation | Oct 23, 2023 | Autonomous Driving, Decoder | Code Available | 0
Panoptic Out-of-Distribution Segmentation | Oct 18, 2023 | Data Augmentation, Instance Segmentation | Unverified | 0
S4C: Self-Supervised Semantic Scene Completion with Neural Fields | Oct 11, 2023 | Image Segmentation, Navigate | Unverified | 0
DualMLP: a two-stream fusion model for 3D point cloud classification | Oct 10, 2023 | 3D Point Cloud Classification, Point Cloud Classification | Code Available | 0
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models | Oct 10, 2023 | Object, Object Tracking | Unverified | 0
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions | Oct 10, 2023 | Graph Generation, Panoptic Scene Graph Generation | Unverified | 0
Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving | Oct 3, 2023 | Autonomous Driving, Decision Making | Code Available | 1
TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation | Oct 3, 2023 | Autonomous Driving, Scene Understanding | Code Available | 1
Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception | Oct 2, 2023 | Autonomous Driving, Image Segmentation | Unverified | 0
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Oct 2, 2023 | Benchmarking, Continual Learning | Code Available | 0
Logical Bias Learning for Object Relation Prediction | Oct 1, 2023 | Causal Inference, Decision Making | Unverified | 0
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction | Sep 27, 2023 | Graph Learning, Prediction | Unverified | 0
Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms | Sep 27, 2023 | object-detection, Object Detection | Code Available | 1
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding | Sep 26, 2023 | Scene Understanding, Simultaneous Localization and Mapping | Unverified | 0
PanopticNDT: Efficient and Robust Panoptic Mapping | Sep 24, 2023 | 2D Panoptic Segmentation, 3D Panoptic Segmentation | Code Available | 1
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset | Sep 21, 2023 | Autonomous Vehicles, Depth Estimation | Unverified | 0
LLMR: Real-time Prompting of Interactive Worlds using Large Language Models | Sep 21, 2023 | Language Modeling, Language Modelling | Unverified | 0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | Sep 21, 2023 | Action Localization, Action Recognition | Unverified | 0