Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering Mar 24, 2022 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 00 Towards General Purpose Geometry-Preserving Single-View Depth Estimation Sep 25, 2020 Depth Estimation Diversity
— Unverified 00 An Analysis of State-of-the-Art Models for Situated Interactive MultiModal Conversations (SIMMC) Jul 1, 2021 Scene Understanding
— Unverified 00 Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Jun 1, 2013 Image Segmentation object-detection
— Unverified 00 When Neural Networks Using Different Sensors Create Similar Features Nov 4, 2021 Autonomous Driving Classification
— Unverified 00 Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models Dec 1, 2010 Classification Depth Estimation
— Unverified 00 Towards holistic scene understanding: Semantic segmentation and beyond Jan 16, 2022 object-detection Object Detection
— Unverified 00 Analogical Image Translation for Fog Generation Jun 28, 2020 Image-to-Image Translation Scene Understanding
— Unverified 00 A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth Jun 13, 2022 Object object-detection
— Unverified 00 A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces Aug 20, 2021 3D Reconstruction Prediction
— Unverified 00 From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction Mar 20, 2025 3D Reconstruction Anatomy
— Unverified 00 Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data Sep 10, 2024 3D Plane Detection 3d scene graph generation
— Unverified 00 From Real to Synthetic and Back: Synthesizing Training Data for Multi-Person Scene Understanding Jun 3, 2020 Depth Estimation Generative Adversarial Network
— Unverified 00 Fully Convolutional Networks for Dense Semantic Labelling of High-Resolution Aerial Imagery Jun 8, 2016 Scene Understanding Vocal Bursts Intensity Prediction
— Unverified 00 Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents Sep 27, 2022 3D Object Detection Autonomous Driving
— Unverified 00 Fusion Based Holistic Road Scene Understanding Jun 29, 2014 Clustering Image Segmentation
— Unverified 00 FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation Aug 26, 2024 Autonomous Driving Image Segmentation
— Unverified 00 From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation May 23, 2025 3DGS 3D Reconstruction
— Unverified 00 Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences Sep 6, 2024 3D Object Detection Autonomous Driving
— Unverified 00 FroDO: From Detections to 3D Objects Jun 1, 2020 3D Reconstruction Object
— Unverified 00 Gaga: Group Any Gaussians via 3D-aware Memory Bank Apr 11, 2024 Contrastive Learning Object Tracking
— Unverified 00 GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting Dec 18, 2024 Scene Understanding Semantic Segmentation
— Unverified 00 A model of saliency-based visual attention for rapid scene analysis Nov 1, 1998 Saliency Prediction Scene Understanding
— Unverified 00 Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning Dec 1, 2015 Friction Scene Understanding
— Unverified 00 FroDO: From Detections to 3D Objects May 11, 2020 3D Reconstruction Object
— Unverified 00 GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games May 22, 2024 Code Generation Decision Making
— Unverified 00 GANspection Oct 21, 2019 Scene Understanding
— Unverified 00 Friction from Reflectance: Deep Reflectance Codes for Predicting Physical Surface Properties from One-Shot In-Field Reflectance Mar 25, 2016 Friction Scene Understanding
— Unverified 00 AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving Sep 12, 2023 Autonomous Driving Benchmarking
— Unverified 00 A Minimalist Approach to Type-Agnostic Detection of Quadrics in Point Clouds Mar 19, 2018 Scene Understanding
— Unverified 00 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Jul 19, 2024 BEV Segmentation Scene Understanding
— Unverified 00 FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding Jun 16, 2025 Form Graph Generation
— Unverified 00 Framework for 2D Ad placements in LinearTV Dec 5, 2022 Occlusion Handling Scene Understanding
— Unverified 00 GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting Sep 3, 2024 3DGS GPU
— Unverified 00 Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data Dec 7, 2022 Scene Understanding Segmentation
— Unverified 00 Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization Apr 14, 2025 Benchmarking Earth Observation
— Unverified 00 FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents Apr 11, 2025 3DGS Navigate
— Unverified 00 Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin-based Scene Representation Oct 26, 2024 Informativeness Scene Understanding
— Unverified 00 General-Purpose Aerial Intelligent Agents Empowered by Large Language Models Mar 11, 2025 Motion Planning Scene Understanding
— Unverified 00 Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper Feb 2, 2017 Active Learning GPU
— Unverified 00 Generating Robot Constitutions & Benchmarks for Semantic Safety Mar 11, 2025 Collision Avoidance Image Generation
— Unverified 00 FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding Jan 3, 2024 object-detection Object Detection
— Unverified 00 Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis May 23, 2024 Novel View Synthesis Scene Understanding
— Unverified 00 Generative Video Transformer: Can Objects be the Words? Jul 20, 2021 GPU Scene Understanding
— Unverified 00 FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition Nov 8, 2020 Action Recognition Optical Flow Estimation
— Unverified 00 Geometric Constrained Non-Line-of-Sight Imaging Mar 23, 2025 Scene Understanding Surface Reconstruction
— Unverified 00 Geometric Constraints in Deep Learning Frameworks: A Survey Mar 19, 2024 Deep Learning Depth Estimation
— Unverified 00 GeomGS: LiDAR-Guided Geometry-Aware Gaussian Splatting for Robot Localization Jan 23, 2025 3DGS Autonomous Driving
— Unverified 00 Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction Mar 1, 2025 GPU Pose Estimation
— Unverified 00 Glass Segmentation Using Intensity and Spectral Polarization Cues Jan 1, 2022 Camouflaged Object Segmentation Scene Understanding
— Unverified 00