PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics Sep 11, 2023 3D Reconstruction Camera Localization
— Unverified 0Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving Sep 11, 2023 Autonomous Driving Descriptive
— Unverified 0Weakly Supervised Point Clouds Transformer for 3D Object Detection Sep 8, 2023 3D Object Detection Object
— Unverified 0Structural Concept Learning via Graph Attention for Multi-Level Rearrangement Planning Sep 5, 2023 Graph Attention Object Rearrangement
— Unverified 0Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception Aug 31, 2023 Activity Recognition Human Activity Recognition
— Unverified 0Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation Aug 28, 2023 Autonomous Vehicles Depth Estimation
— Unverified 0Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation Aug 27, 2023 Contrastive Learning Domain Adaptation
— Unverified 0End-to-end Autonomous Driving using Deep Learning: A Systematic Review Aug 27, 2023 Autonomous Driving object-detection
— Unverified 0SurGNN: Explainable visual scene understanding and assessment of surgical skill using graph neural networks Aug 24, 2023 Scene Understanding
— Unverified 0Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views Aug 22, 2023 NeRF Neural Rendering
— Unverified 0Explore and Tell: Embodied Visual Captioning in 3D Environments Aug 21, 2023 Image Captioning Navigate
— Unverified 0CASPNet++: Joint Multi-Agent Motion Prediction Aug 15, 2023 Autonomous Driving motion prediction
— Unverified 0Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction Aug 8, 2023 Activity Recognition Autonomous Driving
— Unverified 0Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities Aug 6, 2023 Depth Estimation Instance Segmentation
— Unverified 0Cognitive TransFuser: Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint Prediction Aug 4, 2023 Imitation Learning Scene Understanding
Code Code Available 0Scene-aware Human Pose Generation using Transformer Aug 4, 2023 Knowledge Distillation Scene Understanding
— Unverified 0Weakly Supervised 3D Instance Segmentation without Instance-level Annotations Aug 3, 2023 3D Instance Segmentation Instance Segmentation
— Unverified 0Interpretable End-to-End Driving Model for Implicit Scene Understanding Aug 2, 2023 Graph Generation object-detection
— Unverified 0Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding Aug 1, 2023 3D geometry 3D Open-Vocabulary Instance Segmentation
— Unverified 0Gated Driver Attention Predictor Aug 1, 2023 Driver Attention Monitoring Prediction
Code Code Available 0Enhancing image captioning with depth information using a Transformer-based framework Jul 24, 2023 Image Captioning Image Paragraph Captioning
— Unverified 0Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery Jul 22, 2023 Continual Learning Scene Understanding
Code Code Available 0Challenges for Monocular 6D Object Pose Estimation in Robotics Jul 22, 2023 6D Pose Estimation using RGB Object
— Unverified 0Improving Online Lane Graph Extraction by Object-Lane Clustering Jul 20, 2023 3D Object Detection Autonomous Driving
— Unverified 0Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection Jul 19, 2023 Human-Object Interaction Detection Object
— Unverified 0Towards A Unified Agent with Foundation Models Jul 18, 2023 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0Human Action Recognition in Still Images Using ConViT Jul 18, 2023 Action Recognition Action Recognition In Still Images
— Unverified 0DeepIPCv2: LiDAR-powered Robust Environmental Perception and Navigational Control for Autonomous Vehicle Jul 13, 2023 Autonomous Driving Scene Understanding
Code Code Available 0Smart Infrastructure: A Research Junction Jul 12, 2023 Scene Understanding Synthetic Data Generation
— Unverified 0Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation Jul 10, 2023 Scene Understanding Semantic Segmentation
— Unverified 0PSDR-Room: Single Photo to Scene using Differentiable Rendering Jul 6, 2023 Scene Understanding
— Unverified 0Object Recognition System on a Tactile Device for Visually Impaired Jul 5, 2023 object-detection Object Detection
— Unverified 0Artifacts Mapping: Multi-Modal Semantic Mapping for Object Detection and 3D Localization Jul 3, 2023 object-detection Object Detection
— Unverified 0Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis Jun 28, 2023 Scene Understanding
Code Code Available 0Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties Jun 27, 2023 Friction Scene Understanding
— Unverified 0Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos Jun 27, 2023 Multi-Task Learning Scene Understanding
— Unverified 0Semantic-aware Transmission for Robust Point Cloud Classification Jun 23, 2023 Classification Decoder
— Unverified 0Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation Jun 23, 2023 Graph Generation Scene Graph Generation
— Unverified 0CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation Jun 17, 2023 Decision Making Instruction Following
— Unverified 0DORSal: Diffusion for Object-centric Representations of Scenes et al Jun 13, 2023 Neural Rendering Object
— Unverified 0Neural Projection Mapping Using Reflectance Fields Jun 11, 2023 Scene Understanding
— Unverified 0SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding Jun 9, 2023 Scene Understanding
— Unverified 0A Dynamic Feature Interaction Framework for Multi-task Visual Perception Jun 8, 2023 Autonomous Driving Depth Estimation
— Unverified 0TopoMask: Instance-Mask-Based Formulation for the Road Topology Problem via Transformer-Based Architecture Jun 8, 2023 3D Lane Detection Graph Neural Network
— Unverified 0Disaster Anomaly Detector via Deeper FCDDs for Explainable Initial Responses Jun 5, 2023 Anomaly Detection Disaster Response
— Unverified 0Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing Jun 5, 2023 Scene Parsing Scene Understanding
— Unverified 0Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes Jun 4, 2023 Common Sense Reasoning Question Answering
— Unverified 0Self-supervised Vision Transformers for 3D Pose Estimation of Novel Objects May 31, 2023 3D Pose Estimation Contrastive Learning
Code Code Available 0Dynamic Clustering Transformer Network for Point Cloud Segmentation May 30, 2023 Clustering Decoder
— Unverified 0Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation May 30, 2023 Graph Generation Image Generation
Code Code Available 0