MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding Apr 5, 2022 Autonomous Vehicles Scene Understanding
— Unverified 0P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior Apr 5, 2022 Depth Estimation Monocular Depth Estimation
Code Code Available 1BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation Apr 3, 2022 Decoder Depth Estimation
Code Code Available 2Online panoptic 3D reconstruction as a Linear Assignment Problem Apr 1, 2022 3D Reconstruction Image Segmentation
Code Code Available 1Point Scene Understanding via Disentangled Instance Mesh Reconstruction Mar 31, 2022 Retrieval Scene Understanding
Code Code Available 1Collaborative Transformers for Grounded Situation Recognition Mar 30, 2022 Grounded Situation Recognition Image Classification
Code Code Available 1Multi-Task Learning for Visual Scene Understanding Mar 28, 2022 Multi-Task Learning Scene Understanding
— Unverified 0Learning to Answer Questions in Dynamic Audio-Visual Scenarios Mar 26, 2022 audio-visual learning Audio-visual Question Answering
Code Code Available 1Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification Mar 25, 2022 Retrieval Scene Understanding
— Unverified 0Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering Mar 24, 2022 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0Self-Supervised Road Layout Parsing with Graph Auto-Encoding Mar 21, 2022 Image Reconstruction Scene Understanding
Code Code Available 0Towards 3D Scene Understanding by Referring Synthetic Models Mar 20, 2022 Scene Understanding Transfer Learning
— Unverified 0Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows Mar 20, 2022 Human-Object Interaction Detection Object
— Unverified 0Deep Point Cloud Simplification for High-quality Surface Reconstruction Mar 17, 2022 Scene Understanding Surface Reconstruction
— Unverified 0MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering Mar 17, 2022 Implicit Relations Question Answering
Code Code Available 1Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans Mar 17, 2022 3D Object Recognition global-optimization
— Unverified 0WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection Mar 16, 2022 3D Object Detection Monocular 3D Object Detection
Code Code Available 1Deep learning for radar data exploitation of autonomous vehicle Mar 15, 2022 Autonomous Driving Deep Learning
Code Code Available 1InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding Mar 15, 2022 Boundary Detection Human Parsing
Code Code Available 2RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry Mar 14, 2022 Monocular Visual Odometry Motion Estimation
— Unverified 0CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers Mar 9, 2022 3D Object Detection Autonomous Vehicles
Code Code Available 2On Steering Multi-Annotations per Sample for Multi-Task Learning Mar 6, 2022 Instance Segmentation Multi-Task Learning
— Unverified 0Fast Neural Architecture Search for Lightweight Dense Prediction Networks Mar 3, 2022 Depth Estimation Image Super-Resolution
— Unverified 0Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection Mar 2, 2022 Content-Based Image Retrieval Deep Learning
— Unverified 0Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation Mar 2, 2022 Domain Adaptation Scene Understanding
Code Code Available 1TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation Feb 27, 2022 Autonomous Driving Knowledge Distillation
Code Code Available 1RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning Feb 26, 2022 3D Point Cloud Classification Point Cloud Segmentation
Code Code Available 1RescueNet: A High Resolution UAV Semantic Segmentation Benchmark Dataset for Natural Disaster Damage Assessment Feb 24, 2022 Scene Understanding Segmentation
Code Code Available 1GroupViT: Semantic Segmentation Emerges from Text Supervision Feb 22, 2022 Object Detection Scene Understanding
Code Code Available 2ReorientBot: Learning Object Reorientation for Specific-Posed Placement Feb 22, 2022 Motion Generation Motion Planning
Code Code Available 1Movies2Scenes: Using Movie Metadata to Learn Scene Representation Feb 22, 2022 Contrastive Learning Scene Understanding
— Unverified 03DRM:Pair-wise relation module for 3D object detection Feb 20, 2022 3D Object Detection Object
Code Code Available 1CARL-D: A vision benchmark suite and large scale dataset for vehicle detection and scene segmentation Feb 17, 2022 2D Object Detection Autonomous Driving
Code Code Available 0From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection Feb 15, 2022 Generalized Zero-Shot Object Detection Scene Understanding
Code Code Available 0HAKE: A Knowledge Engine Foundation for Human Activity Understanding Feb 14, 2022 Action Recognition Human-Object Interaction Detection
Code Code Available 2SafePicking: Learning Safe Object Extraction via Object-Level Mapping Feb 11, 2022 Motion Planning Object
Code Code Available 1Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics Feb 7, 2022 Autonomous Driving Depth Estimation
Code Code Available 1Catch Me if You Can: A Novel Task for Detection of Covert Geo-Locations (CGL) Feb 5, 2022 object-detection Object Detection
— Unverified 0StandardSim: A Synthetic Dataset For Retail Environments Feb 4, 2022 Change Detection Depth Estimation
— Unverified 0Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation Feb 2, 2022 PointGoal Navigation Scene Understanding
Code Code Available 0Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction Jan 28, 2022 3D Reconstruction Depth Estimation
Code Code Available 0Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding Jan 28, 2022 Graph Attention Knowledge Distillation
Code Code Available 1MonoDistill: Learning Spatial Features for Monocular 3D Object Detection Jan 26, 2022 3D Object Detection Monocular 3D Object Detection
Code Code Available 1Moving Beyond Navigation with Active Neural SLAM Jan 17, 2022 Domain Generalization motion prediction
— Unverified 0Towards holistic scene understanding: Semantic segmentation and beyond Jan 16, 2022 object-detection Object Detection
— Unverified 0Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety Jan 4, 2022 Decoder Deep Learning
— Unverified 0Scene Graph Generation: A Comprehensive Survey Jan 3, 2022 Graph Generation object-detection
— Unverified 0Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation Jan 1, 2022 Point Cloud Segmentation Scene Understanding
Code Code Available 0Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation Jan 1, 2022 3D Semantic Segmentation Autonomous Driving
— Unverified 0Glass Segmentation Using Intensity and Spectral Polarization Cues Jan 1, 2022 Camouflaged Object Segmentation Scene Understanding
— Unverified 0