Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1250 of 1723 papers

Title	Date	Tasks	Status
Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction	Jan 28, 2022	3D ReconstructionDepth Estimation	CodeCode Available
Moving Beyond Navigation with Active Neural SLAM	Jan 17, 2022	Domain Generalizationmotion prediction	—Unverified
Towards holistic scene understanding: Semantic segmentation and beyond	Jan 16, 2022	object-detectionObject Detection	—Unverified
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety	Jan 4, 2022	DecoderDeep Learning	—Unverified
Scene Graph Generation: A Comprehensive Survey	Jan 3, 2022	Graph Generationobject-detection	—Unverified
Glass Segmentation Using Intensity and Spectral Polarization Cues	Jan 1, 2022	Camouflaged Object SegmentationScene Understanding	—Unverified
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation	Jan 1, 2022	3D Semantic SegmentationAutonomous Driving	—Unverified
Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation	Jan 1, 2022	Point Cloud SegmentationScene Understanding	CodeCode Available
HSPACE: Synthetic Parametric Humans Animated in Complex Environments	Dec 23, 2021	3D Human Pose EstimationScene Understanding	—Unverified
Distillation of Human-Object Interaction Contexts for Action Recognition	Dec 17, 2021	Action RecognitionGraph Attention	—Unverified
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition	Dec 14, 2021	Human-Object Interaction DetectionScene Understanding	—Unverified
Image-to-Height Domain Translation for Synthetic Aperture Sonar	Dec 12, 2021	Generative Adversarial NetworkScene Understanding	—Unverified
3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map	Dec 10, 2021	Autonomous VehiclesNavigate	—Unverified
Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms	Dec 10, 2021	3D ReconstructionAutonomous Navigation	—Unverified
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding	Dec 6, 2021	3D Instance Segmentation3D Semantic Segmentation	—Unverified
Joint Modeling of Visual Objects and Relations for Scene Graph Generation	Dec 1, 2021	Graph EmbeddingGraph Generation	—Unverified
Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans	Dec 1, 2021	4D Panoptic SegmentationAutonomous Navigation	CodeCode Available
REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision	Dec 1, 2021	3D Human Reconstruction3D Reconstruction	—Unverified
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding	Dec 1, 2021	DisentanglementDomain Adaptation	—Unverified
Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding	Nov 30, 2021	Domain AdaptationLanguage Modeling	—Unverified
DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes	Nov 30, 2021	FrictionObject	—Unverified
PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds	Nov 28, 2021	3D Shape Classificationgraph construction	—Unverified
Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation	Nov 26, 2021	AllGraph Generation	—Unverified
Joint stereo 3D object detection and implicit surface reconstruction	Nov 25, 2021	3D Object DetectionHallucination	CodeCode Available
Panoptic Segmentation Meets Remote Sensing	Nov 23, 2021	Panoptic SegmentationScene Understanding	—Unverified
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot	Nov 22, 2021	Scene Understanding	—Unverified
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion	Nov 16, 2021	3D Semantic SegmentationAutonomous Driving	—Unverified
Robust deep learning-based semantic organ segmentation in hyperspectral images	Nov 9, 2021	Deep LearningImage Segmentation	—Unverified
DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder	Nov 5, 2021	Autonomous VehiclesImage Segmentation	—Unverified
When Neural Networks Using Different Sensors Create Similar Features	Nov 4, 2021	Autonomous DrivingClassification	—Unverified
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety	Oct 22, 2021	Scene Understanding	—Unverified
Adversarial Scene Reconstruction and Object Detection System for Assisting Autonomous Vehicle	Oct 13, 2021	Deep Learningobject-detection	—Unverified
Monocular Depth Estimation with Sharp Boundary	Oct 12, 2021	DecoderDepth Estimation	—Unverified
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation	Sep 30, 2021	Autonomous DrivingAutonomous Vehicles	—Unverified
Semantic Dense Reconstruction with Consistent Scene Segments	Sep 30, 2021	3D Scene ReconstructionScene Understanding	—Unverified
Referring Self-supervised Learning on 3D Point Cloud	Sep 29, 2021	Scene UnderstandingSelf-Supervised Learning	—Unverified
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation	Sep 29, 2021	Depth EstimationMonocular Depth Estimation	CodeCode Available
Efficient Point Transformer for Large-scale 3D Scene Understanding	Sep 29, 2021	3D Semantic SegmentationQuantization	—Unverified
Audio-Visual Collaborative Representation Learning for Dynamic Saliency Prediction	Sep 17, 2021	Representation LearningSaliency Prediction	—Unverified
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning	Sep 16, 2021	DecoderImage Captioning	CodeCode Available
Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images	Sep 15, 2021	Autonomous NavigationDecision Making	—Unverified
On the Sins of Image Synthesis Loss for Self-supervised Depth Estimation	Sep 13, 2021	AttributeDepth Estimation	—Unverified
Residual 3D Scene Flow Learning with Context-Aware Feature Extraction	Sep 10, 2021	Autonomous DrivingScene Flow Estimation	—Unverified
Single Image 3D Object Estimation with Primitive Graph Networks	Sep 9, 2021	Graph Neural NetworkObject	CodeCode Available
RefineCap: Concept-Aware Refinement for Image Captioning	Sep 8, 2021	DecoderDescriptive	—Unverified
Improving Building Segmentation for Off-Nadir Satellite Imagery	Sep 8, 2021	Scene UnderstandingSegmentation	—Unverified
Binaural SoundNet: Predicting Semantics, Depth and Motion with Binaural Sounds	Sep 6, 2021	Scene UnderstandingSuper-Resolution	—Unverified
Multi-task learning from fixed-wing UAV images for 2D/3D city modeling	Aug 25, 2021	Change DetectionDepth Estimation	—Unverified
Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks	Aug 23, 2021	Face RecognitionObject Recognition	—Unverified
A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces	Aug 20, 2021	3D ReconstructionPrediction	—Unverified

Show:10 25 50

← PrevPage 25 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified