Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1301–1350 of 1723 papers

Title	Date	Tasks	Status
Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation	Jun 1, 2019	Depth EstimationMonocular Depth Estimation	—Unverified
Towards Scene Understanding with Detailed 3D Object Representations	Nov 18, 2014	3D Pose EstimationObject	—Unverified
Towards seamless multi-view scene analysis from satellite to street-level	May 23, 2017	Change DetectionEarth Observation	—Unverified
Towards Trustworthy Automated Driving through Qualitative Scene Understanding and Explanations	Mar 25, 2024	Scene Understanding	—Unverified
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation	Jun 23, 2023	Graph GenerationScene Graph Generation	—Unverified
Towards urban scenes understanding through polarization cues	Jun 3, 2021	Depth EstimationScene Understanding	—Unverified
Training-Free Model Merging for Multi-target Domain Adaptation	Jul 18, 2024	Domain AdaptationMulti-target Domain Adaptation	—Unverified
TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication	Apr 2, 2025	Language ModelingLanguage Modelling	—Unverified
Transformers for Image-Goal Navigation	May 23, 2024	NavigateScene Understanding	—Unverified
TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance	Apr 23, 2025	Question AnsweringScene Understanding	—Unverified
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	May 22, 2024	3D Object Detection3D Semantic Segmentation	—Unverified
Two Stream Scene Understanding on Graph Embedding	Nov 12, 2023	Graph AttentionGraph Embedding	—Unverified
U4D: Unsupervised 4D Dynamic Scene Understanding	Jul 23, 2019	3D Pose EstimationInstance Segmentation	—Unverified
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information	Feb 25, 2023	DecoderImage Segmentation	—Unverified
Understanding and Evaluating Hallucinations in 3D Visual Language Models	Feb 18, 2025	DiversityScene Understanding	—Unverified
Understanding Bayesian Rooms Using Composite 3D Object Models	Jun 1, 2013	ObjectObject Recognition	—Unverified
Understanding Indoor Scenes Using 3D Geometric Phrases	Jun 1, 2013	General ClassificationObject	—Unverified
Understanding Pedestrian Behaviors From Stationary Crowd Groups	Jun 1, 2015	Event DetectionScene Understanding	—Unverified
Understanding Real World Indoor Scenes With Synthetic Data	Jun 1, 2016	Scene Understanding	—Unverified
Understand Scene Categories by Objects: A Semantic Regularized Scene Classifier Using Convolutional Neural Networks	Sep 22, 2015	ClassificationDiversity	—Unverified
Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement	May 26, 2025	Image Enhancementobject-detection	—Unverified
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs	Mar 3, 2023	Depth-aware Video Panoptic SegmentationPanoptic Segmentation	—Unverified
Unified Representation Space for 3D Visual Grounding	Jun 17, 2025	3D visual groundingContrastive Learning	—Unverified
Unified Scene Representation and Reconstruction for 3D Large Language Models	Apr 19, 2024	3D ReconstructionScene Understanding	—Unverified
Uni-Fusion: Universal Continuous Mapping	Mar 22, 2023	Scene Understanding	—Unverified
UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations	Nov 22, 2024	Autonomous DrivingScene Understanding	—Unverified
UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision	Dec 24, 2024	Scene UnderstandingSemantic Segmentation	—Unverified
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation	Jan 10, 2025	DecoderGraph Generation	—Unverified
UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration	Oct 30, 2024	Point Cloud RegistrationRepresentation Learning	—Unverified
Unsupervised 3D Structure Inference from Category-Specific Image Collections	Jan 1, 2024	Graph MatchingObject	—Unverified
Unsupervised Discovery and Composition of Object Light Fields	May 8, 2022	Novel View SynthesisObject	—Unverified
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation	Sep 30, 2021	Autonomous DrivingAutonomous Vehicles	—Unverified
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	ObjectObject Recognition	—Unverified
Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization	Jul 1, 2021	Image SegmentationScene Understanding	—Unverified
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics	Jan 26, 2025	Object RecognitionScene Understanding	—Unverified
Urban Scene Diffusion through Semantic Occupancy Map	Mar 18, 2024	Image GenerationScene Understanding	—Unverified
User Identification: A Key Enabler for Multi-User Vision-Aided Communications	Oct 27, 2022	Scene UnderstandingUser Identification	—Unverified
Using Image Priors to Improve Scene Understanding	Oct 2, 2019	Autonomous DrivingAutonomous Vehicles	—Unverified
V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving	Apr 30, 2025	Autonomous DrivingDecision Making	—Unverified
VideoGameBunny: Towards vision assistants for video games	Jul 21, 2024	Image CaptioningScene Understanding	—Unverified
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation	Apr 10, 2023	Panoptic SegmentationScene Understanding	—Unverified
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving	Sep 16, 2024	Autonomous DrivingLogical Reasoning	—Unverified
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding	May 15, 2018	Scene ClassificationScene Understanding	—Unverified
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation	Apr 20, 2025	AttributeForeground Segmentation	—Unverified
Vision-Language Embodiment for Monocular Depth Estimation	Jan 1, 2025	3D ReconstructionDepth Estimation	—Unverified
Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding	Jan 9, 2025	Autonomous DrivingIn-Context Learning	—Unverified
Vision-Language Models Struggle to Align Entities across Modalities	Mar 5, 2025	AttributeCode Generation	—Unverified
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding	May 18, 2023	Contrastive LearningObject	—Unverified
Visual Affordance and Function Understanding: A Survey	Jul 18, 2018	Affordance DetectionScene Understanding	—Unverified
Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting	Mar 27, 2025	counterfactualObject	—Unverified

Show:10 25 50

← PrevPage 27 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified