Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1326–1350 of 1723 papers

Title	Date	Tasks	Status
UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations	Nov 22, 2024	Autonomous DrivingScene Understanding	—Unverified
UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision	Dec 24, 2024	Scene UnderstandingSemantic Segmentation	—Unverified
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation	Jan 10, 2025	DecoderGraph Generation	—Unverified
UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration	Oct 30, 2024	Point Cloud RegistrationRepresentation Learning	—Unverified
Unsupervised 3D Structure Inference from Category-Specific Image Collections	Jan 1, 2024	Graph MatchingObject	—Unverified
Unsupervised Discovery and Composition of Object Light Fields	May 8, 2022	Novel View SynthesisObject	—Unverified
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation	Sep 30, 2021	Autonomous DrivingAutonomous Vehicles	—Unverified
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	ObjectObject Recognition	—Unverified
Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization	Jul 1, 2021	Image SegmentationScene Understanding	—Unverified
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics	Jan 26, 2025	Object RecognitionScene Understanding	—Unverified
Urban Scene Diffusion through Semantic Occupancy Map	Mar 18, 2024	Image GenerationScene Understanding	—Unverified
User Identification: A Key Enabler for Multi-User Vision-Aided Communications	Oct 27, 2022	Scene UnderstandingUser Identification	—Unverified
Using Image Priors to Improve Scene Understanding	Oct 2, 2019	Autonomous DrivingAutonomous Vehicles	—Unverified
V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving	Apr 30, 2025	Autonomous DrivingDecision Making	—Unverified
VideoGameBunny: Towards vision assistants for video games	Jul 21, 2024	Image CaptioningScene Understanding	—Unverified
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation	Apr 10, 2023	Panoptic SegmentationScene Understanding	—Unverified
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving	Sep 16, 2024	Autonomous DrivingLogical Reasoning	—Unverified
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding	May 15, 2018	Scene ClassificationScene Understanding	—Unverified
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation	Apr 20, 2025	AttributeForeground Segmentation	—Unverified
Vision-Language Embodiment for Monocular Depth Estimation	Jan 1, 2025	3D ReconstructionDepth Estimation	—Unverified
Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding	Jan 9, 2025	Autonomous DrivingIn-Context Learning	—Unverified
Vision-Language Models Struggle to Align Entities across Modalities	Mar 5, 2025	AttributeCode Generation	—Unverified
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding	May 18, 2023	Contrastive LearningObject	—Unverified
Visual Affordance and Function Understanding: A Survey	Jul 18, 2018	Affordance DetectionScene Understanding	—Unverified
Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting	Mar 27, 2025	counterfactualObject	—Unverified

Show:10 25 50

← PrevPage 54 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified