Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1401–1450 of 1723 papers

Title	Date	Tasks	Status
PT-ResNet: Perspective Transformation-Based Residual Network for Semantic Road Image Segmentation	Oct 29, 2019	Image Segmentationroad scene understanding	—Unverified
Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM	Apr 29, 2024	Phrase GroundingScene Understanding	—Unverified
Quantifying the synthetic and real domain gap in aerial scene understanding	Nov 29, 2024	Domain AdaptationScene Understanding	—Unverified
QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding	Apr 9, 2024	Scene UnderstandingSegmentation	—Unverified
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding	Mar 18, 2024	ObjectRelation Prediction	—Unverified
Radiation Search Operations using Scene Understanding with Autonomous UAV and UGV	Aug 31, 2016	Scene SegmentationScene Understanding	—Unverified
Radiometric Scene Decomposition: Scene Reflectance, Illumination, and Geometry from RGB-D Images	Apr 5, 2016	Scene Understanding	—Unverified
RAFT: Robust Augmentation of FeaTures for Image Segmentation	May 7, 2025	Active LearningDomain Adaptation	—Unverified
RailSem19: A Dataset for Semantic Rail Scene Understanding	Jun 16, 2019	Scene UnderstandingSemantic Segmentation	—Unverified
RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds	May 2, 2022	Autonomous DrivingDecoder	—Unverified
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning	Sep 12, 2023	Autonomous VehiclesQuestion Answering	—Unverified
RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry	Mar 14, 2022	Monocular Visual OdometryMotion Estimation	—Unverified
RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration	Apr 9, 2025	3D Semantic SegmentationBenchmarking	—Unverified
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation	May 21, 2025	GPUNatural Language Queries	—Unverified
REACT: Recognize Every Action Everywhere All At Once	Nov 27, 2023	Action RecognitionActivity Recognition	—Unverified
RealGraph: A Multiview Dataset for 4D Real-world Context Graph Generation	Jan 1, 2023	Graph GenerationScene Understanding	—Unverified
Real time backbone for semantic segmentation	Mar 16, 2019	Autonomous DrivingModel Compression	—Unverified
Real-Time Semantic Stereo Matching	Oct 1, 2019	Scene UnderstandingSemantic Segmentation	—Unverified
Reasoning About Physical Interactions with Object-Centric Models	May 1, 2019	ObjectScene Understanding	—Unverified
Reasoning About Physical Interactions with Object-Oriented Prediction and Planning	Dec 28, 2018	ObjectScene Understanding	—Unverified
Reasoning with shapes: profiting cognitive susceptibilities to infer linear mapping transformations between shapes	Sep 1, 2017	Scene Understanding	—Unverified
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation	Oct 24, 2023	Autonomous DrivingScene Understanding	—Unverified
Recognizing Dynamic Scenes with Deep Dual Descriptor based on Key Frames and Key Segments	Feb 15, 2017	Scene RecognitionScene Understanding	—Unverified
Recognizing Material Properties from Images	Jan 9, 2018	Material ClassificationMaterial Recognition	—Unverified
Reconstructing Animals and the Wild	Nov 27, 2024	3D ReconstructionScene Understanding	—Unverified
Reconstructing Vechicles from a Single Image: Shape Priors for Road Scene Understanding	Sep 29, 2016	Autonomous Drivingroad scene understanding	—Unverified
Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing	Jun 5, 2023	Scene ParsingScene Understanding	—Unverified
Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Nov 18, 2024	Scene SegmentationScene Understanding	—Unverified
Referring Self-supervised Learning on 3D Point Cloud	Sep 29, 2021	Scene UnderstandingSelf-Supervised Learning	—Unverified
RefineCap: Concept-Aware Refinement for Image Captioning	Sep 8, 2021	DecoderDescriptive	—Unverified
Relationship Proposal Networks	Jul 1, 2017	AllScene Understanding	—Unverified
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration	Sep 21, 2024	Collision AvoidanceDecision Making	—Unverified
Relevance for Human Robot Collaboration	Sep 12, 2024	Dimensionality ReductionScene Understanding	—Unverified
REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision	Dec 1, 2021	3D Human Reconstruction3D Reconstruction	—Unverified
Residual 3D Scene Flow Learning with Context-Aware Feature Extraction	Sep 10, 2021	Autonomous DrivingScene Flow Estimation	—Unverified
Resource-Efficient Multiview Perception: Integrating Semantic Masking with Masked Autoencoders	Oct 7, 2024	Multiview DetectionScene Understanding	—Unverified
BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions	Dec 21, 2023	DecoderMulti-Task Learning	—Unverified
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets	Jul 29, 2024	DecoderScene Understanding	—Unverified
Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection	Jan 21, 2021	Autonomous NavigationModel Selection	—Unverified
VrR-VG: Refocusing Visually-Relevant Relationships	Feb 1, 2019	Image CaptioningQuestion Answering	—Unverified
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding	Dec 4, 2022	6D Pose Estimation using RGBObject	—Unverified
Review on Panoramic Imaging and Its Applications in Scene Understanding	May 11, 2022	Autonomous DrivingDepth Estimation	—Unverified
Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks	May 27, 2025	3D Scene ReconstructionDiagnostic	—Unverified
Road Rage Reasoning with Vision-language Models (VLMs): Task Definition and Evaluation Dataset	Mar 14, 2025	Scene Understanding	—Unverified
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets	May 21, 2025	Dataset GenerationDescriptive	—Unverified
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics	Nov 25, 2024	Robot ManipulationScene Understanding	—Unverified
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion	Nov 16, 2021	3D Semantic SegmentationAutonomous Driving	—Unverified
Robust Category-Level 3D Pose Estimation from Synthetic Data	May 25, 2023	3D Pose Estimation3D Reconstruction	—Unverified
Robust deep learning-based semantic organ segmentation in hyperspectral images	Nov 9, 2021	Deep LearningImage Segmentation	—Unverified
Robust Multi-Modal Image Stitching for Improved Scene Understanding	Dec 28, 2023	Image StitchingScene Understanding	—Unverified

Show:10 25 50

← PrevPage 29 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified