Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 1723 papers

Title	Date	Tasks	Status	Hype	Score
OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation	Jul 28, 2023	Autonomous DrivingScene Understanding	CodeCode Available	1	5
Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation	Dec 16, 2021	Feature ImportanceScene Understanding	CodeCode Available	1	5
IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation	Dec 20, 2019	Disparity EstimationScene Understanding	CodeCode Available	1	5
DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction	May 9, 2024	Contrastive LearningScene Understanding	CodeCode Available	1	5
Dynamic Graph Message Passing Networks	Aug 19, 2019	Image Classificationobject-detection	CodeCode Available	1	5
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D	Sep 28, 2021	Multiple Object TrackingNovel View Synthesis	CodeCode Available	1	5
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation	Apr 22, 2023	Autonomous DrivingKnowledge Distillation	CodeCode Available	1	5
NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models	Mar 17, 2025	Question AnsweringScene Understanding	CodeCode Available	1	5
DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization	Apr 30, 2023	DecoderNeRF	CodeCode Available	1	5
Detecting Human-Object Interaction via Fabricated Compositional Learning	Mar 15, 2021	Affordance RecognitionHuman-Object Interaction Detection	CodeCode Available	1	5
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment	Nov 5, 2023	Caption GenerationCommon Sense Reasoning	CodeCode Available	1	5
Bidirectional Projection Network for Cross Dimension Scene Understanding	Mar 26, 2021	2D Semantic Segmentation3D Semantic Segmentation	CodeCode Available	1	5
Learning How To Robustly Estimate Camera Pose in Endoscopic Videos	Apr 17, 2023	3D ReconstructionCamera Pose Estimation	CodeCode Available	1	5
NODIS: Neural Ordinary Differential Scene Understanding	Jan 14, 2020	AllGraph Generation	CodeCode Available	1	5
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition	Aug 10, 2021	Action ClassificationAction Recognition	CodeCode Available	1	5
Dynamic Graph Message Passing Networks for Visual Recognition	Sep 20, 2022	image-classificationImage Classification	CodeCode Available	1	5
Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond	May 11, 2023	Scene Understanding	CodeCode Available	1	5
DPF: Learning Dense Prediction Fields with Weak Supervision	Mar 29, 2023	Intrinsic Image DecompositionPrediction	CodeCode Available	1	5
LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data	Sep 19, 2023	Anomaly DetectionAutonomous Driving	CodeCode Available	1	5
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Jul 15, 2024	AllImage Retrieval	CodeCode Available	1	5
Digging Into Self-Supervised Monocular Depth Estimation	Jun 4, 2018	Camera Pose EstimationDepth Estimation	CodeCode Available	1	5
Object Pose Estimation via the Aggregation of Diffusion Features	Mar 27, 2024	Pose EstimationScene Understanding	CodeCode Available	1	5
Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation	Jul 15, 2025	Large Language ModelScene Understanding	CodeCode Available	1	5
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images	Aug 6, 2021	Depth EstimationPanoptic Segmentation	CodeCode Available	1	5
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding	Apr 16, 2020	Human Part SegmentationPanoptic Segmentation	CodeCode Available	1	5

Show:10 25 50

← PrevPage 16 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified