Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 1723 papers

Title	Date	Tasks	Status	Hype	Score
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments	Jul 10, 2022	Instance SegmentationPanoptic Segmentation	CodeCode Available	1	5
IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving	Jun 1, 2020	3D Object DetectionAutonomous Driving	CodeCode Available	1	5
Lane Graph Estimation for Scene Understanding in Urban Driving	May 1, 2021	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1	5
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	DeblurringDeep Learning	CodeCode Available	1	5
AeroRIT: A New Scene for Hyperspectral Image Analysis	Dec 17, 2019	Hyperspectral image analysisImage Super-Resolution	CodeCode Available	1	5
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding	Nov 9, 2015	Decision MakingDecoder	CodeCode Available	1	5
ODAM: Object Detection, Association, and Mapping using Posed RGB Video	Aug 23, 2021	3D Object DetectionGraph Neural Network	CodeCode Available	1	5
ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation	Apr 16, 2024	3D Semantic SegmentationManagement	CodeCode Available	1	5
Image Masking for Robust Self-Supervised Monocular Depth Estimation	Oct 5, 2022	Autonomous DrivingDepth Estimation	CodeCode Available	1	5
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization	Aug 24, 2021	DiversityGraph Neural Network	CodeCode Available	1	5
Behind the Curtain: Learning Occluded Shapes for 3D Object Detection	Dec 4, 2021	3D Object DetectionObject	CodeCode Available	1	5
Image Segmentation Using Deep Learning: A Survey	Jan 15, 2020	DecoderDeep Learning	CodeCode Available	1	5
Estimating Generic 3D Room Structures from 2D Annotations	Jun 15, 2023	Scene Understanding	CodeCode Available	1	5
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic SegmentationContrastive Learning	CodeCode Available	1	5
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects	Mar 27, 2018	General ClassificationObject	CodeCode Available	1	5
RGB-D Indiscernible Object Counting in Underwater Scenes	Apr 23, 2023	BenchmarkingDepth Estimation	CodeCode Available	1	5
Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation	Mar 2, 2022	Domain AdaptationScene Understanding	CodeCode Available	1	5
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks	Aug 17, 2021	3D Instance SegmentationInstance Segmentation	CodeCode Available	1	5
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation	Dec 22, 2021	Common Sense ReasoningQuestion Answering	CodeCode Available	1	5
ReorientBot: Learning Object Reorientation for Specific-Posed Placement	Feb 22, 2022	Motion GenerationMotion Planning	CodeCode Available	1	5
Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy	Feb 15, 2025	Point Cloud RegistrationScene Understanding	CodeCode Available	1	5
Beyond Appearances: Material Segmentation with Embedded Spectral Information from RGB-D imagery	May 17, 2024	Material ClassificationMaterial Recognition	CodeCode Available	1	5
Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration	Dec 17, 2024	audio-visual event localizationaudio-visual learning	CodeCode Available	1	5
Instance-wise Occlusion and Depth Orders in Natural Scenes	Nov 29, 2021	Depth EstimationDepth Prediction	CodeCode Available	1	5
OFFSEG: A Semantic Segmentation Framework For Off-Road Driving	Mar 23, 2021	Scene UnderstandingSegmentation	CodeCode Available	1	5
Online 3D reconstruction and dense tracking in endoscopic videos	Sep 9, 2024	3D Reconstruction3D Scene Reconstruction	CodeCode Available	1	5
Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation	Jul 23, 2021	Domain AdaptationFew-Shot Learning	CodeCode Available	1	5
Dual-Hybrid Attention Network for Specular Highlight Removal	Jul 17, 2024	highlight removalObject Recognition	CodeCode Available	1	5
OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation	Jul 28, 2023	Autonomous DrivingScene Understanding	CodeCode Available	1	5
Joint 2D-3D-Semantic Data for Indoor Scene Understanding	Feb 3, 2017	Scene Understanding	CodeCode Available	1	5
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D	Sep 28, 2021	Multiple Object TrackingNovel View Synthesis	CodeCode Available	1	5
Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments	Dec 8, 2020	Camera RelocalizationRobot Navigation	CodeCode Available	1	5
DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction	May 9, 2024	Contrastive LearningScene Understanding	CodeCode Available	1	5
Dynamic Graph Message Passing Networks	Aug 19, 2019	Image Classificationobject-detection	CodeCode Available	1	5
NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models	Mar 17, 2025	Question AnsweringScene Understanding	CodeCode Available	1	5
DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization	Apr 30, 2023	DecoderNeRF	CodeCode Available	1	5
Bidirectional Projection Network for Cross Dimension Scene Understanding	Mar 26, 2021	2D Semantic Segmentation3D Semantic Segmentation	CodeCode Available	1	5
Learning and Reasoning with the Graph Structure Representation in Robotic Surgery	Jul 7, 2020	Edge ClassificationGraph Generation	CodeCode Available	1	5
Learning How To Robustly Estimate Camera Pose in Endoscopic Videos	Apr 17, 2023	3D ReconstructionCamera Pose Estimation	CodeCode Available	1	5
Learning Human-Object Interaction Detection using Interaction Points	Mar 31, 2020	Human-Object Interaction DetectionKeypoint Detection	CodeCode Available	1	5
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment	Nov 5, 2023	Caption GenerationCommon Sense Reasoning	CodeCode Available	1	5
NODIS: Neural Ordinary Differential Scene Understanding	Jan 14, 2020	AllGraph Generation	CodeCode Available	1	5
LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data	Sep 19, 2023	Anomaly DetectionAutonomous Driving	CodeCode Available	1	5
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection	Dec 19, 2022	3D Object DetectionKnowledge Distillation	CodeCode Available	1	5
Dynamic Graph Message Passing Networks for Visual Recognition	Sep 20, 2022	image-classificationImage Classification	CodeCode Available	1	5
Digging Into Self-Supervised Monocular Depth Estimation	Jun 4, 2018	Camera Pose EstimationDepth Estimation	CodeCode Available	1	5
DPF: Learning Dense Prediction Fields with Weak Supervision	Mar 29, 2023	Intrinsic Image DecompositionPrediction	CodeCode Available	1	5
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Jul 15, 2024	AllImage Retrieval	CodeCode Available	1	5
Object Pose Estimation via the Aggregation of Diffusion Features	Mar 27, 2024	Pose EstimationScene Understanding	CodeCode Available	1	5
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding	Apr 16, 2020	Human Part SegmentationPanoptic Segmentation	CodeCode Available	1	5

Show:10 25 50

← PrevPage 8 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified