Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 1723 papers

Title	Date	Tasks	Status
Joint Modeling of Visual Objects and Relations for Scene Graph Generation	Dec 1, 2021	Graph EmbeddingGraph Generation	—Unverified
Joint Optical Flow and Temporally Consistent Semantic Segmentation	Jul 26, 2016	Motion EstimationOptical Flow Estimation	—Unverified
Joint prototype and coefficient prediction for 3D instance segmentation	Jul 9, 2024	3D Instance SegmentationInstance Segmentation	—Unverified
Joint Semantic and Motion Segmentation for dynamic scenes using Deep Convolutional Networks	Apr 18, 2017	Motion SegmentationOptical Flow Estimation	—Unverified
Joint SFM and Detection Cues for Monocular 3D Localization in Road Scenes	Jun 1, 2015	Autonomous DrivingMotion Segmentation	—Unverified
JUMPS: Joints Upsampling Method for Pose Sequences	Jul 2, 2020	Action RecognitionPose Estimation	—Unverified
Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding	May 29, 2024	Scene UnderstandingSegmentation	—Unverified
Knowledge Distillation for Incremental Learning in Semantic Segmentation	Nov 8, 2019	image-classificationImage Classification	—Unverified
Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation	Mar 10, 2020	Domain AdaptationScene Understanding	—Unverified
Label-Efficient LiDAR Panoptic Segmentation	Mar 4, 2025	Instance SegmentationPanoptic Segmentation	—Unverified
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Dec 23, 2024	3D Semantic SegmentationScene Understanding	—Unverified
Language-Assisted 3D Scene Understanding	Dec 18, 2023	3D Object Detection3D Semantic Segmentation	—Unverified
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding	Sep 26, 2023	Scene UnderstandingSimultaneous Localization and Mapping	—Unverified
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Jan 7, 2025	Autonomous DrivingContrastive Learning	—Unverified
Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges	Oct 20, 2024	Autonomous DrivingDecision Making	—Unverified
Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm	Nov 16, 2024	Autonomous VehiclesDecision Making	—Unverified
Large Margin Learning of Upstream Scene Understanding Models	Dec 1, 2010	General ClassificationPrediction	—Unverified
LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning	Jun 29, 2016	General ClassificationPedestrian Detection	—Unverified
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment	Jun 17, 2025	Autonomous DrivingInstance Segmentation	—Unverified
Leaky Wave Antenna-Equipped RF Chipless Tags for Orientation Estimation	Aug 31, 2024	Scene UnderstandingTAG	—Unverified
Learning 3D Robotics Perception using Inductive Priors	May 30, 2024	3D ReconstructionImage Generation	—Unverified
Learning 3D Scene Priors with 2D Supervision	Nov 25, 2022	DecoderScene Understanding	—Unverified
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions	Apr 8, 2020	3d scene graph generation3D Semantic Segmentation	—Unverified
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation	Jan 6, 2020	3D Instance SegmentationInstance Segmentation	—Unverified
Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey	Mar 17, 2025	3D ReconstructionAutonomous Driving	—Unverified
Learning-based Relational Object Matching Across Views	May 3, 2023	Graph Neural NetworkImage Retrieval	—Unverified
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation	Sep 28, 2020	Instance SegmentationPanoptic Segmentation	—Unverified
Learning Densities in Feature Space for Reliable Segmentation of Indoor Scenes	Aug 1, 2019	Scene UnderstandingSemantic Segmentation	—Unverified
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length	Mar 27, 2018	Depth EstimationNetwork Embedding	—Unverified
Learning Direct Optimization for Scene Understanding	Dec 18, 2018	Scene Understanding	—Unverified
Learning from Maps: Visual Common Sense for Autonomous Driving	Nov 25, 2016	Autonomous DrivingAutonomous Vehicles	—Unverified
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation	Jan 26, 2023	FairnessLIDAR Semantic Segmentation	—Unverified
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs	Jan 1, 2023	Scene Understanding	—Unverified
Learning in Audio-visual Context: A Review, Analysis, and New Perspective	Aug 20, 2022	audio-visual learningScene Understanding	—Unverified
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes	Jan 1, 2024	Instance SegmentationMotion Estimation	—Unverified
SceneGPT: A Language Model for 3D Scene Understanding	Aug 13, 2024	In-Context LearningLanguage Modeling	—Unverified
Scene Graph Generation: A Comprehensive Survey	Jan 3, 2022	Graph Generationobject-detection	—Unverified
A Comprehensive Survey of Scene Graphs: Generation and Application	Mar 17, 2021	Image CaptioningQuestion Answering	—Unverified
Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding	Oct 14, 2014	Binary ClassificationDecision Making	—Unverified
Scene-Independent Group Profiling in Crowd	Jun 1, 2014	Scene Understanding	—Unverified
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?	Oct 1, 2017	16kCamera Pose Estimation	—Unverified
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations	Jun 21, 2025	Question AnsweringScene Understanding	—Unverified
Scene recognition based on DNN and game theory with its applications in human-robot interaction	Dec 3, 2019	Image RegistrationScene Recognition	—Unverified
SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting	Jun 10, 2025	3DGSScene Understanding	—Unverified
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames	Nov 28, 2023	ClusteringDiversity	—Unverified
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments	Nov 28, 2024	Adversarial TextScene Understanding	—Unverified
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate	Dec 26, 2020	Scene Text DetectionScene Understanding	—Unverified
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image RetrievalRetrieval	—Unverified
Scene Understanding Enabled Semantic Communication with Open Channel Coding	Jan 24, 2025	Question AnsweringScene Understanding	—Unverified
Scene Understanding for Autonomous Manipulation with Deep Learning	Mar 23, 2019	Action UnderstandingAffordance Detection	—Unverified

Show:10 25 50

← PrevPage 23 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified