Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1151–1200 of 1723 papers

Title	Date	Tasks	Status
Neural Groundplans: Persistent Neural Scene Representations from a Single Image	Jul 22, 2022	DisentanglementInstance Segmentation	—Unverified
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Jul 19, 2022	Image RetrievalRetrieval	—Unverified
Adversarial Attacks on Monocular Pose Estimation	Jul 14, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available
BlindSpotNet: Seeing Where We Cannot See	Jul 8, 2022	Depth EstimationMonocular Depth Estimation	—Unverified
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction DetectionObject	CodeCode Available
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases	Jul 5, 2022	ObjectRepresentation Learning	—Unverified
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation	Jul 5, 2022	Dialogue GenerationDialogue Understanding	—Unverified
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings	Jun 24, 2022	Scene UnderstandingSemantic Segmentation	CodeCode Available
SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding	Jun 21, 2022	ClusteringObject Discovery	CodeCode Available
A Dynamic Data Driven Approach for Explainable Scene Understanding	Jun 18, 2022	Autonomous DrivingScene Understanding	—Unverified
On Efficient Real-Time Semantic Segmentation: A Survey	Jun 17, 2022	GPUobject-detection	—Unverified
Waymo Open Dataset: Panoramic Video Panoptic Segmentation	Jun 15, 2022	3D Multi-Object TrackingAutonomous Driving	—Unverified
A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth	Jun 13, 2022	Objectobject-detection	—Unverified
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	Jun 10, 2022	Autonomous DrivingDomain Adaptation	CodeCode Available
Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding	Jun 9, 2022	Common Sense ReasoningScene Understanding	—Unverified
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Jun 9, 2022	Data AugmentationEdge Detection	—Unverified
Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans	Jun 6, 2022	Scene Understanding	—Unverified
A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX	Jun 5, 2022	Scene Understanding	—Unverified
Towards Improving the Generation Quality of Autoregressive Slot VAEs	Jun 3, 2022	Image GenerationObject	CodeCode Available
SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment	Jun 1, 2022	Motion PlanningQuestion Answering	—Unverified
Facing the Void: Overcoming Missing Data in Multi-View Imagery	May 21, 2022	Classificationimage-classification	CodeCode Available
Review on Panoramic Imaging and Its Applications in Scene Understanding	May 11, 2022	Autonomous DrivingDepth Estimation	—Unverified
Unsupervised Discovery and Composition of Object Light Fields	May 8, 2022	Novel View SynthesisObject	—Unverified
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects	May 5, 2022	Data AugmentationNeural Rendering	—Unverified
RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds	May 2, 2022	Autonomous DrivingDecoder	—Unverified
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery	Apr 27, 2022	object-detectionObject Detection	—Unverified
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image RetrievalRetrieval	—Unverified
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object DetectionGraph structure learning	—Unverified
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?	Apr 23, 2022	Robot ManipulationScene Understanding	—Unverified
SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints	Apr 20, 2022	Autonomous DrivingScene Understanding	—Unverified
Attention Mechanism based Cognition-level Scene Understanding	Apr 17, 2022	Question AnsweringScene Understanding	—Unverified
MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding	Apr 5, 2022	Autonomous VehiclesScene Understanding	—Unverified
Multi-Task Learning for Visual Scene Understanding	Mar 28, 2022	Multi-Task LearningScene Understanding	—Unverified
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification	Mar 25, 2022	RetrievalScene Understanding	—Unverified
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering	Mar 24, 2022	Optical Character RecognitionOptical Character Recognition (OCR)	—Unverified
Self-Supervised Road Layout Parsing with Graph Auto-Encoding	Mar 21, 2022	Image ReconstructionScene Understanding	CodeCode Available
Towards 3D Scene Understanding by Referring Synthetic Models	Mar 20, 2022	Scene UnderstandingTransfer Learning	—Unverified
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows	Mar 20, 2022	Human-Object Interaction DetectionObject	—Unverified
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans	Mar 17, 2022	3D Object Recognitionglobal-optimization	—Unverified
Deep Point Cloud Simplification for High-quality Surface Reconstruction	Mar 17, 2022	Scene UnderstandingSurface Reconstruction	—Unverified
RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry	Mar 14, 2022	Monocular Visual OdometryMotion Estimation	—Unverified
On Steering Multi-Annotations per Sample for Multi-Task Learning	Mar 6, 2022	Instance SegmentationMulti-Task Learning	—Unverified
Fast Neural Architecture Search for Lightweight Dense Prediction Networks	Mar 3, 2022	Depth EstimationImage Super-Resolution	—Unverified
Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection	Mar 2, 2022	Content-Based Image RetrievalDeep Learning	—Unverified
Movies2Scenes: Using Movie Metadata to Learn Scene Representation	Feb 22, 2022	Contrastive LearningScene Understanding	—Unverified
CARL-D: A vision benchmark suite and large scale dataset for vehicle detection and scene segmentation	Feb 17, 2022	2D Object DetectionAutonomous Driving	CodeCode Available
From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection	Feb 15, 2022	Generalized Zero-Shot Object DetectionScene Understanding	CodeCode Available
Catch Me if You Can: A Novel Task for Detection of Covert Geo-Locations (CGL)	Feb 5, 2022	object-detectionObject Detection	—Unverified
StandardSim: A Synthetic Dataset For Retail Environments	Feb 4, 2022	Change DetectionDepth Estimation	—Unverified
Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation	Feb 2, 2022	PointGoal NavigationScene Understanding	CodeCode Available

Show:10 25 50

← PrevPage 24 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified