Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers


Showing 1176–1200 of 1723 papers

Title	Date	Tasks	Status
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery	Apr 27, 2022	object-detection, Object Detection	Unverified
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image Retrieval, Retrieval	Unverified
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object Detection, Graph structure learning	Unverified
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?	Apr 23, 2022	Robot Manipulation, Scene Understanding	Unverified
SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints	Apr 20, 2022	Autonomous Driving, Scene Understanding	Unverified
Attention Mechanism based Cognition-level Scene Understanding	Apr 17, 2022	Question Answering, Scene Understanding	Unverified
MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding	Apr 5, 2022	Autonomous Vehicles, Scene Understanding	Unverified
Multi-Task Learning for Visual Scene Understanding	Mar 28, 2022	Multi-Task Learning, Scene Understanding	Unverified
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification	Mar 25, 2022	Retrieval, Scene Understanding	Unverified
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering	Mar 24, 2022	Optical Character Recognition, Optical Character Recognition (OCR)	Unverified
Self-Supervised Road Layout Parsing with Graph Auto-Encoding	Mar 21, 2022	Image Reconstruction, Scene Understanding	Code Available
Towards 3D Scene Understanding by Referring Synthetic Models	Mar 20, 2022	Scene Understanding, Transfer Learning	Unverified
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows	Mar 20, 2022	Human-Object Interaction Detection, Object	Unverified
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans	Mar 17, 2022	3D Object Recognition, global-optimization	Unverified
Deep Point Cloud Simplification for High-quality Surface Reconstruction	Mar 17, 2022	Scene Understanding, Surface Reconstruction	Unverified
RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry	Mar 14, 2022	Monocular Visual Odometry, Motion Estimation	Unverified
On Steering Multi-Annotations per Sample for Multi-Task Learning	Mar 6, 2022	Instance Segmentation, Multi-Task Learning	Unverified
Fast Neural Architecture Search for Lightweight Dense Prediction Networks	Mar 3, 2022	Depth Estimation, Image Super-Resolution	Unverified
Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection	Mar 2, 2022	Content-Based Image Retrieval, Deep Learning	Unverified
Movies2Scenes: Using Movie Metadata to Learn Scene Representation	Feb 22, 2022	Contrastive Learning, Scene Understanding	Unverified
CARL-D: A vision benchmark suite and large scale dataset for vehicle detection and scene segmentation	Feb 17, 2022	2D Object Detection, Autonomous Driving	Code Available
From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection	Feb 15, 2022	Generalized Zero-Shot Object Detection, Scene Understanding	Code Available
Catch Me if You Can: A Novel Task for Detection of Covert Geo-Locations (CGL)	Feb 5, 2022	object-detection, Object Detection	Unverified
StandardSim: A Synthetic Dataset For Retail Environments	Feb 4, 2022	Change Detection, Depth Estimation	Unverified
Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation	Feb 2, 2022	PointGoal Navigation, Scene Understanding	Code Available



Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified