Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1100 of 1723 papers

Title	Date	Tasks	Status	Hype
Point Cloud Pre-Training With Natural 3D Structures	Jan 1, 2022	3D Object Detectionobject-detection	CodeCode Available	1
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation	Dec 27, 2021	Computational EfficiencyInstance Segmentation	CodeCode Available	1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation	Dec 24, 2021	Depth EstimationDepth Prediction	CodeCode Available	1
HSPACE: Synthetic Parametric Humans Animated in Complex Environments	Dec 23, 2021	3D Human Pose EstimationScene Understanding	—Unverified	0
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation	Dec 22, 2021	Common Sense ReasoningQuestion Answering	CodeCode Available	1
ScanQA: 3D Question Answering for Spatial Scene Understanding	Dec 20, 2021	3D Question Answering (3D-QA)Object	CodeCode Available	1
Distillation of Human-Object Interaction Contexts for Action Recognition	Dec 17, 2021	Action RecognitionGraph Attention	—Unverified	0
Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation	Dec 16, 2021	Feature ImportanceScene Understanding	CodeCode Available	1
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition	Dec 14, 2021	Human-Object Interaction DetectionScene Understanding	—Unverified	0
Image-to-Height Domain Translation for Synthetic Aperture Sonar	Dec 12, 2021	Generative Adversarial NetworkScene Understanding	—Unverified	0
3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map	Dec 10, 2021	Autonomous VehiclesNavigate	—Unverified	0
Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms	Dec 10, 2021	3D ReconstructionAutonomous Navigation	—Unverified	0
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding	Dec 6, 2021	3D Instance Segmentation3D Semantic Segmentation	—Unverified	0
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation	Dec 5, 2021	Depth-aware Video Panoptic SegmentationDepth Estimation	CodeCode Available	1
Behind the Curtain: Learning Occluded Shapes for 3D Object Detection	Dec 4, 2021	3D Object DetectionObject	CodeCode Available	1
Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans	Dec 1, 2021	4D Panoptic SegmentationAutonomous Navigation	CodeCode Available	0
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding	Dec 1, 2021	DisentanglementDomain Adaptation	—Unverified	0
REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision	Dec 1, 2021	3D Human Reconstruction3D Reconstruction	—Unverified	0
Joint Modeling of Visual Objects and Relations for Scene Graph Generation	Dec 1, 2021	Graph EmbeddingGraph Generation	—Unverified	0
AirObject: A Temporally Evolving Graph Embedding for Object Identification	Nov 30, 2021	Graph AttentionGraph Embedding	CodeCode Available	1
Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding	Nov 30, 2021	Domain AdaptationLanguage Modeling	—Unverified	0
DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes	Nov 30, 2021	FrictionObject	—Unverified	0
Instance-wise Occlusion and Depth Orders in Natural Scenes	Nov 29, 2021	Depth EstimationDepth Prediction	CodeCode Available	1
PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds	Nov 28, 2021	3D Shape Classificationgraph construction	—Unverified	0
Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation	Nov 26, 2021	AllGraph Generation	—Unverified	0
Joint stereo 3D object detection and implicit surface reconstruction	Nov 25, 2021	3D Object DetectionHallucination	CodeCode Available	0
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing	Nov 24, 2021	AttributeScene Understanding	CodeCode Available	1
Panoptic Segmentation Meets Remote Sensing	Nov 23, 2021	Panoptic SegmentationScene Understanding	—Unverified	0
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot	Nov 22, 2021	Scene Understanding	—Unverified	0
Grounded Situation Recognition with Transformers	Nov 19, 2021	DecoderGrounded Situation Recognition	CodeCode Available	1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data	Nov 17, 2021	3D Object Detectionobject-detection	CodeCode Available	1
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion	Nov 16, 2021	3D Semantic SegmentationAutonomous Driving	—Unverified	0
Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views	Nov 13, 2021	ObjectScene Understanding	CodeCode Available	1
Robust deep learning-based semantic organ segmentation in hyperspectral images	Nov 9, 2021	Deep LearningImage Segmentation	—Unverified	0
DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder	Nov 5, 2021	Autonomous VehiclesImage Segmentation	—Unverified	0
When Neural Networks Using Different Sensors Create Similar Features	Nov 4, 2021	Autonomous DrivingClassification	—Unverified	0
Panoptic 3D Scene Reconstruction From a Single RGB Image	Nov 3, 2021	2D Panoptic Segmentation3D Instance Segmentation	CodeCode Available	1
3DP3: 3D Scene Perception via Probabilistic Programming	Oct 30, 2021	ObjectPose Estimation	CodeCode Available	1
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving	Oct 22, 2021	Autonomous Drivingreinforcement-learning	CodeCode Available	1
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety	Oct 22, 2021	Scene Understanding	—Unverified	0
PlaneRecNet: Multi-Task Learning with Cross-Task Consistency for Piece-Wise Plane Detection and Reconstruction from a Single RGB Image	Oct 21, 2021	DecoderDepth Estimation	CodeCode Available	1
Adversarial Scene Reconstruction and Object Detection System for Assisting Autonomous Vehicle	Oct 13, 2021	Deep Learningobject-detection	—Unverified	0
Monocular Depth Estimation with Sharp Boundary	Oct 12, 2021	DecoderDepth Estimation	—Unverified	0
Structured Bird's-Eye-View Traffic Scene Understanding from Onboard Images	Oct 5, 2021	Autonomous NavigationLane Detection	CodeCode Available	1
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation	Sep 30, 2021	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Semantic Dense Reconstruction with Consistent Scene Segments	Sep 30, 2021	3D Scene ReconstructionScene Understanding	—Unverified	0
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation	Sep 29, 2021	Depth EstimationMonocular Depth Estimation	CodeCode Available	0
Referring Self-supervised Learning on 3D Point Cloud	Sep 29, 2021	Scene UnderstandingSelf-Supervised Learning	—Unverified	0
Efficient Point Transformer for Large-scale 3D Scene Understanding	Sep 29, 2021	3D Semantic SegmentationQuantization	—Unverified	0
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D	Sep 28, 2021	Multiple Object TrackingNovel View Synthesis	CodeCode Available	1

Show:10 25 50

← PrevPage 22 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified