Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 876–900 of 1723 papers

Title	Date	Tasks	Status	Hype
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic Segmentation, Contrastive Learning	Code Available	1
Neural Radiance Field Codebooks	Jan 10, 2023	Object, Representation Learning	Code Available	0
Plausible Uncertainties for Human Pose Regression	Jan 1, 2023	Autonomous Driving, Pose Estimation	Unverified	0
Visual Traffic Knowledge Graph Generation from Scene Images	Jan 1, 2023	Graph Attention, Graph Generation	Unverified	0
RealGraph: A Multiview Dataset for 4D Real-world Context Graph Generation	Jan 1, 2023	Graph Generation, Scene Understanding	Unverified	0
Self-Supervised Object Detection from Egocentric Videos	Jan 1, 2023	Class-agnostic Object Detection, Object	Unverified	0
Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction	Jan 1, 2023	3D Scene Reconstruction, Image Segmentation	Code Available	1
Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding	Jan 1, 2023	Autonomous Vehicles, object-detection	Unverified	0
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs	Jan 1, 2023	Scene Understanding	Unverified	0
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation	Jan 1, 2023	Scene Understanding, Segmentation	Unverified	0
PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation	Jan 1, 2023	Object, Scene Understanding	Code Available	1
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification	Dec 31, 2022	Scene Classification, Scene Recognition	Unverified	0
PointVST: Self-Supervised Pre-training for 3D Point Clouds via View-Specific Point-to-Image Translation	Dec 29, 2022	Contrastive Learning, Image Generation	Code Available	1
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding	Dec 22, 2022	Multi-Label Classification	Code Available	0
METEOR Guided Divergence for Video Captioning	Dec 20, 2022	Hierarchical Reinforcement Learning, Scene Understanding	Code Available	0
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency	Dec 20, 2022	Object Detection	Unverified	0
Panoptic Lifting for 3D Scene Understanding with Neural Fields	Dec 19, 2022	2D Panoptic Segmentation, Panoptic Segmentation	Code Available	2
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection	Dec 19, 2022	3D Object Detection, Knowledge Distillation	Code Available	1
Lightweight integration of 3D features to improve 2D image segmentation	Dec 16, 2022	Image Segmentation, Scene Understanding	Code Available	0
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation	Dec 13, 2022	3D Semantic Segmentation, Scene Understanding	Unverified	0
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding	Dec 9, 2022	Autonomous Driving, Depth Estimation	Unverified	0
Towards Holistic Surgical Scene Understanding	Dec 8, 2022	Action Recognition, Atomic action recognition	Code Available	1
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving	Dec 7, 2022	Autonomous Driving, Instance Segmentation	Code Available	1
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data	Dec 7, 2022	Scene Understanding, Segmentation	Unverified	0
Framework for 2D Ad placements in LinearTV	Dec 5, 2022	Occlusion Handling, Scene Understanding	Unverified	0

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified