Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 851–900 of 1723 papers

Title	Date	Tasks	Status	Hype
Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics	Mar 8, 2023	Autonomous VehiclesScene Understanding	—Unverified	0
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP	Mar 8, 2023	Scene UnderstandingSemantic Segmentation	—Unverified	0
Traffic Scene Parsing through the TSP6K Dataset	Mar 6, 2023	Autonomous DrivingDecoder	CodeCode Available	1
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning	Mar 5, 2023	Answer GenerationEntity Alignment	CodeCode Available	0
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs	Mar 3, 2023	Depth-aware Video Panoptic SegmentationPanoptic Segmentation	—Unverified	0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction DetectionKnowledge Distillation	—Unverified	0
APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation	Mar 2, 2023	Autonomous DrivingAutonomous Navigation	—Unverified	0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors	Feb 28, 2023	Contrastive LearningInstance Segmentation	—Unverified	0
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information	Feb 25, 2023	DecoderImage Segmentation	—Unverified	0
Open Challenges for Monocular Single-shot 6D Object Pose Estimation	Feb 23, 2023	6D Pose Estimation using RGBObject	—Unverified	0
CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images	Feb 22, 2023	Knowledge DistillationScene Understanding	CodeCode Available	1
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	DeblurringDeep Learning	CodeCode Available	1
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection	Feb 13, 2023	3D Object DetectionGraph Generation	—Unverified	0
3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation	Feb 7, 2023	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
Object-Centric Scene Representations using Active Inference	Feb 7, 2023	ObjectScene Understanding	—Unverified	0
Structured Generative Models for Scene Understanding	Feb 7, 2023	Scene Understanding	—Unverified	0
A Flexible Framework for Virtual Omnidirectional Vision to Improve Operator Situation Awareness	Feb 1, 2023	Scene Understanding	—Unverified	0
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis	Jan 30, 2023	Image GenerationScene Understanding	CodeCode Available	2
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation	Jan 26, 2023	FairnessLIDAR Semantic Segmentation	—Unverified	0
OvarNet: Towards Open-vocabulary Object Attribute Recognition	Jan 23, 2023	AttributeKnowledge Distillation	CodeCode Available	1
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection	Jan 22, 2023	3D Object DetectionAutonomous Vehicles	CodeCode Available	1
Model-based inexact graph matching on top of CNNs for semantic scene understanding	Jan 18, 2023	Brain SegmentationDeep Learning	CodeCode Available	0
Long Range Pooling for 3D Large-Scale Scene Understanding	Jan 17, 2023	Scene Understanding	—Unverified	0
Diffusion-based Generation, Optimization, and Planning in 3D Scenes	Jan 15, 2023	DenoisingGrasp Generation	CodeCode Available	2
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified	0
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic SegmentationContrastive Learning	CodeCode Available	1
Neural Radiance Field Codebooks	Jan 10, 2023	ObjectRepresentation Learning	CodeCode Available	0
Plausible Uncertainties for Human Pose Regression	Jan 1, 2023	Autonomous DrivingPose Estimation	—Unverified	0
Visual Traffic Knowledge Graph Generation from Scene Images	Jan 1, 2023	Graph AttentionGraph Generation	—Unverified	0
RealGraph: A Multiview Dataset for 4D Real-world Context Graph Generation	Jan 1, 2023	Graph GenerationScene Understanding	—Unverified	0
Self-Supervised Object Detection from Egocentric Videos	Jan 1, 2023	Class-agnostic Object DetectionObject	—Unverified	0
Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction	Jan 1, 2023	3D Scene ReconstructionImage Segmentation	CodeCode Available	1
Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding	Jan 1, 2023	Autonomous Vehiclesobject-detection	—Unverified	0
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs	Jan 1, 2023	Scene Understanding	—Unverified	0
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation	Jan 1, 2023	Scene UnderstandingSegmentation	—Unverified	0
PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation	Jan 1, 2023	ObjectScene Understanding	CodeCode Available	1
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification	Dec 31, 2022	Scene ClassificationScene Recognition	—Unverified	0
PointVST: Self-Supervised Pre-training for 3D Point Clouds via View-Specific Point-to-Image Translation	Dec 29, 2022	Contrastive LearningImage Generation	CodeCode Available	1
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding	Dec 22, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	CodeCode Available	0
METEOR Guided Divergence for Video Captioning	Dec 20, 2022	Hierarchical Reinforcement LearningScene Understanding	CodeCode Available	0
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency	Dec 20, 2022	object-detectionObject Detection	—Unverified	0
Panoptic Lifting for 3D Scene Understanding with Neural Fields	Dec 19, 2022	2D Panoptic SegmentationPanoptic Segmentation	CodeCode Available	2
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection	Dec 19, 2022	3D Object DetectionKnowledge Distillation	CodeCode Available	1
Lightweight integration of 3D features to improve 2D image segmentation	Dec 16, 2022	Image SegmentationScene Understanding	CodeCode Available	0
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation	Dec 13, 2022	3D Semantic SegmentationScene Understanding	—Unverified	0
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding	Dec 9, 2022	Autonomous DrivingDepth Estimation	—Unverified	0
Towards Holistic Surgical Scene Understanding	Dec 8, 2022	Action RecognitionAtomic action recognition	CodeCode Available	1
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving	Dec 7, 2022	Autonomous DrivingInstance Segmentation	CodeCode Available	1
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data	Dec 7, 2022	Scene UnderstandingSegmentation	—Unverified	0
Framework for 2D Ad placements in LinearTV	Dec 5, 2022	Occlusion HandlingScene Understanding	—Unverified	0

Show:10 25 50

← PrevPage 18 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified