Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 1723 papers

Title	Date	Tasks	Status
Visual Traffic Knowledge Graph Generation from Scene Images	Jan 1, 2023	Graph AttentionGraph Generation	—Unverified
Plausible Uncertainties for Human Pose Regression	Jan 1, 2023	Autonomous DrivingPose Estimation	—Unverified
Self-Supervised Object Detection from Egocentric Videos	Jan 1, 2023	Class-agnostic Object DetectionObject	—Unverified
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs	Jan 1, 2023	Scene Understanding	—Unverified
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation	Jan 1, 2023	Scene UnderstandingSegmentation	—Unverified
RealGraph: A Multiview Dataset for 4D Real-world Context Graph Generation	Jan 1, 2023	Graph GenerationScene Understanding	—Unverified
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification	Dec 31, 2022	Scene ClassificationScene Recognition	—Unverified
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding	Dec 22, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	CodeCode Available
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency	Dec 20, 2022	object-detectionObject Detection	—Unverified
METEOR Guided Divergence for Video Captioning	Dec 20, 2022	Hierarchical Reinforcement LearningScene Understanding	CodeCode Available
Lightweight integration of 3D features to improve 2D image segmentation	Dec 16, 2022	Image SegmentationScene Understanding	CodeCode Available
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation	Dec 13, 2022	3D Semantic SegmentationScene Understanding	—Unverified
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding	Dec 9, 2022	Autonomous DrivingDepth Estimation	—Unverified
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data	Dec 7, 2022	Scene UnderstandingSegmentation	—Unverified
Framework for 2D Ad placements in LinearTV	Dec 5, 2022	Occlusion HandlingScene Understanding	—Unverified
3D Object Aided Self-Supervised Monocular Depth Estimation	Dec 4, 2022	3D Object DetectionAutonomous Driving	—Unverified
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding	Dec 4, 2022	6D Pose Estimation using RGBObject	—Unverified
Prediction of Scene Plausibility	Dec 2, 2022	PredictionScene Understanding	—Unverified
SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Nov 30, 2022	Graph GenerationImage Generation	CodeCode Available
Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene Understanding	Nov 28, 2022	Contrastive LearningDecision Making	CodeCode Available
Learning 3D Scene Priors with 2D Supervision	Nov 25, 2022	DecoderScene Understanding	—Unverified
PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples	Nov 22, 2022	Adversarial AttackPoint Cloud Classification	—Unverified
Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse Weather	Nov 21, 2022	Autonomous DrivingGPU	CodeCode Available
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors	Nov 21, 2022	ObjectPose Estimation	—Unverified
Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond Aberrations	Nov 21, 2022	Domain AdaptationScene Understanding	CodeCode Available
An Enhanced Object Detection Model for Scene Graph Generation	Nov 18, 2022	Graph GenerationImage Captioning	—Unverified
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection	Nov 15, 2022	Autonomous DrivingGPU	—Unverified
FlowGrad: Using Motion for Visual Sound Source Localization	Nov 15, 2022	Optical Flow EstimationScene Understanding	CodeCode Available
Visually Grounded VQA by Lattice-based Retrieval	Nov 15, 2022	Information RetrievalQuestion Answering	CodeCode Available
User Identification: A Key Enabler for Multi-User Vision-Aided Communications	Oct 27, 2022	Scene UnderstandingUser Identification	—Unverified
Visual Semantic Parsing: From Images to Abstract Meaning Representation	Oct 26, 2022	Abstract Meaning RepresentationScene Understanding	—Unverified
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation	Oct 18, 2022	3D Semantic SegmentationScene Understanding	—Unverified
Novel 3D Scene Understanding Applications From Recurrence in a Single Image	Oct 14, 2022	Scene UnderstandingTranslation	—Unverified
Segmentation-guided Domain Adaptation for Efficient Depth Completion	Oct 14, 2022	Depth CompletionDomain Adaptation	—Unverified
EarthNets: Empowering AI in Earth Observation	Oct 10, 2022	Deep LearningEarth Observation	—Unverified
Uncertainty-aware LiDAR Panoptic Segmentation	Oct 10, 2022	Autonomous DrivingPanoptic Segmentation	CodeCode Available
Flow-based GAN for 3D Point Cloud Generation from a Single Image	Oct 8, 2022	Point Cloud GenerationScene Understanding	CodeCode Available
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Oct 6, 2022	Scene Understanding	—Unverified
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation	Oct 2, 2022	Scene UnderstandingSegmentation	CodeCode Available
A Survey on Knowledge Graph-based Methods for Automated Driving	Sep 30, 2022	Knowledge Graph EmbeddingsKnowledge Graphs	—Unverified
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents	Sep 27, 2022	3D Object DetectionAutonomous Driving	—Unverified
Stochastic Future Prediction in Real World Driving Scenarios	Sep 21, 2022	Autonomous DrivingFuture prediction	—Unverified
A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Sep 12, 2022	Scene Understanding	—Unverified
Sequential Cross Attention Based Multi-task Learning	Sep 6, 2022	Multi-Task LearningScene Understanding	CodeCode Available
Neuromorphic Visual Scene Understanding with Resonator Networks	Aug 26, 2022	Scene UnderstandingTranslation	—Unverified
Learning in Audio-visual Context: A Review, Analysis, and New Perspective	Aug 20, 2022	audio-visual learningScene Understanding	—Unverified
Safety Assessment for Autonomous Systems' Perception Capabilities	Aug 17, 2022	Decision MakingScene Understanding	—Unverified
Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point Supervision	Aug 10, 2022	3D Instance SegmentationInstance Segmentation	CodeCode Available
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy	Aug 3, 2022	Anatomymotion prediction	—Unverified
CompNVS: Novel View Synthesis with Scene Completion	Jul 23, 2022	Novel View SynthesisScene Understanding	—Unverified

Show:10 25 50

← PrevPage 23 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified