Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 1723 papers

Title	Date	Tasks	Status	Hype
Towards Scene Understanding for Autonomous Operations on Airport Aprons	Dec 4, 2022	Autonomous DrivingBenchmarking	CodeCode Available	1
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding	Dec 4, 2022	6D Pose Estimation using RGBObject	—Unverified	0
3D Object Aided Self-Supervised Monocular Depth Estimation	Dec 4, 2022	3D Object DetectionAutonomous Driving	—Unverified	0
Prediction of Scene Plausibility	Dec 2, 2022	PredictionScene Understanding	—Unverified	0
SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Nov 30, 2022	Graph GenerationImage Generation	CodeCode Available	0
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding	Nov 29, 2022	3D Open-Vocabulary Instance SegmentationContrastive Learning	CodeCode Available	2
Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene Understanding	Nov 28, 2022	Contrastive LearningDecision Making	CodeCode Available	0
OpenScene: 3D Scene Understanding with Open Vocabularies	Nov 28, 2022	3D Open-Vocabulary Instance Segmentation3D Semantic Segmentation	CodeCode Available	2
Learning 3D Scene Priors with 2D Supervision	Nov 25, 2022	DecoderScene Understanding	—Unverified	0
Language-Assisted 3D Feature Learning for Semantic Scene Understanding	Nov 25, 2022	DescriptiveInstance Segmentation	CodeCode Available	1
PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples	Nov 22, 2022	Adversarial AttackPoint Cloud Classification	—Unverified	0
Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse Weather	Nov 21, 2022	Autonomous DrivingGPU	CodeCode Available	0
Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond Aberrations	Nov 21, 2022	Domain AdaptationScene Understanding	CodeCode Available	0
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors	Nov 21, 2022	ObjectPose Estimation	—Unverified	0
An Enhanced Object Detection Model for Scene Graph Generation	Nov 18, 2022	Graph GenerationImage Captioning	—Unverified	0
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection	Nov 17, 2022	3D Object DetectionDepth Estimation	CodeCode Available	1
FlowGrad: Using Motion for Visual Sound Source Localization	Nov 15, 2022	Optical Flow EstimationScene Understanding	CodeCode Available	0
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection	Nov 15, 2022	Autonomous DrivingGPU	—Unverified	0
Visually Grounded VQA by Lattice-based Retrieval	Nov 15, 2022	Information RetrievalQuestion Answering	CodeCode Available	0
User Identification: A Key Enabler for Multi-User Vision-Aided Communications	Oct 27, 2022	Scene UnderstandingUser Identification	—Unverified	0
RGB-T Semantic Segmentation with Location, Activation, and Sharpening	Oct 26, 2022	DecoderScene Understanding	CodeCode Available	1
Visual Semantic Parsing: From Images to Abstract Meaning Representation	Oct 26, 2022	Abstract Meaning RepresentationScene Understanding	—Unverified	0
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data	Oct 25, 2022	Autonomous DrivingGPU	CodeCode Available	1
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models	Oct 18, 2022	image-classificationImage Classification	CodeCode Available	1
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation	Oct 18, 2022	3D Semantic SegmentationScene Understanding	—Unverified	0
Segmentation-guided Domain Adaptation for Efficient Depth Completion	Oct 14, 2022	Depth CompletionDomain Adaptation	—Unverified	0
Novel 3D Scene Understanding Applications From Recurrence in a Single Image	Oct 14, 2022	Scene UnderstandingTranslation	—Unverified	0
SQA3D: Situated Question Answering in 3D Scenes	Oct 14, 2022	Question AnsweringReferring Expression	CodeCode Available	1
EarthNets: Empowering AI in Earth Observation	Oct 10, 2022	Deep LearningEarth Observation	—Unverified	0
Uncertainty-aware LiDAR Panoptic Segmentation	Oct 10, 2022	Autonomous DrivingPanoptic Segmentation	CodeCode Available	0
Flow-based GAN for 3D Point Cloud Generation from a Single Image	Oct 8, 2022	Point Cloud GenerationScene Understanding	CodeCode Available	0
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Oct 6, 2022	Scene Understanding	—Unverified	0
Image Masking for Robust Self-Supervised Monocular Depth Estimation	Oct 5, 2022	Autonomous DrivingDepth Estimation	CodeCode Available	1
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions	Oct 4, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available	1
Uncertainty-Driven Active Vision for Implicit Scene Reconstruction	Oct 3, 2022	Scene Understanding	CodeCode Available	1
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation	Oct 2, 2022	Scene UnderstandingSegmentation	CodeCode Available	0
A Survey on Knowledge Graph-based Methods for Automated Driving	Sep 30, 2022	Knowledge Graph EmbeddingsKnowledge Graphs	—Unverified	0
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents	Sep 27, 2022	3D Object DetectionAutonomous Driving	—Unverified	0
Stochastic Future Prediction in Real World Driving Scenarios	Sep 21, 2022	Autonomous DrivingFuture prediction	—Unverified	0
Dynamic Graph Message Passing Networks for Visual Recognition	Sep 20, 2022	image-classificationImage Classification	CodeCode Available	1
A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding	Sep 12, 2022	Scene Understanding	—Unverified	0
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge	Sep 12, 2022	Instance SegmentationObject Detection	CodeCode Available	1
Leveraging Large (Visual) Language Models for Robot 3D Scene Understanding	Sep 12, 2022	Common Sense ReasoningScene Classification	CodeCode Available	1
MassMIND: Massachusetts Maritime INfrared Dataset	Sep 9, 2022	Instance SegmentationScene Understanding	CodeCode Available	1
Sequential Cross Attention Based Multi-task Learning	Sep 6, 2022	Multi-Task LearningScene Understanding	CodeCode Available	0
SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion	Sep 1, 2022	Depth CompletionScene Understanding	CodeCode Available	1
Neuromorphic Visual Scene Understanding with Resonator Networks	Aug 26, 2022	Scene UnderstandingTranslation	—Unverified	0
Learning in Audio-visual Context: A Review, Analysis, and New Perspective	Aug 20, 2022	audio-visual learningScene Understanding	—Unverified	0
Safety Assessment for Autonomous Systems' Perception Capabilities	Aug 17, 2022	Decision MakingScene Understanding	—Unverified	0
Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point Supervision	Aug 10, 2022	3D Instance SegmentationInstance Segmentation	CodeCode Available	0

Show:10 25 50

← PrevPage 19 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified