Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1100 of 1723 papers

Title	Date	Tasks	Status
Robust Category-Level 3D Pose Estimation from Synthetic Data	May 25, 2023	3D Pose Estimation3D Reconstruction	—Unverified
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments	May 25, 2023	Continual LearningContinual Semantic Segmentation	—Unverified
PanoContext-Former: Panoramic Total Scene Understanding with a Transformer	May 21, 2023	3D Object Detectionobject-detection	—Unverified
Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamics Audio-Visual Scenarios	May 21, 2023	Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA)	CodeCode Available
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding	May 18, 2023	Contrastive LearningObject	—Unverified
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs	May 15, 2023	RelationScene Graph Generation	CodeCode Available
MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning	May 13, 2023	Deep LearningDepth Estimation	—Unverified
Transavs: End-To-End Audio-Visual Segmentation With Transformer	May 12, 2023	Scene UnderstandingSegmentation	—Unverified
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs	May 10, 2023	Scene UnderstandingVisual Reasoning	—Unverified
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding	May 8, 2023	PredictionScene Understanding	—Unverified
Living in a Material World: Learning Material Properties from Full-Waveform Flash Lidar Data for Semantic Segmentation	May 7, 2023	Scene UnderstandingSemantic Segmentation	—Unverified
Learning-based Relational Object Matching Across Views	May 3, 2023	Graph Neural NetworkImage Retrieval	—Unverified
ArK: Augmented Reality with Knowledge Interactive Emergent Ability	May 1, 2023	AI AgentMixed Reality	—Unverified
Neural Implicit Dense Semantic SLAM	Apr 27, 2023	3D geometryScene Understanding	—Unverified
Compositional 3D Human-Object Neural Animation	Apr 27, 2023	Human-Object Interaction DetectionNeRF	—Unverified
ZRG: A Dataset for Multimodal 3D Residential Rooftop Understanding	Apr 26, 2023	Scene Understanding	—Unverified
Factored Neural Representation for Scene Understanding	Apr 21, 2023	Novel View SynthesisObject	—Unverified
360^ High-Resolution Depth Estimation via Uncertainty-aware Structural Knowledge Transfer	Apr 17, 2023	Depth EstimationMonocular Depth Estimation	—Unverified
Semantic Segmentation with High Inference Speed in Off-Road Environments	Apr 10, 2023	2D Semantic SegmentationAutonomous Vehicles	CodeCode Available
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation	Apr 10, 2023	Panoptic SegmentationScene Understanding	—Unverified
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding	Apr 4, 2023	Autonomous DrivingDomain Adaptation	CodeCode Available
Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings	Mar 30, 2023	ObjectScene Understanding	—Unverified
OVeNet: Offset Vector Network for Semantic Segmentation	Mar 25, 2023	Optical Character Recognition (OCR)Scene Understanding	CodeCode Available
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation	Mar 25, 2023	Domain AdaptationERP	—Unverified
Uni-Fusion: Universal Continuous Mapping	Mar 22, 2023	Scene Understanding	—Unverified
Semantic segmentation of surgical hyperspectral images under geometric domain shifts	Mar 20, 2023	Organ SegmentationScene Segmentation	—Unverified
Content Adaptive Front End For Audio Classification	Mar 18, 2023	Audio ClassificationAudio Signal Processing	—Unverified
Efficient Computation Sharing for Multi-Task Visual Scene Understanding	Mar 16, 2023	Multi-Task LearningScene Understanding	CodeCode Available
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery	Mar 16, 2023	Scene Understanding	—Unverified
PENet: A Joint Panoptic Edge Detection Network	Mar 15, 2023	Edge DetectionMulti-Task Learning	CodeCode Available
Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast	Mar 11, 2023	3D Semantic SegmentationContrastive Learning	—Unverified
Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics	Mar 8, 2023	Autonomous VehiclesScene Understanding	—Unverified
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP	Mar 8, 2023	Scene UnderstandingSemantic Segmentation	—Unverified
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning	Mar 5, 2023	Answer GenerationEntity Alignment	CodeCode Available
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs	Mar 3, 2023	Depth-aware Video Panoptic SegmentationPanoptic Segmentation	—Unverified
APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation	Mar 2, 2023	Autonomous DrivingAutonomous Navigation	—Unverified
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction DetectionKnowledge Distillation	—Unverified
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors	Feb 28, 2023	Contrastive LearningInstance Segmentation	—Unverified
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information	Feb 25, 2023	DecoderImage Segmentation	—Unverified
Open Challenges for Monocular Single-shot 6D Object Pose Estimation	Feb 23, 2023	6D Pose Estimation using RGBObject	—Unverified
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection	Feb 13, 2023	3D Object DetectionGraph Generation	—Unverified
Structured Generative Models for Scene Understanding	Feb 7, 2023	Scene Understanding	—Unverified
Object-Centric Scene Representations using Active Inference	Feb 7, 2023	ObjectScene Understanding	—Unverified
A Flexible Framework for Virtual Omnidirectional Vision to Improve Operator Situation Awareness	Feb 1, 2023	Scene Understanding	—Unverified
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation	Jan 26, 2023	FairnessLIDAR Semantic Segmentation	—Unverified
Model-based inexact graph matching on top of CNNs for semantic scene understanding	Jan 18, 2023	Brain SegmentationDeep Learning	CodeCode Available
Long Range Pooling for 3D Large-Scale Scene Understanding	Jan 17, 2023	Scene Understanding	—Unverified
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified
Neural Radiance Field Codebooks	Jan 10, 2023	ObjectRepresentation Learning	CodeCode Available
Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding	Jan 1, 2023	Autonomous Vehiclesobject-detection	—Unverified

Show:10 25 50

← PrevPage 22 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified