Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 851–875 of 1723 papers

Title	Date	Tasks	Status	Hype
Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics	Mar 8, 2023	Autonomous Vehicles, Scene Understanding	Unverified	0
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP	Mar 8, 2023	Scene Understanding, Semantic Segmentation	Unverified	0
Traffic Scene Parsing through the TSP6K Dataset	Mar 6, 2023	Autonomous Driving, Decoder	Code Available	1
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning	Mar 5, 2023	Answer Generation, Entity Alignment	Code Available	0
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs	Mar 3, 2023	Depth-aware Video Panoptic Segmentation, Panoptic Segmentation	Unverified	0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction Detection, Knowledge Distillation	Unverified	0
APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation	Mar 2, 2023	Autonomous Driving, Autonomous Navigation	Unverified	0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors	Feb 28, 2023	Contrastive Learning, Instance Segmentation	Unverified	0
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information	Feb 25, 2023	Decoder, Image Segmentation	Unverified	0
Open Challenges for Monocular Single-shot 6D Object Pose Estimation	Feb 23, 2023	6D Pose Estimation using RGB, Object	Unverified	0
CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images	Feb 22, 2023	Knowledge Distillation, Scene Understanding	Code Available	1
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	Deblurring, Deep Learning	Code Available	1
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection	Feb 13, 2023	3D Object Detection, Graph Generation	Unverified	0
3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation	Feb 7, 2023	6D Pose Estimation, 6D Pose Estimation using RGB	Code Available	1
Object-Centric Scene Representations using Active Inference	Feb 7, 2023	Object, Scene Understanding	Unverified	0
Structured Generative Models for Scene Understanding	Feb 7, 2023	Scene Understanding	Unverified	0
A Flexible Framework for Virtual Omnidirectional Vision to Improve Operator Situation Awareness	Feb 1, 2023	Scene Understanding	Unverified	0
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis	Jan 30, 2023	Image Generation, Scene Understanding	Code Available	2
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation	Jan 26, 2023	Fairness, LIDAR Semantic Segmentation	Unverified	0
OvarNet: Towards Open-vocabulary Object Attribute Recognition	Jan 23, 2023	Attribute, Knowledge Distillation	Code Available	1
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection	Jan 22, 2023	3D Object Detection, Autonomous Vehicles	Code Available	1
Model-based inexact graph matching on top of CNNs for semantic scene understanding	Jan 18, 2023	Brain Segmentation, Deep Learning	Code Available	0
Long Range Pooling for 3D Large-Scale Scene Understanding	Jan 17, 2023	Scene Understanding	Unverified	0
Diffusion-based Generation, Optimization, and Planning in 3D Scenes	Jan 15, 2023	Denoising, Grasp Generation	Code Available	2
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image Segmentation, Object	Unverified	0


Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified