Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 251–275 of 1723 papers

Title	Date	Tasks	Status	Hype
STRAP: Structured Object Affordance Segmentation with Point Supervision	Apr 17, 2023	Object, Scene Understanding	Code Available	1
Learning How To Robustly Estimate Camera Pose in Endoscopic Videos	Apr 17, 2023	3D Reconstruction, Camera Pose Estimation	Code Available	1
RS2G: Data-Driven Scene-Graph Extraction and Embedding for Robust Autonomous Perception and Scenario Understanding	Apr 17, 2023	Autonomous Vehicles, Graph Learning	Code Available	1
ViPLO: Vision Transformer based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection	Apr 17, 2023	Human-Object Interaction Detection, Quantization	Code Available	1
Complementary Random Masking for RGB-Thermal Semantic Segmentation	Mar 30, 2023	Scene Understanding, Semantic Segmentation	Code Available	1
DPF: Learning Dense Prediction Fields with Weak Supervision	Mar 29, 2023	Intrinsic Image Decomposition, Prediction	Code Available	1
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation	Mar 28, 2023	Panoptic Scene Graph Generation, Scene Graph Generation	Code Available	1
Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments	Mar 27, 2023	Autonomous Navigation, Real-Time Semantic Segmentation	Code Available	1
You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding	Mar 26, 2023	3D Instance Segmentation, Instance Segmentation	Code Available	1
Viewpoint Equivariance for Multi-View 3D Object Detection	Mar 25, 2023	3D Object Detection, Object	Code Available	1
Self-distillation for surgical action recognition	Mar 22, 2023	Action Recognition, Medical Image Analysis	Code Available	1
Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor Localization	Mar 20, 2023	3D Object Detection, Indoor Localization	Code Available	1
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection	Mar 14, 2023	3D Object Detection, Decoder	Code Available	1
Traffic Scene Parsing through the TSP6K Dataset	Mar 6, 2023	Autonomous Driving, Decoder	Code Available	1
CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images	Feb 22, 2023	Knowledge Distillation, Scene Understanding	Code Available	1
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks	Feb 17, 2023	Deblurring, Deep Learning	Code Available	1
3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation	Feb 7, 2023	6D Pose Estimation, 6D Pose Estimation using RGB	Code Available	1
OvarNet: Towards Open-vocabulary Object Attribute Recognition	Jan 23, 2023	Attribute, Knowledge Distillation	Code Available	1
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection	Jan 22, 2023	3D Object Detection, Autonomous Vehicles	Code Available	1
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic Segmentation, Contrastive Learning	Code Available	1
Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction	Jan 1, 2023	3D Scene Reconstruction, Image Segmentation	Code Available	1
PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation	Jan 1, 2023	Object, Scene Understanding	Code Available	1
PointVST: Self-Supervised Pre-training for 3D Point Clouds via View-Specific Point-to-Image Translation	Dec 29, 2022	Contrastive Learning, Image Generation	Code Available	1
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection	Dec 19, 2022	3D Object Detection, Knowledge Distillation	Code Available	1
Towards Holistic Surgical Scene Understanding	Dec 8, 2022	Action Recognition, Atomic action recognition	Code Available	1

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified