Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1151–1175 of 1723 papers

Title	Date	Tasks	Status	Hype
A Survey on Deep Learning Technique for Video Segmentation	Jul 2, 2021	Autonomous Driving, Deep Learning	Code Available	1
An Analysis of State-of-the-Art Models for Situated Interactive MultiModal Conversations (SIMMC)	Jul 1, 2021	Scene Understanding	Unverified	0
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring	Jul 1, 2021	Food Recognition, Image Captioning	Unverified	0
Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization	Jul 1, 2021	Image Segmentation, Scene Understanding	Unverified	0
IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement	Jun 29, 2021	2D Semantic Segmentation, 3D Semantic Scene Completion	Unverified	0
False Negative Reduction in Video Instance Segmentation using Uncertainty Estimates	Jun 28, 2021	Depth Estimation, Instance Segmentation	Code Available	0
SDOF-Tracker: Fast and Accurate Multiple Human Tracking by Skipped-Detection and Optical-Flow	Jun 27, 2021	Human Detection, Optical Flow Estimation	Code Available	0
OffRoadTranSeg: Semi-Supervised Segmentation using Transformers on OffRoad environments	Jun 26, 2021	Autonomous Driving, Depth Estimation	Unverified	0
iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability	Jun 25, 2021	Bias Detection, Question Answering	Unverified	0
P2T: Pyramid Pooling Transformer for Scene Understanding	Jun 22, 2021	image-classification, Image Classification	Code Available	1
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation	Jun 21, 2021	3D Semantic Segmentation, Autonomous Driving	Code Available	1
Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation	Jun 19, 2021	Autonomous Driving, GPU	Code Available	1
OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets	Jun 19, 2021	Friction, Inverse Rendering	Unverified	0
Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion	Jun 19, 2021	Camera Pose Estimation, Decoder	Unverified	0
Part-aware Panoptic Segmentation	Jun 11, 2021	Image Segmentation, Panoptic Segmentation	Code Available	1
Vision Transformers with Hierarchical Attention	Jun 6, 2021	image-classification, Image Classification	Code Available	1
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering	Jun 4, 2021	Meta-Learning, Scene Understanding	Code Available	1
Towards urban scenes understanding through polarization cues	Jun 3, 2021	Depth Estimation, Scene Understanding	Unverified	0
Polarimetric Spatio-Temporal Light Transport Probing	May 25, 2021	Metamerism, Scene Understanding	Unverified	0
Egocentric Activity Recognition and Localization on a 3D Map	May 20, 2021	Action Localization, Action Recognition	Unverified	0
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data	May 18, 2021	object-detection, Object Detection	Unverified	0
Image interpretation by iterative bottom-up top-down processing	May 12, 2021	Scene Understanding	Code Available	0
Scene Understanding for Autonomous Driving	May 11, 2021	Autonomous Driving, Scene Understanding	Unverified	0
Lane Graph Estimation for Scene Understanding in Urban Driving	May 1, 2021	Autonomous Driving, Autonomous Vehicles	Code Available	1
ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene Perception	Apr 27, 2021	Instance Segmentation, object-detection	Unverified	0

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified