Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1451–1500 of 1723 papers

Title	Date	Tasks	Status	Hype
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer	Apr 3, 2019	Deep Reinforcement LearningReinforcement Learning	CodeCode Available	0
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences	Apr 2, 2019	3D Semantic SegmentationScene Understanding	CodeCode Available	0
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields	Apr 1, 2019	3D Instance Segmentation3D Semantic Instance Segmentation	CodeCode Available	0
ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data	Apr 1, 2019	Scene ParsingScene Understanding	CodeCode Available	0
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?	Mar 31, 2019	Human-Object Interaction DetectionObject	—Unverified	0
Road Scene Understanding by Occupancy Grid Learning from Sparse Radar Clusters using Semantic Segmentation	Mar 31, 2019	Autonomous Drivingroad scene understanding	CodeCode Available	0
Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis	Mar 27, 2019	Generative Adversarial NetworkImage Generation	CodeCode Available	0
Veritatem Dies Aperit- Temporally Consistent Depth Prediction Enabled by a Multi-Task Geometric and Semantic Scene Understanding Approach	Mar 26, 2019	Autonomous DrivingDepth Completion	CodeCode Available	0
Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud	Mar 23, 2019	3D Object DetectionDepth Estimation	CodeCode Available	0
Scene Understanding for Autonomous Manipulation with Deep Learning	Mar 23, 2019	Action UnderstandingAffordance Detection	—Unverified	0
Quantitative Depth Quality Assessment of RGBD Cameras At Close Range Using 3D Printed Fixtures	Mar 21, 2019	Scene Understanding	CodeCode Available	0
Affordance Learning In Direct Perception for Autonomous Driving	Mar 20, 2019	AttributeAutonomous Driving	—Unverified	0
Real time backbone for semantic segmentation	Mar 16, 2019	Autonomous DrivingModel Compression	—Unverified	0
Instance- and Category-level 6D Object Pose Estimation	Mar 11, 2019	6D Pose Estimation using RGBObject	—Unverified	0
Building an Affordances Map with Interactive Perception	Mar 11, 2019	General ClassificationScene Understanding	—Unverified	0
Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction	Mar 9, 2019	DenoisingObject	—Unverified	0
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes	Mar 4, 2019	3D Object DetectionObject	—Unverified	0
An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions	Feb 20, 2019	Autonomous DrivingScene Understanding	CodeCode Available	0
Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection	Feb 15, 2019	Relationship DetectionScene Understanding	CodeCode Available	0
Gated2Depth: Real-time Dense Lidar from Gated Images	Feb 13, 2019	Scene Understanding	CodeCode Available	0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications	Feb 8, 2019	Deep LearningHigh-Level Synthesis	—Unverified	0
Single Network Panoptic Segmentation for Street Scene Understanding	Feb 7, 2019	Instance SegmentationPanoptic Segmentation	CodeCode Available	0
Real-time 3D Traffic Cone Detection for Autonomous Driving	Feb 6, 2019	3D Object DetectionAutonomous Driving	CodeCode Available	0
VrR-VG: Refocusing Visually-Relevant Relationships	Feb 1, 2019	Image CaptioningQuestion Answering	—Unverified	0
Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection	Jan 25, 2019	Anomaly DetectionDecoder	CodeCode Available	0
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera	Jan 9, 2019	3D Reconstruction3D Scene Reconstruction	CodeCode Available	0
Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding	Jan 5, 2019	Domain AdaptationScene Understanding	CodeCode Available	1
Learning Spatial Common Sense with Geometry-Aware Recurrent Networks	Dec 31, 2018	Common Sense ReasoningRepresentation Learning	—Unverified	0
Impact of Ground Truth Annotation Quality on Performance of Semantic Image Segmentation of Traffic Conditions	Dec 30, 2018	Autonomous DrivingImage Segmentation	CodeCode Available	0
Reasoning About Physical Interactions with Object-Oriented Prediction and Planning	Dec 28, 2018	ObjectScene Understanding	—Unverified	0
Learning Direct Optimization for Scene Understanding	Dec 18, 2018	Scene Understanding	—Unverified	0
Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation	Dec 17, 2018	ClassificationData Augmentation	—Unverified	0
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles	Dec 10, 2018	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Counterfactual Critic Multi-Agent Training for Scene Graph Generation	Dec 6, 2018	counterfactualGraph Generation	—Unverified	0
The Right (Angled) Perspective: Improving the Understanding of Road Scenes Using Boosted Inverse Perspective Mapping	Dec 3, 2018	Autonomous VehiclesObject Tracking	—Unverified	0
Submodular Field Grammars: Representation, Inference, and Application to Image Parsing	Dec 1, 2018	Scene Understanding	—Unverified	0
Learning to Exploit Stability for 3D Scene Parsing	Dec 1, 2018	Scene ParsingScene Understanding	—Unverified	0
Multiview Based 3D Scene Understanding On Partial Point Sets	Nov 30, 2018	3D Part Segmentation3D Shape Recognition	—Unverified	0
ShelfNet for Fast Semantic Segmentation	Nov 27, 2018	Autonomous DrivingDecoder	CodeCode Available	0
MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization	Nov 26, 2018	2D Object Detection3D Object Detection	CodeCode Available	0
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments	Nov 26, 2018	Autonomous NavigationDomain Adaptation	CodeCode Available	0
A pooling based scene text proposal technique for scene text reading in the wild	Nov 25, 2018	Scene UnderstandingText Spotting	—Unverified	0
Artificial Color Constancy via GoogLeNet with Angular Loss Function	Nov 20, 2018	Color ConstancyObject Recognition	CodeCode Available	0
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery	Nov 20, 2018	Scene UnderstandingSegmentation	—Unverified	0
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning	Nov 6, 2018	Scene Understanding	—Unverified	0
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation	Oct 31, 2018	3D Object DetectionCamera Pose Estimation	CodeCode Available	0
UAVid: A Semantic Segmentation Dataset for UAV Imagery	Oct 24, 2018	4kAutonomous Driving	CodeCode Available	0
Multi-Task Learning as Multi-Objective Optimization	Oct 10, 2018	Depth EstimationGeneral Classification	CodeCode Available	2
Diagnostics in Semantic Segmentation	Sep 27, 2018	Image SegmentationScene Understanding	—Unverified	0
Semantic and structural image segmentation for prosthetic vision	Sep 25, 2018	Image SegmentationObject	—Unverified	0

Show:10 25 50

← PrevPage 30 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified