Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1550 of 1723 papers

Title	Date	Tasks	Status
Multiview Based 3D Scene Understanding On Partial Point Sets	Nov 30, 2018	3D Part Segmentation3D Shape Recognition	—Unverified
ShelfNet for Fast Semantic Segmentation	Nov 27, 2018	Autonomous DrivingDecoder	CodeCode Available
MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization	Nov 26, 2018	2D Object Detection3D Object Detection	CodeCode Available
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments	Nov 26, 2018	Autonomous NavigationDomain Adaptation	CodeCode Available
A pooling based scene text proposal technique for scene text reading in the wild	Nov 25, 2018	Scene UnderstandingText Spotting	—Unverified
Artificial Color Constancy via GoogLeNet with Angular Loss Function	Nov 20, 2018	Color ConstancyObject Recognition	CodeCode Available
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery	Nov 20, 2018	Scene UnderstandingSegmentation	—Unverified
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning	Nov 6, 2018	Scene Understanding	—Unverified
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation	Oct 31, 2018	3D Object DetectionCamera Pose Estimation	CodeCode Available
UAVid: A Semantic Segmentation Dataset for UAV Imagery	Oct 24, 2018	4kAutonomous Driving	CodeCode Available
Diagnostics in Semantic Segmentation	Sep 27, 2018	Image SegmentationScene Understanding	—Unverified
Semantic and structural image segmentation for prosthetic vision	Sep 25, 2018	Image SegmentationObject	—Unverified
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation	Sep 24, 2018	Autonomous DrivingReal-Time Semantic Segmentation	CodeCode Available
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM	Sep 14, 2018	Bayesian InferenceObject	—Unverified
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions	Sep 11, 2018	Question AnsweringScene Understanding	—Unverified
Context-Dependent Diffusion Network for Visual Relationship Detection	Sep 11, 2018	DiversityObject	—Unverified
On the Importance of Visual Context for Data Augmentation in Scene Understanding	Sep 6, 2018	Data AugmentationInstance Segmentation	—Unverified
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks?	Sep 5, 2018	3D ReconstructionDepth Estimation	CodeCode Available
Modeling human intuitions about liquid flow with particle-based simulation	Sep 5, 2018	Scene Understanding	—Unverified
BOLD5000: A public fMRI dataset of 5000 images	Sep 5, 2018	DiversityScene Understanding	CodeCode Available
Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images	Sep 4, 2018	AttributeDynamic Time Warping	CodeCode Available
Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular video	Sep 1, 2018	Scene UnderstandingSmall Data Image Classification	—Unverified
Localization Guided Learning for Pedestrian Attribute Recognition	Aug 28, 2018	AttributePedestrian Attribute Recognition	—Unverified
COFGA: Classification Of Fine-Grained Features In Aerial Images	Aug 27, 2018	ClassificationGeneral Classification	—Unverified
Single Shot Scene Text Retrieval	Aug 27, 2018	Image RetrievalRetrieval	CodeCode Available
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset	Aug 25, 2018	Deep Reinforcement Learningreinforcement-learning	—Unverified
Second-order Democratic Aggregation	Aug 22, 2018	General ClassificationMaterial Classification	—Unverified
Deep Learned Full-3D Object Completion from Single View	Aug 21, 2018	3D geometry3D Reconstruction	—Unverified
Learning Monocular Depth by Distilling Cross-domain Stereo Networks	Aug 20, 2018	Autonomous DrivingDepth Estimation	CodeCode Available
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image	Aug 7, 2018	3D Object DetectionMonocular 3D Object Detection	CodeCode Available
Parsing Geometry Using Structure-Aware Shape Templates	Aug 3, 2018	ObjectObject Recognition	CodeCode Available
Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding	Aug 3, 2018	Scene UnderstandingSemantic Segmentation	—Unverified
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators	Aug 1, 2018	AttributeNatural Questions	—Unverified
A Reinforcement Learning Approach to Target Tracking in a Camera Network	Jul 26, 2018	Q-Learningreinforcement-learning	—Unverified
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization	Jul 19, 2018	object-detectionObject Detection	—Unverified
Three for one and one for three: Flow, Segmentation, and Surface Normals	Jul 19, 2018	Optical Flow EstimationScene Understanding	CodeCode Available
Visual Affordance and Function Understanding: A Survey	Jul 18, 2018	Affordance DetectionScene Understanding	—Unverified
A Reflectance Based Method For Shadow Detection and Removal	Jul 11, 2018	Detecting ShadowsScene Understanding	—Unverified
End-to-End Race Driving with Deep Reinforcement Learning	Jul 6, 2018	Deep Reinforcement LearningDomain Adaptation	—Unverified
A Survey of Knowledge Representation in Service Robotics	Jul 5, 2018	Activity RecognitionBIG-bench Machine Learning	—Unverified
Online Self-supervised Scene Segmentation for Micro Aerial Vehicles	Jun 13, 2018	Scene SegmentationScene Understanding	—Unverified
DenseASPP for Semantic Segmentation in Street Scenes	Jun 1, 2018	Autonomous DrivingImage Segmentation	CodeCode Available
Inferring Shared Attention in Social Scene Videos	Jun 1, 2018	Scene Understanding	—Unverified
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare	Jun 1, 2018	3D Object ReconstructionAutonomous Driving	—Unverified
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System	May 18, 2018	3D Object DetectionAutonomous Driving	—Unverified
Auxiliary Tasks in Multi-task Learning	May 16, 2018	Depth EstimationMulti-Task Learning	CodeCode Available
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding	May 15, 2018	Scene ClassificationScene Understanding	—Unverified
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing	May 11, 2018	Depth EstimationMulti-Task Learning	—Unverified
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty	May 2, 2018	Scene UnderstandingSensor Fusion	CodeCode Available
EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction	May 2, 2018	Saliency PredictionScene Understanding	—Unverified

Show:10 25 50

← PrevPage 31 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified