Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.
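As a rough illustration of the difference between recognizing objects and relating them to one another, the sketch below derives coarse spatial relations from bounding boxes. The `Box` class, the `spatial_relation` and `scene_relations` helpers, and the center-based rule are all hypothetical and chosen only for this example. They are not taken from any paper listed on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical detection: a label plus an axis-aligned box.

    Coordinates are image coordinates, so y grows downward.
    """
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    @property
    def center(self):
        return ((self.x_min + self.x_max) / 2, (self.y_min + self.y_max) / 2)


def spatial_relation(a: Box, b: Box) -> str:
    """Return a coarse relation of a with respect to b, using box centers only."""
    (ax, ay), (bx, by) = a.center, b.center
    dx, dy = ax - bx, ay - by
    # The dominant axis of displacement decides the relation (an assumed rule).
    if abs(dx) >= abs(dy):
        return "left of" if dx < 0 else "right of"
    return "above" if dy < 0 else "below"


def scene_relations(boxes):
    """List (subject, relation, object) triples for every ordered pair of boxes."""
    return [
        (a.label, spatial_relation(a, b), b.label)
        for a in boxes
        for b in boxes
        if a is not b
    ]


# Object recognition alone would stop at the labels "cup" and "table".
detections = [
    Box("cup", 40, 20, 60, 40),
    Box("table", 0, 50, 100, 90),
]
triples = scene_relations(detections)
```

Here `triples` holds one relation per ordered pair of objects, which is the kind of relational output that goes beyond a flat list of labels. Real systems typically also reason about depth, support, and occlusion, which this sketch ignores.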

Papers


Showing 1501–1550 of 1723 papers

Title	Date	Tasks	Status	Hype
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation	Sep 24, 2018	Autonomous Driving, Real-Time Semantic Segmentation	Code Available	0
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM	Sep 14, 2018	Bayesian Inference, Object	Unverified	0
Context-Dependent Diffusion Network for Visual Relationship Detection	Sep 11, 2018	Diversity, Object	Unverified	0
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions	Sep 11, 2018	Question Answering, Scene Understanding	Unverified	0
On the Importance of Visual Context for Data Augmentation in Scene Understanding	Sep 6, 2018	Data Augmentation, Instance Segmentation	Unverified	0
Modeling human intuitions about liquid flow with particle-based simulation	Sep 5, 2018	Scene Understanding	Unverified	0
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks?	Sep 5, 2018	3D Reconstruction, Depth Estimation	Code Available	0
BOLD5000: A public fMRI dataset of 5000 images	Sep 5, 2018	Diversity, Scene Understanding	Code Available	0
Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images	Sep 4, 2018	Attribute, Dynamic Time Warping	Code Available	0
Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular video	Sep 1, 2018	Scene Understanding, Small Data Image Classification	Unverified	0
Localization Guided Learning for Pedestrian Attribute Recognition	Aug 28, 2018	Attribute, Pedestrian Attribute Recognition	Unverified	0
Single Shot Scene Text Retrieval	Aug 27, 2018	Image Retrieval, Retrieval	Code Available	0
COFGA: Classification Of Fine-Grained Features In Aerial Images	Aug 27, 2018	Classification, General Classification	Unverified	0
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset	Aug 25, 2018	Deep Reinforcement Learning, reinforcement-learning	Unverified	0
Second-order Democratic Aggregation	Aug 22, 2018	General Classification, Material Classification	Unverified	0
Deep Learned Full-3D Object Completion from Single View	Aug 21, 2018	3D geometry, 3D Reconstruction	Unverified	0
Learning Monocular Depth by Distilling Cross-domain Stereo Networks	Aug 20, 2018	Autonomous Driving, Depth Estimation	Code Available	0
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image	Aug 7, 2018	3D Object Detection, Monocular 3D Object Detection	Code Available	0
Parsing Geometry Using Structure-Aware Shape Templates	Aug 3, 2018	Object, Object Recognition	Code Available	0
Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding	Aug 3, 2018	Scene Understanding, Semantic Segmentation	Unverified	0
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators	Aug 1, 2018	Attribute, Natural Questions	Unverified	0
Unified Perceptual Parsing for Scene Understanding	Jul 26, 2018	2D Semantic Segmentation, Scene Understanding	Code Available	1
A Reinforcement Learning Approach to Target Tracking in a Camera Network	Jul 26, 2018	Q-Learning, reinforcement-learning	Unverified	0
Three for one and one for three: Flow, Segmentation, and Surface Normals	Jul 19, 2018	Optical Flow Estimation, Scene Understanding	Code Available	0
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization	Jul 19, 2018	object-detection, Object Detection	Unverified	0
Visual Affordance and Function Understanding: A Survey	Jul 18, 2018	Affordance Detection, Scene Understanding	Unverified	0
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning	Jul 16, 2018	3d scene graph generation, Graph Generation	Code Available	1
A Reflectance Based Method For Shadow Detection and Removal	Jul 11, 2018	Detecting Shadows, Scene Understanding	Unverified	0
End-to-End Race Driving with Deep Reinforcement Learning	Jul 6, 2018	Deep Reinforcement Learning, Domain Adaptation	Unverified	0
A Survey of Knowledge Representation in Service Robotics	Jul 5, 2018	Activity Recognition, BIG-bench Machine Learning	Unverified	0
Online Self-supervised Scene Segmentation for Micro Aerial Vehicles	Jun 13, 2018	Scene Segmentation, Scene Understanding	Unverified	0
Digging Into Self-Supervised Monocular Depth Estimation	Jun 4, 2018	Camera Pose Estimation, Depth Estimation	Code Available	1
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare	Jun 1, 2018	3D Object Reconstruction, Autonomous Driving	Unverified	0
Inferring Shared Attention in Social Scene Videos	Jun 1, 2018	Scene Understanding	Unverified	0
DenseASPP for Semantic Segmentation in Street Scenes	Jun 1, 2018	Autonomous Driving, Image Segmentation	Code Available	0
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System	May 18, 2018	3D Object Detection, Autonomous Driving	Unverified	0
Auxiliary Tasks in Multi-task Learning	May 16, 2018	Depth Estimation, Multi-Task Learning	Code Available	0
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding	May 15, 2018	Scene Classification, Scene Understanding	Unverified	0
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing	May 11, 2018	Depth Estimation, Multi-Task Learning	Unverified	0
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty	May 2, 2018	Scene Understanding, Sensor Fusion	Code Available	0
EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction	May 2, 2018	Saliency Prediction, Scene Understanding	Unverified	0
An Anti-fraud System for Car Insurance Claim Based on Visual Evidence	Apr 30, 2018	Scene Understanding	Unverified	0
On the iterative refinement of densely connected representation levels for semantic segmentation	Apr 30, 2018	Image Segmentation, Scene Understanding	Code Available	0
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data	Apr 24, 2018	Data Augmentation, Scene Understanding	Unverified	0
Deep cross-domain building extraction for selective depth estimation from oblique aerial imagery	Apr 23, 2018	3D Reconstruction, Depth Estimation	Unverified	0
VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry	Apr 23, 2018	Outdoor Localization, Scene Understanding	Unverified	0
LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics	Apr 16, 2018	Navigate, Scene Understanding	Code Available	0
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation	Apr 12, 2018	Optical Flow Estimation, Scene Flow Estimation	Code Available	0
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length	Mar 27, 2018	Depth Estimation, Network Embedding	Unverified	0
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects	Mar 27, 2018	General Classification, Object	Code Available	1


Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified