Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1301–1350 of 1723 papers

Title	Date	Tasks	Status
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation	Mar 3, 2021	Autonomous DrivingDepth Estimation	CodeCode Available
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation	Mar 2, 2021	Domain AdaptationScene Understanding	—Unverified
A Kinematic Bottleneck Approach For Pose Regression of Flexible Surgical Instruments directly from Images	Feb 28, 2021	Pose Estimationregression	—Unverified
Audiovisual Highlight Detection in Videos	Feb 11, 2021	Highlight DetectionObject Recognition	—Unverified
Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery	Feb 5, 2021	Earth ObservationScene Understanding	CodeCode Available
Optical flow and scene flow estimation: A survey	Feb 1, 2021	Action RecognitionAutonomous Driving	—Unverified
Deep Learning--Based Scene Simplification for Bionic Vision	Jan 30, 2021	Deep LearningDepth Estimation	CodeCode Available
The Ikshana Hypothesis of Human Scene Understanding	Jan 21, 2021	Representation LearningScene Understanding	CodeCode Available
Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection	Jan 21, 2021	Autonomous NavigationModel Selection	—Unverified
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images	Jan 19, 2021	Depth EstimationMonocular Depth Estimation	—Unverified
BUTLER: Building Understanding in TextWorld via Language for Embodied Reasoning	Jan 1, 2021	Scene Understanding	—Unverified
Non-maximum Suppression Also Closes the Variational Approximation Gap of Multi-object Variational Autoencoders	Jan 1, 2021	ObjectRepresentation Learning	—Unverified
Pseudo Label-Guided Multi Task Learning for Scene Understanding	Jan 1, 2021	Depth EstimationMonocular Depth Estimation	—Unverified
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate	Dec 26, 2020	Scene Text DetectionScene Understanding	—Unverified
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding	Dec 24, 2020	Contrastive LearningRepresentation Learning	—Unverified
Classification of Single-View Object Point Clouds	Dec 18, 2020	3D Object Classification6D Pose Estimation using RGB	—Unverified
Embodied Visual Active Learning for Semantic Segmentation	Dec 17, 2020	Active LearningDeep Reinforcement Learning	—Unverified
Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos	Dec 15, 2020	Autonomous VehiclesCamera Auto-Calibration	—Unverified
Image-Graph-Image Translation via Auto-Encoding	Dec 10, 2020	Scene UnderstandingTranslation	—Unverified
Multi-Model Learning for Real-Time Automotive Semantic Foggy Scene Understanding via Domain Adaptation	Dec 9, 2020	DecoderDomain Adaptation	—Unverified
Competitive Simplicity for Multi-Task Learning for Real-Time Foggy Scene Understanding via Domain Adaptation	Dec 9, 2020	Depth EstimationDomain Adaptation	—Unverified
Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding	Nov 29, 2020	Scene UnderstandingSemantic Segmentation	—Unverified
Multi-task GANs for Semantic Segmentation and Depth Completion with Cycle Consistency	Nov 29, 2020	Autonomous DrivingDepth Completion	—Unverified
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation	Nov 26, 2020	Instance SegmentationScene Understanding	—Unverified
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments	Nov 20, 2020	Graph Generationreinforcement-learning	—Unverified
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition	Nov 8, 2020	Action RecognitionOptical Flow Estimation	—Unverified
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation	Nov 4, 2020	Autonomous DrivingEdge-computing	—Unverified
Learning Regional Purity for Instance Segmentation on 3D Point Clouds	Nov 3, 2020	3D Instance Segmentation3D Semantic Segmentation	CodeCode Available
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds	Nov 2, 2020	Scene Understanding	—Unverified
Highway Driving Dataset for Semantic Video Segmentation	Nov 2, 2020	Autonomous DrivingImage Segmentation	—Unverified
Real-time Semantic Segmentation with Context Aggregation Network	Nov 2, 2020	Real-Time Semantic SegmentationScene Understanding	—Unverified
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics	Oct 20, 2020	Decision MakingLogical Reasoning	—Unverified
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	ObjectObject Recognition	—Unverified
Learning Panoptic Segmentation from Instance Contours	Oct 16, 2020	ClusteringInstance Segmentation	CodeCode Available
DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM	Oct 15, 2020	Autonomous DrivingDecision Making	—Unverified
Constructing a Visual Relationship Authenticity Dataset	Oct 11, 2020	Relationship DetectionScene Understanding	CodeCode Available
Be Your Own Best Competitor! Multi-Branched Adversarial Knowledge Transfer	Oct 9, 2020	Decoderimage-classification	—Unverified
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision MakingScene Understanding	—Unverified
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus	Oct 2, 2020	Scene UnderstandingSemantic Segmentation	CodeCode Available
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation	Sep 28, 2020	Instance SegmentationPanoptic Segmentation	—Unverified
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision MakingObject	—Unverified
A Survey on Deep Learning Methods for Semantic Image Segmentation in Real-Time	Sep 27, 2020	Autonomous VehiclesComputational Efficiency	—Unverified
Towards General Purpose Geometry-Preserving Single-View Depth Estimation	Sep 25, 2020	Depth EstimationDiversity	—Unverified
Interactive Learning for Semantic Segmentation in Earth Observation	Sep 23, 2020	Domain AdaptationEarth Observation	CodeCode Available
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation	Sep 7, 2020	Autonomous DrivingDomain Adaptation	—Unverified
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption	Sep 2, 2020	Scene UnderstandingSegmentation	CodeCode Available
Deep Learning Techniques for Geospatial Data Analysis	Aug 30, 2020	Deep Learningimage-classification	—Unverified
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds	Aug 27, 2020	3D Object RecognitionDeep Learning	—Unverified
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module	Aug 24, 2020	3D Semantic SegmentationAutonomous Driving	—Unverified
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks	Aug 23, 2020	AnatomyData Augmentation	CodeCode Available

Show:10 25 50

← PrevPage 27 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified