Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1251–1300 of 1723 papers

Title	Date	Tasks	Status	Hype
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments	Nov 20, 2020	Graph Generation, reinforcement-learning	Unverified	0
RELLIS-3D Dataset: Data, Benchmarks and Analysis	Nov 17, 2020	3D Semantic Segmentation, Autonomous Navigation	Code Available	1
SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments	Nov 9, 2020	Autonomous Driving, Depth Estimation	Code Available	1
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition	Nov 8, 2020	Action Recognition, Optical Flow Estimation	Unverified	0
Towards Efficient Scene Understanding via Squeeze Reasoning	Nov 6, 2020	Instance Segmentation, object-detection	Code Available	1
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding	Nov 4, 2020	Multi-Task Learning, Scene Understanding	Code Available	2
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation	Nov 4, 2020	Autonomous Driving, Edge-computing	Unverified	0
Learning Regional Purity for Instance Segmentation on 3D Point Clouds	Nov 3, 2020	3D Instance Segmentation, 3D Semantic Segmentation	Code Available	0
Highway Driving Dataset for Semantic Video Segmentation	Nov 2, 2020	Autonomous Driving, Image Segmentation	Unverified	0
Real-time Semantic Segmentation with Context Aggregation Network	Nov 2, 2020	Real-Time Semantic Segmentation, Scene Understanding	Unverified	0
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds	Nov 2, 2020	Scene Understanding	Unverified	0
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation	Oct 30, 2020	Instance Segmentation, Panoptic Segmentation	Code Available	1
Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model	Oct 25, 2020	Depth Estimation, Depth Prediction	Code Available	1
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics	Oct 20, 2020	Decision Making, Logical Reasoning	Unverified	0
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous Driving, Benchmarking	Code Available	1
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	Object, Object Recognition	Unverified	0
Learning Panoptic Segmentation from Instance Contours	Oct 16, 2020	Clustering, Instance Segmentation	Code Available	0
DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM	Oct 15, 2020	Autonomous Driving, Decision Making	Unverified	0
Constructing a Visual Relationship Authenticity Dataset	Oct 11, 2020	Relationship Detection, Scene Understanding	Code Available	0
Be Your Own Best Competitor! Multi-Branched Adversarial Knowledge Transfer	Oct 9, 2020	Decoder, image-classification	Unverified	0
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning	Oct 8, 2020	Natural Language Visual Grounding, Scene Understanding	Code Available	1
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision Making, Scene Understanding	Unverified	0
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus	Oct 2, 2020	Scene Understanding, Semantic Segmentation	Code Available	0
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding	Oct 1, 2020	Deep Learning, image-classification	Code Available	1
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision Making, Object	Unverified	0
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation	Sep 28, 2020	Instance Segmentation, Panoptic Segmentation	Unverified	0
A Survey on Deep Learning Methods for Semantic Image Segmentation in Real-Time	Sep 27, 2020	Autonomous Vehicles, Computational Efficiency	Unverified	0
Towards General Purpose Geometry-Preserving Single-View Depth Estimation	Sep 25, 2020	Depth Estimation, Diversity	Unverified	0
Interactive Learning for Semantic Segmentation in Earth Observation	Sep 23, 2020	Domain Adaptation, Earth Observation	Code Available	0
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments	Sep 22, 2020	Domain Adaptation, Scene Understanding	Code Available	1
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation	Sep 7, 2020	Autonomous Driving, Domain Adaptation	Unverified	0
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges	Sep 7, 2020	Scene Understanding, Semantic Segmentation	Code Available	1
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption	Sep 2, 2020	Scene Understanding, Segmentation	Code Available	0
Deep Learning Techniques for Geospatial Data Analysis	Aug 30, 2020	Deep Learning, image-classification	Unverified	0
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds	Aug 27, 2020	3D Object Recognition, Deep Learning	Unverified	0
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module	Aug 24, 2020	3D Semantic Segmentation, Autonomous Driving	Unverified	0
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks	Aug 23, 2020	Anatomy, Data Augmentation	Code Available	0
MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities	Aug 14, 2020	Representation Learning, Scene Understanding	Code Available	0
DAWN: Vehicle Detection in Adverse Weather Nature Dataset	Aug 12, 2020	Autonomous Driving, Scene Understanding	Unverified	0
Factor Graph based 3D Multi-Object Tracking in Point Clouds	Aug 12, 2020	3D Multi-Object Tracking, Multi-Object Tracking	Unverified	0
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene	Aug 11, 2020	Instance Segmentation, Point Cloud Segmentation	Code Available	1
Polysemy Deciphering Network for Robust Human-Object Interaction Detection	Aug 7, 2020	Human-Object Interaction Detection, Object	Code Available	1
Global Context Aware Convolutions for 3D Point Cloud Understanding	Aug 7, 2020	Point Cloud Classification, Retrieval	Unverified	0
Pose-based Modular Network for Human-Object Interaction Detection	Aug 5, 2020	Human-Object Interaction Detection, Object	Code Available	1
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning	Aug 1, 2020	Cross-Modal Retrieval, Representation Learning	Code Available	0
Polysemy Deciphering Network for Human-Object Interaction Detection	Aug 1, 2020	Human-Object Interaction Detection, Object	Code Available	1
Weakly Supervised 3D Object Detection from Point Clouds	Jul 28, 2020	3D Object Detection, Knowledge Distillation	Code Available	1
Virtual Multi-view Fusion for 3D Semantic Segmentation	Jul 26, 2020	2D Semantic Segmentation, 3D Semantic Segmentation	Code Available	1
OpenRooms: An End-to-End Open Framework for Photorealistic Indoor Scene Datasets	Jul 25, 2020	Friction, Inverse Rendering	Unverified	0
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild	Jul 23, 2020	Few-Shot Object Detection, Meta-Learning	Code Available	1

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified