Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1251–1275 of 1723 papers

Title	Date	Tasks	Status	Hype
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments	Nov 20, 2020	Graph Generation, reinforcement-learning	Unverified	0
RELLIS-3D Dataset: Data, Benchmarks and Analysis	Nov 17, 2020	3D Semantic Segmentation, Autonomous Navigation	Code Available	1
SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments	Nov 9, 2020	Autonomous Driving, Depth Estimation	Code Available	1
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition	Nov 8, 2020	Action Recognition, Optical Flow Estimation	Unverified	0
Towards Efficient Scene Understanding via Squeeze Reasoning	Nov 6, 2020	Instance Segmentation, object-detection	Code Available	1
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding	Nov 4, 2020	Multi-Task Learning, Scene Understanding	Code Available	2
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation	Nov 4, 2020	Autonomous Driving, Edge-computing	Unverified	0
Learning Regional Purity for Instance Segmentation on 3D Point Clouds	Nov 3, 2020	3D Instance Segmentation, 3D Semantic Segmentation	Code Available	0
Highway Driving Dataset for Semantic Video Segmentation	Nov 2, 2020	Autonomous Driving, Image Segmentation	Unverified	0
Real-time Semantic Segmentation with Context Aggregation Network	Nov 2, 2020	Real-Time Semantic Segmentation, Scene Understanding	Unverified	0
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds	Nov 2, 2020	Scene Understanding	Unverified	0
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation	Oct 30, 2020	Instance Segmentation, Panoptic Segmentation	Code Available	1
Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model	Oct 25, 2020	Depth Estimation, Depth Prediction	Code Available	1
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics	Oct 20, 2020	Decision Making, Logical Reasoning	Unverified	0
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous Driving, Benchmarking	Code Available	1
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	Object, Object Recognition	Unverified	0
Learning Panoptic Segmentation from Instance Contours	Oct 16, 2020	Clustering, Instance Segmentation	Code Available	0
DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM	Oct 15, 2020	Autonomous Driving, Decision Making	Unverified	0
Constructing a Visual Relationship Authenticity Dataset	Oct 11, 2020	Relationship Detection, Scene Understanding	Code Available	0
Be Your Own Best Competitor! Multi-Branched Adversarial Knowledge Transfer	Oct 9, 2020	Decoder, image-classification	Unverified	0
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning	Oct 8, 2020	Natural Language Visual Grounding, Scene Understanding	Code Available	1
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision Making, Scene Understanding	Unverified	0
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus	Oct 2, 2020	Scene Understanding, Semantic Segmentation	Code Available	0
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding	Oct 1, 2020	Deep Learning, image-classification	Code Available	1
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision Making, Object	Unverified	0

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified