SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 376-400 of 1723 papers

Title | Status | Hype
Learning Triadic Belief Dynamics in Nonverbal Communication from Videos | Code | 1
Multi-View Radar Semantic Segmentation | Code | 1
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences | Code | 1
Bidirectional Projection Network for Cross Dimension Scene Understanding | Code | 1
Tracking Pedestrian Heads in Dense Crowd | Code | 1
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding | Code | 1
OFFSEG: A Semantic Segmentation Framework For Off-Road Driving | Code | 1
Detecting Human-Object Interaction via Fabricated Compositional Learning | Code | 1
Monte Carlo Scene Search for 3D Scene Understanding | Code | 1
Holistic 3D Scene Understanding from a Single Image with Implicit Representation | Code | 1
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality | Code | 1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis | Code | 1
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning | Code | 1
FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation | Code | 1
Boundary-induced and scene-aggregated network for monocular depth prediction | Code | 1
4D Panoptic LiDAR Segmentation | Code | 1
RGB-D Railway Platform Monitoring and Scene Understanding for Enhanced Passenger Safety | Code | 1
Weakly Supervised Learning of Rigid 3D Scene Flow | Code | 1
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images | Code | 1
Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout Estimation from Spherical Panoramas | Code | 1
OpenGF: An Ultra-Large-Scale Ground Filtering Dataset Built Upon Open ALS Point Clouds Around the World | Code | 1
Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups | Code | 1
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection | Code | 1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts | Code | 1
Event-based Motion Segmentation with Spatio-Temporal Graph Cuts | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified