SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 276300 of 1723 papers

TitleStatusHype
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous DrivingCode1
Towards Scene Understanding for Autonomous Operations on Airport ApronsCode1
Language-Assisted 3D Feature Learning for Semantic Scene UnderstandingCode1
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object DetectionCode1
RGB-T Semantic Segmentation with Location, Activation, and SharpeningCode1
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real DataCode1
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task modelsCode1
SQA3D: Situated Question Answering in 3D ScenesCode1
Image Masking for Robust Self-Supervised Monocular Depth EstimationCode1
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier ConvolutionsCode1
Uncertainty-Driven Active Vision for Implicit Scene ReconstructionCode1
Dynamic Graph Message Passing Networks for Visual RecognitionCode1
Segmenting Known Objects and Unseen Unknowns without Prior KnowledgeCode1
Leveraging Large (Visual) Language Models for Robot 3D Scene UnderstandingCode1
MassMIND: Massachusetts Maritime INfrared DatasetCode1
SemSegDepth: A Combined Model for Semantic Segmentation and Depth CompletionCode1
Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance SegmentationCode1
TAG: Boosting Text-VQA via Text-aware Visual Question-answer GenerationCode1
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point CloudCode1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous DrivingCode1
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language ModelsCode1
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise BinarizationCode1
Egocentric Scene Understanding via Multimodal Spatial RectifierCode1
Efficient Multi-Task RGB-D Scene Analysis for Indoor EnvironmentsCode1
MCTS with Refinement for Proposals Selection Games in Scene UnderstandingCode1
Show:102550
← PrevPage 12 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified