SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 651–675 of 1723 papers

Title | Status | Hype
MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities | Code | 0
Matterport3D: Learning from RGB-D Data in Indoor Environments | Code | 0
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation | Code | 0
Gated Driver Attention Predictor | Code | 0
Gated2Depth: Real-time Dense Lidar from Gated Images | Code | 0
METEOR Guided Divergence for Video Captioning | Code | 0
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation | Code | 0
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks | Code | 0
Loss Distillation via Gradient Matching for Point Cloud Completion with Weighted Chamfer Distance | Code | 0
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory | Code | 0
Loss Switching Fusion with Similarity Search for Video Classification | Code | 0
LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics | Code | 0
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation | Code | 0
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild | Code | 0
Lightweight integration of 3D features to improve 2D image segmentation | Code | 0
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images | Code | 0
From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection | Code | 0
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding | Code | 0
From Feature Importance to Natural Language Explanations Using LLMs with RAG | Code | 0
CNN-based Lidar Point Cloud De-Noising in Adverse Weather | Code | 0
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning | Code | 0
LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition | Code | 0
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Code | 0
Learning Regional Purity for Instance Segmentation on 3D Point Clouds | Code | 0
Learning Panoptic Segmentation from Instance Contours | Code | 0
Page 27 of 69

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified