SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 701-725 of 1723 papers

Title | Status | Hype
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding | Code | 0
Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange | Code | 0
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings | Code | 0
Learning Regional Purity for Instance Segmentation on 3D Point Clouds | Code | 0
Learning Panoptic Segmentation from Instance Contours | Code | 0
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation | Code | 0
Learning Monocular Depth by Distilling Cross-domain Stereo Networks | Code | 0
Fast Scene Understanding for Autonomous Driving | Code | 0
Artificial Color Constancy via GoogLeNet with Angular Loss Function | Code | 0
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions | Code | 0
False Negative Reduction in Video Instance Segmentation using Uncertainty Estimates | Code | 0
Implicit Background Estimation for Semantic Segmentation | Code | 0
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | Code | 0
InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Code | 0
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning | Code | 0
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning | Code | 0
Knowledge-Guided Object Discovery with Acquired Deep Impressions | Code | 0
Facing the Void: Overcoming Missing Data in Multi-View Imagery | Code | 0
Joint stereo 3D object detection and implicit surface reconstruction | Code | 0
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields | Code | 0
Extremely Fine-Grained Visual Classification over Resembling Glyphs in the Wild | Code | 0
Adversarial Attacks on Monocular Pose Estimation | Code | 0
Language-based Colorization of Scene Sketches | Code | 0
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation | Code | 0
Interpretable Visual Understanding with Cognitive Attention Network | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN (ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified