SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 13761400 of 1723 papers

TitleStatusHype
Semi-Supervised Learning of Multi-Object 3D Scene Representations0
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors0
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation0
Semi-Supervised Semantic Mapping through Label Propagation with Semantic Texture Meshes0
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW20240
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism0
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features0
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery0
Separated Inter/Intra-Modal Fusion Prompts for Compositional Zero-Shot Learning0
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving0
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation0
3D-MVP: 3D Multiview Pretraining for Manipulation0
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction0
AVD2: Accident Video Diffusion for Accident Video Description0
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding0
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery0
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM0
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents0
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation0
Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision0
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation0
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition0
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer0
Show:102550
← PrevPage 56 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified