SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 13511400 of 1723 papers

TitleStatusHype
Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics0
Semantic Augmented Reality Environment with Material-Aware Physical Interactions0
Semantic-aware Transmission for Robust Point Cloud Classification0
Semantic Dense Reconstruction with Consistent Scene Segments0
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety0
SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments0
Semantic Foggy Scene Understanding with Synthetic Data0
Classification of Single-View Object Point Clouds0
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting0
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer0
Semantic Is Enough: Only Semantic Information For NeRF Reconstruction0
Beyond Categories: The Visual Memex Model for Reasoning About Object Relationships0
Semantic Motion Segmentation Using Dense CRF Formulation0
Semantic Pose using Deep Networks Trained on Synthetic RGB-D0
Benchmarking Vision Language Models for Cultural Understanding0
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation0
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery0
Semantic segmentation of surgical hyperspectral images under geometric domain shifts0
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics0
VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry0
VLP: Vision Language Planning for Autonomous Driving0
SemanticSplat: Feed-Forward 3D Scene Understanding with Language-Aware Gaussian Fields0
3D Object Aided Self-Supervised Monocular Depth Estimation0
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification0
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding0
Semi-Supervised Learning of Multi-Object 3D Scene Representations0
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors0
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation0
Semi-Supervised Semantic Mapping through Label Propagation with Semantic Texture Meshes0
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW20240
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism0
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features0
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery0
Separated Inter/Intra-Modal Fusion Prompts for Compositional Zero-Shot Learning0
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving0
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation0
3D-MVP: 3D Multiview Pretraining for Manipulation0
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction0
AVD2: Accident Video Diffusion for Accident Video Description0
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding0
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery0
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM0
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents0
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation0
Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision0
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation0
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition0
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer0
Show:102550
← PrevPage 28 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified