SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 12511300 of 1723 papers

TitleStatusHype
SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives0
Symbolic Graph Inference for Compound Scene Understanding0
Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation0
Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities0
SynthCam3D: Semantic Understanding With Synthetic Indoor Scenes0
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery0
Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander0
Tactile MNIST: Benchmarking Active Tactile Perception0
TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning0
TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning0
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot0
TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs0
TARS: Traffic-Aware Radar Scene Flow Estimation0
Taskology: Utilizing Task Relations at Scale0
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances0
Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding0
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction0
Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation0
Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation0
Test-Time Intensity Consistency Adaptation for Shadow Detection0
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions0
Text-to-Image GAN with Pretrained Representations0
Texture Underfitting for Domain Adaptation0
TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking0
TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness0
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation0
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes0
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes0
The Right (Angled) Perspective: Improving the Understanding of Road Scenes Using Boosted Inverse Perspective Mapping0
These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models0
-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding0
The toulouse vanishing points dataset0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
To complete or to estimate, that is the question: A Multi-Task Approach to Depth Completion and Monocular Depth Estimation0
TopoMask: Instance-Mask-Based Formulation for the Road Topology Problem via Transformer-Based Architecture0
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module0
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning0
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases0
Towards 3D Scene Understanding by Referring Synthetic Models0
Towards Adapting ImageNet to Reality: Scalable Domain Adaptation with Implicit Low-rank Transformations0
Towards A Unified Agent with Foundation Models0
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation0
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering0
Towards General Purpose Geometry-Preserving Single-View Depth Estimation0
Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models0
Towards holistic scene understanding: Semantic segmentation and beyond0
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data0
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents0
Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin-based Scene Representation0
Show:102550
← PrevPage 26 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified