SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1201-1250 of 1723 papers

Title | Status | Hype
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving | | 0
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction | | 0
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding | | 0
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery | | 0
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation | | 0
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation | | 0
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition | | 0
Single Image 3D Without a Single 3D Image | | 0
Single Image Depth Estimation: An Overview | | 0
Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning | | 0
Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture | | 0
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding | | 0
SLGaussian: Fast Language Gaussian Splatting in Sparse Views | | 0
Small Drone Field Experiment: Data Collection & Processing | | 0
Small-Variance Nonparametric Clustering on the Hypersphere | | 0
Smart Infrastructure: A Research Junction | | 0
SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding | | 0
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition | | 0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications | | 0
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images | | 0
So you think you can track? | | 0
SparseLGS: Sparse View Language Embedded Gaussian Splatting | | 0
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models | | 0
SpatialLM: Training Large Language Models for Structured Indoor Modeling | | 0
Spatial Sampling Network for Fast Scene Understanding | | 0
Spatiotemporal Event Graphs for Dynamic Scene Understanding | | 0
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data | | 0
SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis | | 0
SpeedMachines: Anytime Structured Prediction | | 0
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | | 0
SplatTalk: 3D VQA with Gaussian Splatting | | 0
SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds | | 0
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | | 0
SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images | | 0
StandardSim: A Synthetic Dataset For Retail Environments | | 0
Stochastic Future Prediction in Real World Driving Scenarios | | 0
Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph | | 0
Semantic and structural image segmentation for prosthetic vision | | 0
Structural Concept Learning via Graph Attention for Multi-Level Rearrangement Planning | | 0
Structured agents for physical construction | | 0
Structured Generative Models for Scene Understanding | | 0
Neural Language of Thought Models | | 0
Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos | | 0
Submodular Field Grammars: Representation, Inference, and Application to Image Parsing | | 0
SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite | | 0
SUPER: A Novel Lane Detection System | | 0
SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | | 0
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review | | 0
SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection | | 0
SurGNN: Explainable visual scene understanding and assessment of surgical skill using graph neural networks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified