SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 14011450 of 1723 papers

TitleStatusHype
Single Image 3D Without a Single 3D Image0
Single Image Depth Estimation: An Overview0
Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning0
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning0
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy0
You Only Scan Once: A Dynamic Scene Reconstruction Pipeline for 6-DoF Robotic Grasping of Novel Objects0
Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture0
Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision0
Waymo Open Dataset: Panoramic Video Panoptic Segmentation0
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding0
SLGaussian: Fast Language Gaussian Splatting in Sparse Views0
Weakly Supervised 3D Instance Segmentation without Instance-level Annotations0
Small Drone Field Experiment: Data Collection & Processing0
Small-Variance Nonparametric Clustering on the Hypersphere0
Smart Infrastructure: A Research Junction0
Audiovisual Highlight Detection in Videos0
SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding0
Audio-visual Event Localization on Portrait Mode Short Videos0
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition0
3D Gated Recurrent Fusion for Semantic Scene Completion0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications0
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images0
So you think you can track?0
SparseLGS: Sparse View Language Embedded Gaussian Splatting0
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models0
Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment0
Attributes-aware Visual Emotion Representation Learning0
SpatialLM: Training Large Language Models for Structured Indoor Modeling0
Spatial Sampling Network for Fast Scene Understanding0
Spatiotemporal Event Graphs for Dynamic Scene Understanding0
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data0
Attention Mechanism based Cognition-level Scene Understanding0
SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis0
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation0
SpeedMachines: Anytime Structured Prediction0
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos0
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification0
A transition towards virtual representations of visual scenes0
SplatTalk: 3D VQA with Gaussian Splatting0
SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds0
A Task-Oriented Approach for Cost-Sensitive Recognition0
A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision0
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views0
SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images0
A Survey on Knowledge Graph-based Methods for Automated Driving0
A Survey on Deep Learning Methods for Semantic Image Segmentation in Real-Time0
A Survey of Knowledge Representation in Service Robotics0
StandardSim: A Synthetic Dataset For Retail Environments0
Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection0
Stochastic Future Prediction in Real World Driving Scenarios0
Show:102550
← PrevPage 29 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified