SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 751775 of 1723 papers

TitleStatusHype
3D Vision-Language Gaussian Splatting0
Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy0
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users0
Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection0
CASPNet++: Joint Multi-Agent Motion Prediction0
Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks0
ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding0
Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios0
Cascaded Classification Models: Combining Models for Holistic Scene Understanding0
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation0
Car Segmentation and Pose Estimation using 3D Object Models0
Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors0
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors0
Enhancing image captioning with depth information using a Transformer-based framework0
Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning0
Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving0
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding0
Multilateral Cascading Network for Semantic Segmentation of Large-Scale Outdoor Point Clouds0
Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps0
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators0
Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection0
3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing0
End-to-End Race Driving with Deep Reinforcement Learning0
End-to-end Autonomous Driving using Deep Learning: A Systematic Review0
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving0
Show:102550
← PrevPage 31 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified