SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.
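As a rough illustration of the step from object recognition to relationships, the sketch below takes a few detected objects with bounding boxes and derives simple spatial relations between them. It is a minimal sketch under stated assumptions: the `SceneObject` class, the `spatial_relations` helper, the relation names, and the example boxes are all hypothetical and are not taken from any paper or benchmark listed on this page.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class SceneObject:
    """Hypothetical detection: a label plus an axis-aligned box."""
    label: str
    # (x_min, y_min, x_max, y_max) in image coordinates, y grows downward.
    box: Tuple[float, float, float, float]


def spatial_relations(objects: List[SceneObject]) -> List[Tuple[str, str, str]]:
    """Return (subject, relation, object) triples for every ordered pair.

    Only two illustrative relations are derived, from box extents alone:
    "left_of" and "above". Real systems use far richer cues.
    """
    triples = []
    for a in objects:
        for b in objects:
            if a is b:
                continue
            if a.box[2] <= b.box[0]:  # a ends before b starts horizontally
                triples.append((a.label, "left_of", b.label))
            if a.box[3] <= b.box[1]:  # a ends before b starts vertically
                triples.append((a.label, "above", b.label))
    return triples


# Made-up example scene (assumption, for illustration only).
scene = [
    SceneObject("lamp", (10, 0, 30, 40)),
    SceneObject("table", (0, 50, 100, 90)),
    SceneObject("chair", (110, 40, 150, 95)),
]
relations = spatial_relations(scene)
for triple in relations:
    print(triple)
```

The labels alone describe what is in the scene; the triples add how the objects are arranged, which is the part of the definition above that plain recognition does not cover.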

Papers

Showing 1126-1150 of 1723 papers

Title | Status | Hype
Learning-based Relational Object Matching Across Views | | 0
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation | | 0
Learning Densities in Feature Space for Reliable Segmentation of Indoor Scenes | | 0
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length | | 0
Learning Direct Optimization for Scene Understanding | | 0
Learning from Maps: Visual Common Sense for Autonomous Driving | | 0
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation | | 0
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs | | 0
Learning in Audio-visual Context: A Review, Analysis, and New Perspective | | 0
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes | | 0
SceneGPT: A Language Model for 3D Scene Understanding | | 0
Scene Graph Generation: A Comprehensive Survey | | 0
A Comprehensive Survey of Scene Graphs: Generation and Application | | 0
Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding | | 0
Scene-Independent Group Profiling in Crowd | | 0
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation? | | 0
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations | | 0
Scene recognition based on DNN and game theory with its applications in human-robot interaction | | 0
SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting | | 0
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | | 0
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments | | 0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate | | 0
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text | | 0
Scene Understanding Enabled Semantic Communication with Open Channel Coding | | 0
Scene Understanding for Autonomous Manipulation with Deep Learning | | 0
Page 46 of 69

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified