SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 11011150 of 1723 papers

TitleStatusHype
Joint Modeling of Visual Objects and Relations for Scene Graph Generation0
Joint Optical Flow and Temporally Consistent Semantic Segmentation0
Joint prototype and coefficient prediction for 3D instance segmentation0
Joint Semantic and Motion Segmentation for dynamic scenes using Deep Convolutional Networks0
Joint SFM and Detection Cues for Monocular 3D Localization in Road Scenes0
JUMPS: Joints Upsampling Method for Pose Sequences0
Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding0
Knowledge Distillation for Incremental Learning in Semantic Segmentation0
Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation0
Label-Efficient LiDAR Panoptic Segmentation0
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding0
Language-Assisted 3D Scene Understanding0
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding0
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving0
Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges0
Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm0
Large Margin Learning of Upstream Scene Understanding Models0
LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
Leaky Wave Antenna-Equipped RF Chipless Tags for Orientation Estimation0
Learning 3D Robotics Perception using Inductive Priors0
Learning 3D Scene Priors with 2D Supervision0
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions0
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation0
Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey0
Learning-based Relational Object Matching Across Views0
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation0
Learning Densities in Feature Space for Reliable Segmentation of Indoor Scenes0
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length0
Learning Direct Optimization for Scene Understanding0
Learning from Maps: Visual Common Sense for Autonomous Driving0
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation0
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs0
Learning in Audio-visual Context: A Review, Analysis, and New Perspective0
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes0
SceneGPT: A Language Model for 3D Scene Understanding0
Scene Graph Generation: A Comprehensive Survey0
A Comprehensive Survey of Scene Graphs: Generation and Application0
Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding0
Scene-Independent Group Profiling in Crowd0
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?0
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations0
Scene recognition based on DNN and game theory with its applications in human-robot interaction0
SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting0
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames0
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text0
Scene Understanding Enabled Semantic Communication with Open Channel Coding0
Scene Understanding for Autonomous Manipulation with Deep Learning0
Show:102550
← PrevPage 23 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified