SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1251–1300 of 1723 papers

Title | Status | Hype
Real-time Semantic Segmentation with Context Aggregation Network |  | 0
BYE: Build Your Encoder with One Sequence of Exploration Data for Long-Term Dynamic Scene Understanding |  | 0
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model |  | 0
BUTLER: Building Understanding in TextWorld via Language for Embodied Reasoning |  | 0
Building an Affordances Map with Interactive Perception |  | 0
Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting |  | 0
S^3M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving |  | 0
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation |  | 0
S4C: Self-Supervised Semantic Scene Completion with Neural Fields |  | 0
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments |  | 0
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection |  | 0
Safety Assessment for Autonomous Systems' Perception Capabilities |  | 0
BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization |  | 0
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data |  | 0
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes |  | 0
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation |  | 0
SAM-Guided Masked Token Prediction for 3D Scene Understanding |  | 0
SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment |  | 0
Boundary Seeking GANs |  | 0
Scale-aware Neural Network for Semantic Segmentation of Multi-resolution Remote Sensing Images |  | 0
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset |  | 0
Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans |  | 0
Bottom-up Instance Segmentation using Deep Higher-Order CRFs |  | 0
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding |  | 0
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation |  | 0
Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning |  | 0
Scenarios: A New Representation for Complex Scene Understanding |  | 0
Scene-aware Human Pose Generation using Transformer |  | 0
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation |  | 0
Visual Lexicon: Rich Image Features in Language Space |  | 0
Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation |  | 0
BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight |  | 0
SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis |  | 0
Counterfactual Critic Multi-Agent Training for Scene Graph Generation |  | 0
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes |  | 0
SceneGPT: A Language Model for 3D Scene Understanding |  | 0
BlindSpotNet: Seeing Where We Cannot See |  | 0
Scene Graph Generation: A Comprehensive Survey |  | 0
ZRG: A Dataset for Multimodal 3D Residential Rooftop Understanding |  | 0
A Comprehensive Survey of Scene Graphs: Generation and Application |  | 0
Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding |  | 0
Scene-Independent Group Profiling in Crowd |  | 0
Scene Map-based Prompt Tuning for Navigation Instruction Generation |  | 0
3D Question Answering for City Scene Understanding |  | 0
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation? |  | 0
3D Pose Regression using Convolutional Neural Networks |  | 0
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations |  | 0
Scene recognition based on DNN and game theory with its applications in human-robot interaction |  | 0
SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting |  | 0
Blending Learning and Inference in Structured Prediction |  | 0
Page 26 of 35

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 |  | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 |  | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 |  | Unverified