SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 9511000 of 1723 papers

TitleStatusHype
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation0
MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views0
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields0
Natural Language Guided Visual Relationship Detection0
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset0
Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images0
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction0
Near, far: Patch-ordering enhances vision foundation models' scene understanding0
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation0
Designing DNNs for a trade-off between robustness and processing performance in embedded devices0
Neural Implicit Dense Semantic SLAM0
Neural Mesh Refiner for 6-DoF Pose Estimation0
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans0
Neural Projection Mapping Using Reflectance Fields0
Neural Radiance Field-based Visual Rendering: A Comprehensive Review0
Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding0
Neural Radiance Fields for the Real World: A Survey0
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects0
DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations0
Neural RGB(r)D Sensing: Depth and Uncertainty From a Video Camera0
Neural Scene De-Rendering0
Neuromorphic Visual Scene Understanding with Resonator Networks0
Designing Deep Networks for Surface Normal Estimation0
Newtonian Scene Understanding: Unfolding the Dynamics of Objects in Static Images0
Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration0
Unsupervised Foveal Vision Neural Networks with Top-Down Attention0
NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding0
Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods0
Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization0
Non-maximum Suppression Also Closes the Variational Approximation Gap of Multi-object Variational Autoencoders0
Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation0
Depth Not Needed - An Evaluation of RGB-D Feature Encodings for Off-Road Scene Understanding by Convolutional Neural Network0
Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation0
Not Using the Car to See the Sidewalk -- Quantifying and Controlling the Effects of Context in Classification and Segmentation0
Novel 3D Scene Understanding Applications From Recurrence in a Single Image0
Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views0
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving0
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation0
Depth Estimation using Weighted-loss and Transfer Learning0
O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation0
DepthCut: Improved Depth Edge Estimation Using Multiple Unreliable Channels0
Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings0
Object as Distribution0
Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks0
Object-Aware DINO (Oh-A-Dino): Enhancing Self-Supervised Representations for Multi-Object Instance Retrieval0
Object Aware Egocentric Online Action Detection0
AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model0
Object-Centric Scene Representations using Active Inference0
Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization0
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors0
Show:102550
← PrevPage 20 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified