SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 11011150 of 1723 papers

TitleStatusHype
Visual Traffic Knowledge Graph Generation from Scene Images0
Plausible Uncertainties for Human Pose Regression0
Self-Supervised Object Detection from Egocentric Videos0
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs0
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation0
RealGraph: A Multiview Dataset for 4D Real-world Context Graph Generation0
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification0
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene UnderstandingCode0
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency0
METEOR Guided Divergence for Video CaptioningCode0
Lightweight integration of 3D features to improve 2D image segmentationCode0
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation0
Cross-Domain Synthetic-to-Real In-the-Wild Depth and Normal Estimation for 3D Scene Understanding0
Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data0
Framework for 2D Ad placements in LinearTV0
3D Object Aided Self-Supervised Monocular Depth Estimation0
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding0
Prediction of Scene Plausibility0
SGDraw: Scene Graph Drawing Interface Using Object-Oriented RepresentationCode0
Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene UnderstandingCode0
Learning 3D Scene Priors with 2D Supervision0
PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples0
Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse WeatherCode0
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors0
Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond AberrationsCode0
An Enhanced Object Detection Model for Scene Graph Generation0
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection0
FlowGrad: Using Motion for Visual Sound Source LocalizationCode0
Visually Grounded VQA by Lattice-based RetrievalCode0
User Identification: A Key Enabler for Multi-User Vision-Aided Communications0
Visual Semantic Parsing: From Images to Abstract Meaning Representation0
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation0
Novel 3D Scene Understanding Applications From Recurrence in a Single Image0
Segmentation-guided Domain Adaptation for Efficient Depth Completion0
EarthNets: Empowering AI in Earth Observation0
Uncertainty-aware LiDAR Panoptic SegmentationCode0
Flow-based GAN for 3D Point Cloud Generation from a Single ImageCode0
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding0
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic SegmentationCode0
A Survey on Knowledge Graph-based Methods for Automated Driving0
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents0
Stochastic Future Prediction in Real World Driving Scenarios0
A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding0
Sequential Cross Attention Based Multi-task LearningCode0
Neuromorphic Visual Scene Understanding with Resonator Networks0
Learning in Audio-visual Context: A Review, Analysis, and New Perspective0
Safety Assessment for Autonomous Systems' Perception Capabilities0
Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point SupervisionCode0
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy0
CompNVS: Novel View Synthesis with Scene Completion0
Show:102550
← PrevPage 23 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified