SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 901950 of 1723 papers

TitleStatusHype
Towards Scene Understanding for Autonomous Operations on Airport ApronsCode1
Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding0
3D Object Aided Self-Supervised Monocular Depth Estimation0
Prediction of Scene Plausibility0
SGDraw: Scene Graph Drawing Interface Using Object-Oriented RepresentationCode0
PLA: Language-Driven Open-Vocabulary 3D Scene UnderstandingCode2
Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene UnderstandingCode0
OpenScene: 3D Scene Understanding with Open VocabulariesCode2
Learning 3D Scene Priors with 2D Supervision0
Language-Assisted 3D Feature Learning for Semantic Scene UnderstandingCode1
PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples0
Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse WeatherCode0
Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond AberrationsCode0
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors0
An Enhanced Object Detection Model for Scene Graph Generation0
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object DetectionCode1
FlowGrad: Using Motion for Visual Sound Source LocalizationCode0
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection0
Visually Grounded VQA by Lattice-based RetrievalCode0
User Identification: A Key Enabler for Multi-User Vision-Aided Communications0
RGB-T Semantic Segmentation with Location, Activation, and SharpeningCode1
Visual Semantic Parsing: From Images to Abstract Meaning Representation0
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real DataCode1
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task modelsCode1
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation0
Segmentation-guided Domain Adaptation for Efficient Depth Completion0
Novel 3D Scene Understanding Applications From Recurrence in a Single Image0
SQA3D: Situated Question Answering in 3D ScenesCode1
EarthNets: Empowering AI in Earth Observation0
Uncertainty-aware LiDAR Panoptic SegmentationCode0
Flow-based GAN for 3D Point Cloud Generation from a Single ImageCode0
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding0
Image Masking for Robust Self-Supervised Monocular Depth EstimationCode1
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier ConvolutionsCode1
Uncertainty-Driven Active Vision for Implicit Scene ReconstructionCode1
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic SegmentationCode0
A Survey on Knowledge Graph-based Methods for Automated Driving0
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents0
Stochastic Future Prediction in Real World Driving Scenarios0
Dynamic Graph Message Passing Networks for Visual RecognitionCode1
A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding0
Segmenting Known Objects and Unseen Unknowns without Prior KnowledgeCode1
Leveraging Large (Visual) Language Models for Robot 3D Scene UnderstandingCode1
MassMIND: Massachusetts Maritime INfrared DatasetCode1
Sequential Cross Attention Based Multi-task LearningCode0
SemSegDepth: A Combined Model for Semantic Segmentation and Depth CompletionCode1
Neuromorphic Visual Scene Understanding with Resonator Networks0
Learning in Audio-visual Context: A Review, Analysis, and New Perspective0
Safety Assessment for Autonomous Systems' Perception Capabilities0
Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point SupervisionCode0
Show:102550
← PrevPage 19 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified