SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 851–875 of 1723 papers

Title | Status | Hype
Dynamic Scene Understanding from Vision-Language Representations | | 0
MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents | | 0
Making Large Language Models Better Planners with Reasoning-Decision Alignment | | 0
Manhattan Scene Understanding via XSlit Imaging | | 0
Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving | | 0
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition | | 0
MapVision: CVPR 2024 Autonomous Grand Challenge Mapless Driving Tech Report | | 0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors | | 0
Dynamic Clustering Transformer Network for Point Cloud Segmentation | | 0
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation | | 0
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding | | 0
DublinCity: Annotated LiDAR Point Cloud and its Applications | | 0
DSNet: An Efficient CNN for Road Scene Segmentation | | 0
Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement | | 0
Adapting to Length Shift: FlexiLength Network for Trajectory Prediction | | 0
DSM: Building A Diverse Semantic Map for 3D Visual Grounding | | 0
Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry | | 0
Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation | | 0
MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning | | 0
Active Scene Understanding via Online Semantic Reconstruction | | 0
A Continuous Occlusion Model for Road Scene Understanding | | 0
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | | 0
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs | | 0
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving | | 0
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified