SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Papers

Showing 150 of 318 papers

TitleStatusHype
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery0
CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations0
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions0
Open World Scene Graph Generation using Vision Language ModelsCode2
Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments0
EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity UnderstandingCode1
A Reverse Causal Framework to Mitigate Spurious Correlations for Debiasing Scene Graph Generation0
LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical StudyCode0
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation0
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph RetrievalCode0
ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling0
Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension0
Robo-SGG: Exploiting Layout-Oriented Normalization and Restitution for Robust Scene Graph Generation0
Generalized Visual Relation Detection with Diffusion Models0
SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos0
NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving0
A Causal Adjustment Module for Debiasing Scene Graph Generation0
Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation0
What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?0
Universal Scene Graph Generation0
DIFFVSGG: Diffusion-Driven Online Video Scene Graph GenerationCode1
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation0
FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction0
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical EnvironmentsCode2
Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing0
Weakly Supervised Video Scene Graph Generation via Natural Language SupervisionCode1
KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph GenerationCode0
Leveraging V2X for Collaborative HD Maps Construction Using Scene Graph Generation0
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation0
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene0
Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features0
Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation0
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation0
RelationField: Relate Anything in Radiance FieldsCode2
RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype LearningCode1
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation0
Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph GenerationCode0
Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation0
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language ModelsCode1
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial RelationsCode1
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation0
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation0
Unbiased Scene Graph Generation by Type-Aware Message Passing on Heterogeneous and Dual Graphs0
Federated Voxel Scene Graph for Intracranial HemorrhageCode0
Situational Scene Graph for Structured Human-centric Situation UnderstandingCode0
Scene Graph Generation with Role-Playing Large Language ModelsCode1
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation0
Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation0
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data0
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ExpressiveSGGR@10039.12Unverified
2NeuSyRER@10039.1Unverified
3KnowZRelzR@10035.65Unverified
4SpeaQ (without reweighting)Recall@5032.9Unverified
5SpeaQ (with reweighting)Recall@5032.1Unverified
6Causal-TDERecall@5031.93Unverified
7SG-EBMRecall@5031.74Unverified
8GPS-NetRecall@5028.9Unverified
9LOGINRecall@5028.2Unverified
10VCTreeRecall@5027.9Unverified
#ModelMetricClaimedVerifiedStatus
1ORacleF10.91Unverified
2MM2SGF10.9Unverified
3Pix2SGF10.9Unverified
4LABRAD-ORF10.88Unverified
54D-OR baselineF10.75Unverified
#ModelMetricClaimedVerifiedStatus
1SceneGraphFusionTop-5 Accuracy0.87Unverified
23DSSG [Wald2020_3dssg]Top-5 Accuracy0.66Unverified
#ModelMetricClaimedVerifiedStatus
1FactorizableNetRecall@5018.32Unverified
2VRDRecall@5018.16Unverified
#ModelMetricClaimedVerifiedStatus
1KnowZRelzR@10029.56Unverified
#ModelMetricClaimedVerifiedStatus
1MM2SGMacro F10.53Unverified
#ModelMetricClaimedVerifiedStatus
1NeuSyRER@10038.5Unverified