SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image in which nodes correspond to object bounding boxes labeled with their object categories, and edges correspond to pairwise relationships between those objects. The task of Scene Graph Generation is to generate a visually grounded scene graph that most accurately corresponds to the image.

Source: Scene Graph Generation by Iterative Message Passing
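
The definition above can be illustrated with a minimal data-structure sketch. This is not taken from the cited paper: the class names (`ObjectNode`, `RelationEdge`, `SceneGraph`), the `(x_min, y_min, x_max, y_max)` box convention, and the choice of directed subject-predicate-object edges are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class ObjectNode:
    """A node: an object category plus its bounding box."""
    category: str
    # Assumed convention: (x_min, y_min, x_max, y_max) in pixels.
    box: Tuple[float, float, float, float]


@dataclass(frozen=True)
class RelationEdge:
    """An edge: a pairwise relationship between two nodes, given by index."""
    subject: int    # index of the subject node
    predicate: str  # relationship label, e.g. "riding"
    obj: int        # index of the object node


@dataclass
class SceneGraph:
    """Hypothetical container holding the nodes and edges of one image."""
    nodes: List[ObjectNode] = field(default_factory=list)
    edges: List[RelationEdge] = field(default_factory=list)

    def add_node(self, category: str, box: Tuple[float, float, float, float]) -> int:
        """Append a node and return its index."""
        self.nodes.append(ObjectNode(category, box))
        return len(self.nodes) - 1

    def add_edge(self, subject: int, predicate: str, obj: int) -> None:
        """Append an edge; both endpoints must be existing node indices."""
        n = len(self.nodes)
        if not (0 <= subject < n and 0 <= obj < n):
            raise IndexError("edge endpoints must be existing node indices")
        self.edges.append(RelationEdge(subject, predicate, obj))

    def triplets(self) -> List[Tuple[str, str, str]]:
        """Return (subject category, predicate, object category) per edge."""
        return [
            (self.nodes[e.subject].category, e.predicate, self.nodes[e.obj].category)
            for e in self.edges
        ]


# Made-up example: two objects and one relationship between them.
graph = SceneGraph()
man = graph.add_node("man", (10.0, 20.0, 110.0, 300.0))
horse = graph.add_node("horse", (60.0, 120.0, 400.0, 380.0))
graph.add_edge(man, "riding", horse)
print(graph.triplets())  # [('man', 'riding', 'horse')]
```

Storing edges as index pairs rather than object references keeps the graph easy to serialize and lets two objects of the same category remain distinct nodes.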

Papers

Showing 76-100 of 318 papers

Title | Status | Hype
ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling | Code | 1
SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos | Code | 1
Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling | Code | 0
EGTR: Extracting Graph from Transformer for Scene Graph Generation | Code | 2
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models | Code | 2
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection | Code | 1
R3CD: Scene Graph to Image Generation with Relation-aware Compositional Contrastive Control Diffusion | - | 0
Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement | - | 0
DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation | Code | 0
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Code | 2
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition | - | 0
Towards Scene Graph Anticipation | Code | 1
S^2Former-OR: Single-Stage Bi-Modal Transformer for Scene Graph Generation in OR | Code | 0
Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning | - | 0
SGTR+: End-to-end Scene Graph Generation with Transformer | Code | 2
TD^2-Net: Toward Denoising and Debiasing for Dynamic Scene Graph Generation | - | 0
Adaptive Self-training Framework for Fine-grained Scene Graph Generation | Code | 1
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models | - | 0
CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning | - | 0
Contextual Associated Triplet Queries for Panoptic Scene Graph Generation | - | 0
ALF: Adaptive Label Finetuning for Scene Graph Generation | - | 0
Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies | - | 0
GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives | Code | 0
HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding | - | 0
HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ExpressiveSGG | R@100 | 39.12 | - | Unverified
2 | NeuSyRE | R@100 | 39.1 | - | Unverified
3 | KnowZRel | zR@100 | 35.65 | - | Unverified
4 | SpeaQ (without reweighting) | Recall@50 | 32.9 | - | Unverified
5 | SpeaQ (with reweighting) | Recall@50 | 32.1 | - | Unverified
6 | Causal-TDE | Recall@50 | 31.93 | - | Unverified
7 | SG-EBM | Recall@50 | 31.74 | - | Unverified
8 | GPS-Net | Recall@50 | 28.9 | - | Unverified
9 | LOGIN | Recall@50 | 28.2 | - | Unverified
10 | VCTree | Recall@50 | 27.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ORacle | F1 | 0.91 | - | Unverified
2 | MM2SG | F1 | 0.9 | - | Unverified
3 | Pix2SG | F1 | 0.9 | - | Unverified
4 | LABRAD-OR | F1 | 0.88 | - | Unverified
5 | 4D-OR baseline | F1 | 0.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SceneGraphFusion | Top-5 Accuracy | 0.87 | - | Unverified
2 | 3DSSG [Wald2020_3dssg] | Top-5 Accuracy | 0.66 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FactorizableNet | Recall@50 | 18.32 | - | Unverified
2 | VRD | Recall@50 | 18.16 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KnowZRel | zR@100 | 29.56 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MM2SG | Macro F1 | 0.53 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NeuSyRE | R@100 | 38.5 | - | Unverified