SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image: nodes correspond to object bounding boxes labeled with their object categories, and edges correspond to the pairwise relationships between those objects. The task of Scene Graph Generation is to generate a visually grounded scene graph that most accurately corresponds to an image.

Source: Scene Graph Generation by Iterative Message Passing
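
The definition above maps naturally onto a small data structure. What follows is a minimal sketch, not taken from the cited paper: the class names (`ObjectNode`, `RelationEdge`, `SceneGraph`), the `(x_min, y_min, x_max, y_max)` box convention, and the example labels are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class ObjectNode:
    """A node: an object bounding box plus its category (hypothetical layout)."""
    category: str
    box: Tuple[float, float, float, float]  # assumed (x_min, y_min, x_max, y_max)


@dataclass(frozen=True)
class RelationEdge:
    """A directed edge: a pairwise relationship between two nodes, by index."""
    subject: int
    predicate: str
    obj: int


@dataclass
class SceneGraph:
    nodes: List[ObjectNode] = field(default_factory=list)
    edges: List[RelationEdge] = field(default_factory=list)

    def add_node(self, category: str, box: Tuple[float, float, float, float]) -> int:
        """Append a node and return its index."""
        self.nodes.append(ObjectNode(category, box))
        return len(self.nodes) - 1

    def add_edge(self, subject: int, predicate: str, obj: int) -> None:
        """Append an edge after checking that both endpoints exist."""
        for idx in (subject, obj):
            if not 0 <= idx < len(self.nodes):
                raise IndexError(f"node index {idx} is out of range")
        self.edges.append(RelationEdge(subject, predicate, obj))

    def triples(self) -> List[Tuple[str, str, str]]:
        """Return edges as (subject category, predicate, object category)."""
        return [
            (self.nodes[e.subject].category, e.predicate, self.nodes[e.obj].category)
            for e in self.edges
        ]


# Illustrative example with made-up boxes and labels.
graph = SceneGraph()
man = graph.add_node("man", (10, 20, 110, 300))
horse = graph.add_node("horse", (60, 120, 320, 340))
graph.add_edge(man, "riding", horse)
print(graph.triples())
```

Storing edges as index pairs rather than category names keeps two objects of the same category distinct, which matters when an image contains, for example, several people.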

Papers

Showing 276-300 of 318 papers

Title | Status | Hype
ReFormer: The Relational Transformer for Image Captioning | Code | 0
Image Scene Graph Generation (SGG) Benchmark |  | 0
Predicate correlation learning for scene graph generation |  | 0
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval |  | 0
Segmentation-grounded Scene Graph Generation |  | 0
Understanding the Role of Scene Graphs in Visual Question Answering |  | 0
Counterfactual Thinking for Long-tailed Information Extraction |  | 0
Topic Scene Graph Generation by Attention Distillation From Caption |  | 0
A Simple Baseline for Weakly-Supervised Scene Graph Generation |  | 0
Self-Supervised Real-to-Sim Scene Generation |  | 0
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments |  | 0
Dual ResGCN for Balanced Scene Graph Generation |  | 0
After All, Only The Last Neuron Matters: Comparing Multi-modal Fusion Functions for Scene Graph Generation | Code | 0
Sim2SG: Sim-to-Real Scene Graph Generation for Transfer Learning |  | 0
Exploring the Hierarchy in Relation Labels for Scene Graph Generation |  | 0
Tackling the Unannotated: Scene Graph Generation with Bias-Reduced Models |  | 0
HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation |  | 0
Assisting Scene Graph Generation with Self-Supervision |  | 0
Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" |  | 0
Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation |  | 0
Visual Relationship Detection using Scene Graphs: A Survey |  | 0
Unbiased Scene Graph Generation via Rich and Fair Semantic Extraction |  | 0
Leveraging Auxiliary Text for Deep Recognition of Unseen Visual Relationships |  | 0
The Limited Multi-Label Projection Layer | Code | 0
Learning Predicates as Functions to Enable Few-shot Scene Graph Prediction |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ExpressiveSGG | R@100 | 39.12 |  | Unverified
2 | NeuSyRE | R@100 | 39.1 |  | Unverified
3 | KnowZRel | zR@100 | 35.65 |  | Unverified
4 | SpeaQ (without reweighting) | Recall@50 | 32.9 |  | Unverified
5 | SpeaQ (with reweighting) | Recall@50 | 32.1 |  | Unverified
6 | Causal-TDE | Recall@50 | 31.93 |  | Unverified
7 | SG-EBM | Recall@50 | 31.74 |  | Unverified
8 | GPS-Net | Recall@50 | 28.9 |  | Unverified
9 | LOGIN | Recall@50 | 28.2 |  | Unverified
10 | VCTree | Recall@50 | 27.9 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ORacle | F1 | 0.91 |  | Unverified
2 | MM2SG | F1 | 0.9 |  | Unverified
3 | Pix2SG | F1 | 0.9 |  | Unverified
4 | LABRAD-OR | F1 | 0.88 |  | Unverified
5 | 4D-OR baseline | F1 | 0.75 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SceneGraphFusion | Top-5 Accuracy | 0.87 |  | Unverified
2 | 3DSSG [Wald2020_3dssg] | Top-5 Accuracy | 0.66 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FactorizableNet | Recall@50 | 18.32 |  | Unverified
2 | VRD | Recall@50 | 18.16 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KnowZRel | zR@100 | 29.56 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MM2SG | Macro F1 | 0.53 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NeuSyRE | R@100 | 38.5 |  | Unverified