SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Papers

Showing 101125 of 318 papers

TitleStatusHype
Prototype-based Embedding Network for Scene Graph GenerationCode1
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language ModelsCode1
Bridging Knowledge Graphs to Generate Scene GraphsCode1
Knowledge-Embedded Routing Network for Scene Graph GenerationCode1
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph GenerationCode1
HL-Net: Heterophily Learning Network for Scene Graph GenerationCode1
Are scene graphs good enough to improve Image Captioning?Code1
PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph GenerationCode1
Relation Transformer NetworkCode1
Structured Sparse R-CNN for Direct Scene Graph GenerationCode1
Unbiased Heterogeneous Scene Graph Generation with Relation-aware Message Passing Neural NetworkCode1
RU-Net: Regularized Unrolling Network for Scene Graph GenerationCode1
EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity UnderstandingCode1
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D SequencesCode1
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph GenerationCode1
Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph GenerationCode0
Situational Scene Graph for Structured Human-centric Situation UnderstandingCode0
Skew Class-balanced Re-weighting for Unbiased Scene Graph GenerationCode0
SGDraw: Scene Graph Drawing Interface Using Object-Oriented RepresentationCode0
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph RetrievalCode0
Scene Graph Generation from Objects, Phrases and Region CaptionsCode0
Image Scene Graph Generation (SGG) BenchmarkCode0
S^2Former-OR: Single-Stage Bi-Modal Transformer for Scene Graph Generation in ORCode0
DSGG: Dense Relation Transformer for an End-to-end Scene Graph GenerationCode0
ReFormer: The Relational Transformer for Image CaptioningCode0
Show:102550
← PrevPage 5 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ExpressiveSGGR@10039.12Unverified
2NeuSyRER@10039.1Unverified
3KnowZRelzR@10035.65Unverified
4SpeaQ (without reweighting)Recall@5032.9Unverified
5SpeaQ (with reweighting)Recall@5032.1Unverified
6Causal-TDERecall@5031.93Unverified
7SG-EBMRecall@5031.74Unverified
8GPS-NetRecall@5028.9Unverified
9LOGINRecall@5028.2Unverified
10VCTreeRecall@5027.9Unverified
#ModelMetricClaimedVerifiedStatus
1ORacleF10.91Unverified
2MM2SGF10.9Unverified
3Pix2SGF10.9Unverified
4LABRAD-ORF10.88Unverified
54D-OR baselineF10.75Unverified
#ModelMetricClaimedVerifiedStatus
1SceneGraphFusionTop-5 Accuracy0.87Unverified
23DSSG [Wald2020_3dssg]Top-5 Accuracy0.66Unverified
#ModelMetricClaimedVerifiedStatus
1FactorizableNetRecall@5018.32Unverified
2VRDRecall@5018.16Unverified
#ModelMetricClaimedVerifiedStatus
1KnowZRelzR@10029.56Unverified
#ModelMetricClaimedVerifiedStatus
1MM2SGMacro F10.53Unverified
#ModelMetricClaimedVerifiedStatus
1NeuSyRER@10038.5Unverified