SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image in which nodes correspond to object bounding boxes with their object categories, and edges correspond to the pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing
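
The definition above can be illustrated with a minimal data-structure sketch. It is not taken from the cited paper: the class names (`ObjectNode`, `RelationEdge`, `SceneGraph`), the `(x_min, y_min, x_max, y_max)` box convention, and the example objects are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ObjectNode:
    """A node: one object with its category and bounding box."""
    category: str
    # Assumed convention: (x_min, y_min, x_max, y_max) in pixels.
    box: tuple


@dataclass(frozen=True)
class RelationEdge:
    """A directed edge: subject --predicate--> object, by node index."""
    subject_index: int
    predicate: str
    object_index: int


@dataclass
class SceneGraph:
    """Hypothetical container holding the nodes and edges of one image."""
    nodes: list = field(default_factory=list)
    edges: list = field(default_factory=list)

    def add_object(self, category, box):
        """Append a node and return its index."""
        self.nodes.append(ObjectNode(category, box))
        return len(self.nodes) - 1

    def relate(self, subject_index, predicate, object_index):
        """Add a pairwise relationship between two existing nodes."""
        for index in (subject_index, object_index):
            if not 0 <= index < len(self.nodes):
                raise IndexError(f"no node with index {index}")
        self.edges.append(RelationEdge(subject_index, predicate, object_index))

    def triplets(self):
        """Return (subject category, predicate, object category) tuples."""
        return [
            (self.nodes[e.subject_index].category,
             e.predicate,
             self.nodes[e.object_index].category)
            for e in self.edges
        ]


# Made-up example image content, for illustration only.
graph = SceneGraph()
man = graph.add_object("man", (10.0, 20.0, 110.0, 300.0))
horse = graph.add_object("horse", (80.0, 120.0, 400.0, 380.0))
graph.relate(man, "riding", horse)
print(graph.triplets())  # [('man', 'riding', 'horse')]
```

Storing edges as index pairs keeps each relationship tied to a specific bounding box, so two objects of the same category remain distinguishable.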

Papers

Showing 101–150 of 318 papers

Title | Status | Hype
Panoptic Video Scene Graph Generation | Code | 1
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation | Code | 1
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge | Code | 1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Code | 1
Two Stream Scene Understanding on Graph Embedding | - | 0
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment | Code | 1
Towards a Unified Transformer-based Framework for Scene Graph Generation and Human-object Interaction Detection | - | 0
Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and Message Passing Neural Network | - | 0
FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal Consistency and Correlation Debiasing | - | 0
VidCoM: Fast Video Comprehension through Large Language Models with Multimodal Tools | - | 0
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation | Code | 1
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions | - | 0
Domain-wise Invariant Learning for Panoptic Scene Graph Generation | - | 0
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Code | 0
Less is More: Toward Zero-Shot Local Scene Graph Generation via Foundation Models | Code | 1
Logical Bias Learning for Object Relation Prediction | - | 0
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation | Code | 1
Predicate Classification Using Optimal Transport Loss in Scene Graph Generation | - | 0
Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention | - | 0
STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation | - | 0
Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction | Code | 1
RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation | - | 0
Haystack: A Panoptic Scene Graph Dataset to Evaluate Rare Predicate Classes | Code | 0
Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding | Code | 0
Head-Tail Cooperative Learning Network for Unbiased Scene Graph Generation | Code | 0
Vision Relation Transformer for Unbiased Scene Graph Generation | Code | 1
RLIPv2: Fast Scaling of Relational Language-Image Pre-training | Code | 1
3D Scene Graph Prediction on Point Clouds Using Knowledge Graphs | - | 0
Compositional Feature Augmentation for Unbiased Scene Graph Generation | Code | 1
Informative Scene Graph Generation via Debiasing | - | 0
Local-Global Information Interaction Debiasing for Dynamic Scene Graph Generation | - | 0
Generalized Unbiased Scene Graph Generation | - | 0
Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph Generation | Code | 0
Improving Scene Graph Generation with Superpixel-Based Interaction Learning | - | 0
Interpretable End-to-End Driving Model for Implicit Scene Understanding | - | 0
Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation | - | 0
Panoptic Scene Graph Generation with Semantics-Prototype Learning | Code | 1
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation | Code | 1
Unbiased Scene Graph Generation via Two-stage Causal Modeling | - | 0
Open-Vocabulary Object Detection via Scene Graph Discovery | - | 0
Manga109Dialog: A Large-scale Dialogue Dataset for Comics Speaker Detection | Code | 1
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation | - | 0
Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation | Code | 0
On Certified Generalization in Structured Prediction | - | 0
Single-Stage Visual Relationship Learning using Conditional Queries | - | 0
Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation | Code | 0
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs | Code | 0
Devil's on the Edges: Selective Quad Attention for Scene Graph Generation | - | 0
Unbiased Scene Graph Generation in Videos | Code | 1
SPAN: Learning Similarity between Scene Graphs and Images with Transformers | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ExpressiveSGG | R@100 | 39.12 | - | Unverified
2 | NeuSyRE | R@100 | 39.1 | - | Unverified
3 | KnowZRel | zR@100 | 35.65 | - | Unverified
4 | SpeaQ (without reweighting) | Recall@50 | 32.9 | - | Unverified
5 | SpeaQ (with reweighting) | Recall@50 | 32.1 | - | Unverified
6 | Causal-TDE | Recall@50 | 31.93 | - | Unverified
7 | SG-EBM | Recall@50 | 31.74 | - | Unverified
8 | GPS-Net | Recall@50 | 28.9 | - | Unverified
9 | LOGIN | Recall@50 | 28.2 | - | Unverified
10 | VCTree | Recall@50 | 27.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ORacle | F1 | 0.91 | - | Unverified
2 | MM2SG | F1 | 0.9 | - | Unverified
3 | Pix2SG | F1 | 0.9 | - | Unverified
4 | LABRAD-OR | F1 | 0.88 | - | Unverified
5 | 4D-OR baseline | F1 | 0.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SceneGraphFusion | Top-5 Accuracy | 0.87 | - | Unverified
2 | 3DSSG [Wald2020_3dssg] | Top-5 Accuracy | 0.66 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FactorizableNet | Recall@50 | 18.32 | - | Unverified
2 | VRD | Recall@50 | 18.16 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KnowZRel | zR@100 | 29.56 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MM2SG | Macro F1 | 0.53 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NeuSyRE | R@100 | 38.5 | - | Unverified