SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image: nodes correspond to object bounding boxes labeled with their object categories, and edges correspond to the pairwise relationships between those objects. The task of Scene Graph Generation is to generate a visually grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing
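As an illustration of the definition above, the following is a minimal sketch of a scene graph as a data structure. The class names (`ObjectNode`, `SceneGraph`), the `(x_min, y_min, x_max, y_max)` box convention, and the example objects are assumptions made for this sketch; they are not taken from the cited paper or from any particular library.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical box convention for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class ObjectNode:
    """A node: one object bounding box together with its category label."""
    category: str
    box: Box


@dataclass
class SceneGraph:
    """Nodes are detected objects; edges are directed, labeled pairwise relationships."""
    nodes: List[ObjectNode] = field(default_factory=list)
    # Each edge is (subject node index, predicate label, object node index).
    edges: List[Tuple[int, str, int]] = field(default_factory=list)

    def add_node(self, category: str, box: Box) -> int:
        """Append a node and return its index."""
        self.nodes.append(ObjectNode(category, box))
        return len(self.nodes) - 1

    def add_edge(self, subject: int, predicate: str, obj: int) -> None:
        """Add a relationship between two distinct, existing nodes."""
        for index in (subject, obj):
            if not 0 <= index < len(self.nodes):
                raise IndexError(f"no node with index {index}")
        if subject == obj:
            raise ValueError("a relationship must connect two different objects")
        self.edges.append((subject, predicate, obj))

    def triplets(self) -> List[Tuple[str, str, str]]:
        """Return edges as (subject category, predicate, object category)."""
        return [
            (self.nodes[s].category, p, self.nodes[o].category)
            for s, p, o in self.edges
        ]


# Example image content (made up for illustration): a man riding a horse.
graph = SceneGraph()
man = graph.add_node("man", (10.0, 20.0, 110.0, 300.0))
horse = graph.add_node("horse", (60.0, 120.0, 400.0, 380.0))
graph.add_edge(man, "riding", horse)
print(graph.triplets())
```

Storing edges as index triples keeps each relationship tied to a specific bounding box rather than to a category name, so two objects of the same category remain distinguishable.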

Papers

Showing 51-100 of 318 papers

Title | Status | Hype
Bridging Knowledge Graphs to Generate Scene Graphs | Code | 1
Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation | Code | 1
Are scene graphs good enough to improve Image Captioning? | Code | 1
Scenes and Surroundings: Scene Graph Generation using Relation Transformer | Code | 1
NODIS: Neural Ordinary Differential Scene Understanding | Code | 1
EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding | Code | 1
Leveraging Predicate and Triplet Learning for Scene Graph Generation | Code | 1
Energy-Based Learning for Scene Graph Generation | Code | 1
A Review and Efficient Implementation of Scene Graph Generation Metrics | Code | 1
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge | Code | 1
Spatial-Temporal Transformer for Dynamic Scene Graph Generation | Code | 1
SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos | Code | 1
4D-OR: Semantic Scene Graphs for OR Domain Modeling | Code | 1
Linguistic Structures as Weak Supervision for Visual Scene Graph Generation | Code | 1
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation | Code | 1
SPAN: Learning Similarity between Scene Graphs and Images with Transformers | Code | 1
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space | Code | 1
LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph Generation | Code | 1
Compositional Feature Augmentation for Unbiased Scene Graph Generation | Code | 1
Learning and Reasoning with the Graph Structure Representation in Robotic Surgery | Code | 1
Learning to Generate Scene Graph from Natural Language Supervision | Code | 1
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning | Code | 1
Context-Aware Scene Graph Generation With Seq2Seq Transformers | Code | 1
Less is More: Toward Zero-Shot Local Scene Graph Generation via Foundation Models | Code | 1
LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Code | 1
Location-Free Scene Graph Generation | Code | 1
Fine-Grained Predicates Learning for Scene Graph Generation | Code | 1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis | Code | 1
Fine-Grained Scene Graph Generation with Data Transfer | Code | 1
HL-Net: Heterophily Learning Network for Scene Graph Generation | Code | 1
A Fair Ranking and New Model for Panoptic Scene Graph Generation | Code | 1
From General to Specific: Informative Scene Graph Generation via Balance Adjustment | Code | 1
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation | Code | 1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Code | 1
Panoptic Scene Graph Generation with Semantics-Prototype Learning | Code | 1
GeneAnnotator: A Semi-automatic Annotation Tool for Visual Scene Graph | Code | 1
CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation | Code | 1
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation | Code | 1
Knowledge-Embedded Routing Network for Scene Graph Generation | Code | 1
Generative Compositional Augmentations for Scene Graph Prediction | Code | 1
GPS-Net: Graph Property Sensing Network for Scene Graph Generation | Code | 1
Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation | Code | 1
Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation | Code | 1
Graphical Contrastive Losses for Scene Graph Parsing | Code | 1
Learning Visual Commonsense for Robust Scene Graph Generation | Code | 1
One-shot Scene Graph Generation | Code | 1
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection | Code | 1
DIFFVSGG: Diffusion-Driven Online Video Scene Graph Generation | Code | 1
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation | Code | 1
SGFormer: Semantic Graph Transformer for Point Cloud-based 3D Scene Graph Generation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ExpressiveSGG | R@100 | 39.12 | - | Unverified
2 | NeuSyRE | R@100 | 39.1 | - | Unverified
3 | KnowZRelz | R@100 | 35.65 | - | Unverified
4 | SpeaQ (without reweighting) | Recall@50 | 32.9 | - | Unverified
5 | SpeaQ (with reweighting) | Recall@50 | 32.1 | - | Unverified
6 | Causal-TDE | Recall@50 | 31.93 | - | Unverified
7 | SG-EBM | Recall@50 | 31.74 | - | Unverified
8 | GPS-Net | Recall@50 | 28.9 | - | Unverified
9 | LOGIN | Recall@50 | 28.2 | - | Unverified
10 | VCTree | Recall@50 | 27.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ORacle | F1 | 0.91 | - | Unverified
2 | MM2SG | F1 | 0.9 | - | Unverified
3 | Pix2SG | F1 | 0.9 | - | Unverified
4 | LABRAD-OR | F1 | 0.88 | - | Unverified
5 | 4D-OR baseline | F1 | 0.75 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SceneGraphFusion | Top-5 Accuracy | 0.87 | - | Unverified
2 | 3DSSG [Wald2020_3dssg] | Top-5 Accuracy | 0.66 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FactorizableNet | Recall@50 | 18.32 | - | Unverified
2 | VRD | Recall@50 | 18.16 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | KnowZRelz | R@100 | 29.56 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MM2SG | Macro F1 | 0.53 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NeuSyRE | R@100 | 38.5 | - | Unverified