SOTAVerified

Scene Graph Generation

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Papers

Showing 150 of 318 papers

TitleStatusHype
4D Panoptic Scene Graph GenerationCode3
RelationField: Relate Anything in Radiance FieldsCode2
RelTR: Relation Transformer for Scene Graph GenerationCode2
Open World Scene Graph Generation using Vision Language ModelsCode2
EGTR: Extracting Graph from Transformer for Scene Graph GenerationCode2
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language UnderstandingCode2
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language ModelsCode2
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph GenerationCode2
SGTR+: End-to-end Scene Graph Generation with TransformerCode2
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph GenerationCode2
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical EnvironmentsCode2
STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite ImageryCode2
Learning to Compose Dynamic Tree Structures for Visual ContextsCode2
Unbiased Scene Graph Generation from Biased TrainingCode2
Panoptic Scene Graph GenerationCode2
Learning Visual Commonsense for Robust Scene Graph GenerationCode1
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic SpaceCode1
Less is More: Toward Zero-Shot Local Scene Graph Generation via Foundation ModelsCode1
SPAN: Learning Similarity between Scene Graphs and Images with TransformersCode1
Knowledge-Embedded Routing Network for Scene Graph GenerationCode1
Learning and Reasoning with the Graph Structure Representation in Robotic SurgeryCode1
Leveraging Predicate and Triplet Learning for Scene Graph GenerationCode1
DIFFVSGG: Diffusion-Driven Online Video Scene Graph GenerationCode1
LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph GenerationCode1
Bridging Knowledge Graphs to Generate Scene GraphsCode1
Are scene graphs good enough to improve Image Captioning?Code1
BusyBot: Learning to Interact, Reason, and Plan in a BusyBoard EnvironmentCode1
Adaptive Self-training Framework for Fine-grained Scene Graph GenerationCode1
A Review and Efficient Implementation of Scene Graph Generation MetricsCode1
Learning to Generate Scene Graph from Natural Language SupervisionCode1
Biasing Like Human: A Cognitive Bias Framework for Scene Graph GenerationCode1
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship DetectionCode1
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph GenerationCode1
GPS-Net: Graph Property Sensing Network for Scene Graph GenerationCode1
Fully Convolutional Scene Graph GenerationCode1
Graph Density-Aware Losses for Novel Compositions in Scene Graph GenerationCode1
HL-Net: Heterophily Learning Network for Scene Graph GenerationCode1
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and ReasoningCode1
Dual-branch Hybrid Learning Network for Unbiased Scene Graph GenerationCode1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph AnalysisCode1
GeneAnnotator: A Semi-automatic Annotation Tool for Visual Scene GraphCode1
Generative Compositional Augmentations for Scene Graph PredictionCode1
Graphical Contrastive Losses for Scene Graph ParsingCode1
Graph R-CNN for Scene Graph GenerationCode1
Fine-Grained Predicates Learning for Scene Graph GenerationCode1
Dense Relational Image Captioning via Multi-task Triple-Stream NetworksCode1
Context-Aware Scene Graph Generation With Seq2Seq TransformersCode1
4D-OR: Semantic Scene Graphs for OR Domain ModelingCode1
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph GenerationCode1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and RetentionCode1
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ExpressiveSGGR@10039.12Unverified
2NeuSyRER@10039.1Unverified
3KnowZRelzR@10035.65Unverified
4SpeaQ (without reweighting)Recall@5032.9Unverified
5SpeaQ (with reweighting)Recall@5032.1Unverified
6Causal-TDERecall@5031.93Unverified
7SG-EBMRecall@5031.74Unverified
8GPS-NetRecall@5028.9Unverified
9LOGINRecall@5028.2Unverified
10VCTreeRecall@5027.9Unverified
#ModelMetricClaimedVerifiedStatus
1ORacleF10.91Unverified
2MM2SGF10.9Unverified
3Pix2SGF10.9Unverified
4LABRAD-ORF10.88Unverified
54D-OR baselineF10.75Unverified
#ModelMetricClaimedVerifiedStatus
1SceneGraphFusionTop-5 Accuracy0.87Unverified
23DSSG [Wald2020_3dssg]Top-5 Accuracy0.66Unverified
#ModelMetricClaimedVerifiedStatus
1FactorizableNetRecall@5018.32Unverified
2VRDRecall@5018.16Unverified
#ModelMetricClaimedVerifiedStatus
1KnowZRelzR@10029.56Unverified
#ModelMetricClaimedVerifiedStatus
1MM2SGMacro F10.53Unverified
#ModelMetricClaimedVerifiedStatus
1NeuSyRER@10038.5Unverified