| Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection | Mar 26, 2024 | RelationRelationship Detection | CodeCode Available | 1 |
| Distance-Aware Occlusion Detection with Focused Attention | Aug 23, 2022 | DecoderHuman-Object Interaction Detection | CodeCode Available | 1 |
| Neural Message Passing for Visual Relationship Detection | Aug 8, 2022 | Relationship DetectionVisual Relationship Detection | CodeCode Available | 1 |
| 2.5D Visual Relationship Detection | Apr 26, 2021 | BenchmarkingDepth Estimation | CodeCode Available | 1 |
| Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection | Jan 1, 2021 | Common Sense ReasoningGraph Generation | CodeCode Available | 1 |
| LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos | Dec 17, 2020 | Human-Object Interaction DetectionRelationship Detection | CodeCode Available | 1 |
| One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks | Nov 21, 2020 | AllInstance Segmentation | CodeCode Available | 1 |
| Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations | Sep 10, 2020 | Objectobject-detection | CodeCode Available | 1 |
| Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks | Jun 16, 2020 | Graph Neural NetworkHuman-Object Interaction Detection | CodeCode Available | 1 |
| NODIS: Neural Ordinary Differential Scene Understanding | Jan 14, 2020 | AllGraph Generation | CodeCode Available | 1 |
| Compensating Supervision Incompleteness with Prior Knowledge in Semantic Image Interpretation | Oct 1, 2019 | ObjectRelation | CodeCode Available | 1 |
| Graphical Contrastive Losses for Scene Graph Parsing | Mar 7, 2019 | Relationship DetectionScene Graph Generation | CodeCode Available | 1 |
| Large-Scale Visual Relationship Understanding | Apr 27, 2018 | Relationship Detection | CodeCode Available | 1 |
| METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection | May 10, 2025 | Objectobject-detection | CodeCode Available | 0 |
| End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting | Sep 19, 2024 | DecoderObject | —Unverified | 0 |
| A Review of Human-Object Interaction Detection | Aug 20, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching | Jun 5, 2024 | cross-modal alignmentImage-text matching | —Unverified | 0 |
| AUG: A New Dataset and An Efficient Model for Aerial Image Urban Scene Graph Generation | Apr 11, 2024 | Graph GenerationRelationship Detection | —Unverified | 0 |
| Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | Mar 21, 2024 | DecoderObject | —Unverified | 0 |
| Video Relationship Detection Using Mixture of Experts | Mar 6, 2024 | Action RecognitionMixture-of-Experts | CodeCode Available | 0 |
| RelVAE: Generative Pretraining for few-shot Visual Relationship Detection | Nov 27, 2023 | Predicate ClassificationRelationship Detection | —Unverified | 0 |
| Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction | Nov 8, 2023 | Predicate DetectionRelationship Detection | CodeCode Available | 0 |
| STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning | Sep 13, 2023 | RelationRelationship Detection | CodeCode Available | 0 |
| NeSy4VRD: A Multifaceted Resource for Neurosymbolic AI Research using Knowledge Graphs in Visual Relationship Detection | May 22, 2023 | Knowledge GraphsRelationship Detection | —Unverified | 0 |
| MMRDN: Consistent Representation for Multi-View Manipulation Relationship Detection in Object-Stacked Scenes | Apr 25, 2023 | PositionRelationship Detection | —Unverified | 0 |