| Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection | Aug 12, 2024 | Human-Object Interaction DetectionZero-Shot Human-Object Interaction Detection | CodeCode Available | 1 | 5 |
| RLIPv2: Fast Scaling of Relational Language-Image Pre-training | Aug 18, 2023 | Graph GenerationHuman-Object Interaction Detection | CodeCode Available | 1 | 5 |
| Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model | May 20, 2023 | DiversityHuman-Object Interaction Detection | CodeCode Available | 1 | 5 |
| ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection | Aug 14, 2020 | Human-Object Interaction DetectionObject | CodeCode Available | 1 | 5 |
| End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation | Apr 1, 2022 | Human-Object Interaction DetectionKnowledge Distillation | CodeCode Available | 1 | 5 |
| Locality-Aware Zero-Shot Human-Object Interaction Detection | May 26, 2025 | Human-Object Interaction DetectionObject | CodeCode Available | 1 | 5 |
| RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning | Apr 24, 2022 | Human-Object Interaction DetectionObject | CodeCode Available | 1 | 5 |
| Boosting Zero-Shot Human-Object Interaction Detection with Vision-Language Transfer | Mar 18, 2024 | Human-Object Interaction DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration | Mar 12, 2024 | DecoderHuman-Object Interaction Detection | —Unverified | 0 | 0 |