| Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects | Dec 8, 2023 | Image Captioningobject-detection | —Unverified | 0 |
| M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension | Jul 1, 2024 | GPUReferring Expression | —Unverified | 0 |
| Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and Regression | Sep 5, 2024 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments | Nov 15, 2020 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Jul 29, 2024 | Image GenerationReferring Expression | —Unverified | 0 |
| A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension | Sep 16, 2019 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction | Dec 21, 2023 | 16kAttribute | —Unverified | 0 |
| Commands 4 Autonomous Vehicles (C4AV) Workshop Summary | Sep 18, 2020 | Autonomous VehiclesReferring Expression Comprehension | —Unverified | 0 |
| Evaluating and Improving Interactions with Hazy Oracles | Oct 19, 2021 | Object TrackingReferring Expression | —Unverified | 0 |
| Modular Graph Attention Network for Complex Visual Relational Reasoning | Nov 22, 2020 | Graph AttentionQuestion Answering | —Unverified | 0 |