| Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Oct 21, 2024 | Instruction Followingobject-detection | CodeCode Available | 0 |
| Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples | May 24, 2023 | DiagnosticReferring Expression | CodeCode Available | 0 |
| Continual Referring Expression Comprehension via Dual Modular Memorization | Nov 25, 2023 | MemorizationReferring Expression | CodeCode Available | 0 |
| A Joint Speaker-Listener-Reinforcer Model for Referring Expressions | Dec 30, 2016 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| Whether you can locate or not? Interactive Referring Expression Generation | Aug 19, 2023 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| NeuralREG: An end-to-end approach to referring expression generation | May 21, 2018 | FormReferring Expression | CodeCode Available | 0 |
| Collecting Visually-Grounded Dialogue with A Game Of Sorts | Sep 10, 2023 | Coreference ResolutionImage Retrieval | CodeCode Available | 0 |
| Modeling Context Between Objects for Referring Expression Understanding | Aug 1, 2016 | Multiple Instance LearningObject | CodeCode Available | 0 |
| MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing | Mar 31, 2025 | Objectobject-detection | CodeCode Available | 0 |
| CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Jan 3, 2019 | DiagnosticImage Segmentation | CodeCode Available | 0 |
| CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension | Feb 17, 2023 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| MAttNet: Modular Attention Network for Referring Expression Comprehension | Jan 24, 2018 | Generalized Referring Expression SegmentationReferring Expression | CodeCode Available | 0 |
| Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations? | Jun 8, 2021 | Referring ExpressionSelf-Driving Cars | CodeCode Available | 0 |
| Referring Expression Comprehension Using Language Adaptive Inference | Jun 6, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters | Mar 28, 2020 | ColorizationImage Colorization | CodeCode Available | 0 |
| Localized Symbolic Knowledge Distillation for Visual Commonsense Models | Dec 8, 2023 | Image DescriptionInstruction Following | CodeCode Available | 0 |
| Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge | Jun 2, 2020 | 16kReferring Expression | CodeCode Available | 0 |
| Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach | Oct 3, 2022 | Referring ExpressionRobot Manipulation | CodeCode Available | 0 |
| Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Sep 9, 2024 | Image RetrievalReferring Expression | CodeCode Available | 0 |
| Using Syntax to Ground Referring Expressions in Natural Images | May 26, 2018 | ObjectReferring Expression | CodeCode Available | 0 |
| Referring Expression Generation Using Entity Profiles | Sep 4, 2019 | Referring ExpressionReferring expression generation | CodeCode Available | 0 |
| Generation and Comprehension of Unambiguous Object Descriptions | Nov 7, 2015 | Image CaptioningObject | CodeCode Available | 0 |
| Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers | May 22, 2023 | Referring Expression | CodeCode Available | 0 |
| Referring Expression Object Segmentation with Caption-Aware Consistency | Oct 10, 2019 | Caption GenerationObject | CodeCode Available | 0 |
| A Real-time Global Inference Network for One-stage Referring Expression Comprehension | Dec 7, 2019 | Diversityfeature selection | CodeCode Available | 0 |
| Improving Quality and Efficiency in Plan-based Neural Data-to-Text Generation | Sep 22, 2019 | Data-to-Text GenerationReferring Expression | CodeCode Available | 0 |
| Adversarial Robustness for Visual Grounding of Multimodal Large Language Models | May 16, 2024 | Adversarial AttackAdversarial Robustness | CodeCode Available | 0 |
| WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation | May 24, 2025 | Contrastive LearningReferring Expression | CodeCode Available | 0 |
| A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training | Aug 20, 2024 | Autonomous VehiclesComputational Efficiency | CodeCode Available | 0 |
| Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos | Sep 21, 2022 | Action DetectionAction Recognition | CodeCode Available | 0 |
| Learning To Segment Every Referring Object Point by Point | Jan 1, 2023 | ObjectReferring Expression | CodeCode Available | 0 |
| Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation | May 24, 2021 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| 'What are you referring to?' Evaluating the Ability of Multi-Modal Dialogue Models to Process Clarificational Exchanges | Jul 28, 2023 | Referring Expression | CodeCode Available | 0 |
| Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Jul 18, 2022 | AttributeReferring Expression | CodeCode Available | 0 |
| Language-Conditioned Feature Pyramids for Visual Selection Tasks | Nov 1, 2020 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments | Apr 23, 2019 | Referring ExpressionVision and Language Navigation | CodeCode Available | 0 |
| Towards Language-guided Visual Recognition via Dynamic Convolutions | Oct 17, 2021 | Question AnsweringReferring Expression | CodeCode Available | 0 |
| Resilience through Scene Context in Visual Referring Expression Generation | Apr 18, 2024 | Referring ExpressionReferring expression generation | CodeCode Available | 0 |
| Towards Omni-supervised Referring Expression Segmentation | Nov 1, 2023 | Referring ExpressionReferring Expression Segmentation | CodeCode Available | 0 |
| Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models | Nov 21, 2023 | Image SegmentationLanguage Modelling | CodeCode Available | 0 |
| Revisiting Counterfactual Problems in Referring Expression Comprehension | Jan 1, 2024 | AttributeContrastive Learning | CodeCode Available | 0 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Apr 2, 2025 | DescriptiveLarge Language Model | CodeCode Available | 0 |
| Language Adaptive Weight Generation for Multi-task Visual Grounding | Jun 6, 2023 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| Enriching the WebNLG corpus | Nov 1, 2018 | Machine TranslationReferring Expression | CodeCode Available | 0 |
| Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding | Sep 5, 2019 | ObjectReferring Expression | CodeCode Available | 0 |
| Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation | Apr 22, 2025 | Referring ExpressionReferring expression generation | CodeCode Available | 0 |
| InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation | Nov 30, 2023 | Image CaptioningReferring Expression | CodeCode Available | 0 |
| Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Aug 28, 2019 | AttributeReferring Expression | CodeCode Available | 0 |
| Improving Contrastive Learning for Referring Expression Counting | May 28, 2025 | Contrastive LearningObject Counting | CodeCode Available | 0 |
| Grounding Referring Expressions in Images by Variational Context | Dec 5, 2017 | Multiple Instance LearningReferring Expression | CodeCode Available | 0 |