| A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension | Apr 17, 2022 | Data Augmentation, Referring Expression | Code Available | 1 | 5 |
| GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs | Nov 8, 2023 | Question Answering, Referring Expression | Code Available | 1 | 5 |
| 3D-GRES: Generalized 3D Referring Expression Segmentation | Jul 30, 2024 | Object, Referring Expression | Code Available | 1 | 5 |
| Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | Jun 19, 2021 | Instruction Following, Navigate | Code Available | 1 | 5 |
| SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation | Jun 3, 2024 | Pseudo Label, Referring Expression | Code Available | 1 | 5 |
| PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models | May 23, 2022 | Language Modeling | Code Available | 1 | 5 |
| Towards Language-guided Visual Recognition via Dynamic Convolutions | Oct 17, 2021 | Question Answering, Referring Expression | Code Available | 0 | 5 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Apr 2, 2025 | Descriptive, Large Language Model | Code Available | 0 | 5 |
| Deconfounded Visual Grounding | Dec 31, 2021 | Referring Expression, Visual Grounding | Code Available | 0 | 5 |
| Grounding Referring Expressions in Images by Variational Context | Dec 5, 2017 | Multiple Instance Learning, Referring Expression | Code Available | 0 | 5 |
| Grounding Language in Multi-Perspective Referential Communication | Oct 4, 2024 | Referring Expression, Referring expression generation | Code Available | 0 | 5 |
| A Joint Speaker-Listener-Reinforcer Model for Referring Expressions | Dec 30, 2016 | Referring Expression, Referring Expression Comprehension | Code Available | 0 | 5 |
| Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models | Nov 24, 2023 | All, Referring Expression | Code Available | 0 | 5 |
| Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Oct 21, 2024 | Instruction Following, object-detection | Code Available | 0 | 5 |
| Cross-Modal Self-Attention Network for Referring Image Segmentation | Apr 9, 2019 | Image Segmentation, Referring Expression | Code Available | 0 | 5 |
| Understanding Synonymous Referring Expressions via Contrastive Features | Apr 20, 2021 | Object, Referring Expression | Code Available | 0 | 5 |
| Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Mar 27, 2022 | Image-text Retrieval, Question Answering | Code Available | 0 | 5 |
| Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations? | Jun 8, 2021 | Referring Expression, Self-Driving Cars | Code Available | 0 | 5 |
| Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge | Jun 2, 2020 | 16k, Referring Expression | Code Available | 0 | 5 |
| Scene-Text Oriented Referring Expression Comprehension | Nov 4, 2022 | Object Localization, Referring Expression | Code Available | 0 | 5 |
| Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach | Oct 3, 2022 | Referring Expression, Robot Manipulation | Code Available | 0 | 5 |
| Searching for Ambiguous Objects in Videos using Relational Referring Expressions | Aug 3, 2019 | Deep Attention, Natural Language Visual Grounding | Code Available | 0 | 5 |
| Generation and Comprehension of Unambiguous Object Descriptions | Nov 7, 2015 | Image Captioning, Object | Code Available | 0 | 5 |
| Towards Omni-supervised Referring Expression Segmentation | Nov 1, 2023 | Referring Expression, Referring Expression Segmentation | Code Available | 0 | 5 |
| Continual Referring Expression Comprehension via Dual Modular Memorization | Nov 25, 2023 | Memorization, Referring Expression | Code Available | 0 | 5 |
| Resilience through Scene Context in Visual Referring Expression Generation | Apr 18, 2024 | Referring Expression, Referring expression generation | Code Available | 0 | 5 |
| Revisiting Counterfactual Problems in Referring Expression Comprehension | Jan 1, 2024 | Attribute, Contrastive Learning | Code Available | 0 | 5 |
| Improving Quality and Efficiency in Plan-based Neural Data-to-Text Generation | Sep 22, 2019 | Data-to-Text Generation, Referring Expression | Code Available | 0 | 5 |
| REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments | Apr 23, 2019 | Referring Expression, Vision and Language Navigation | Code Available | 0 | 5 |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Feb 7, 2022 | Image Captioning, image-classification | Code Available | 0 | 5 |
| Referring Expression Comprehension Using Language Adaptive Inference | Jun 6, 2023 | Object Detection | Code Available | 0 | 5 |
| A Real-time Global Inference Network for One-stage Referring Expression Comprehension | Dec 7, 2019 | Diversity, feature selection | Code Available | 0 | 5 |
| Reasoning About Pragmatics with Neural Listeners and Speakers | Apr 2, 2016 | Referring Expression, Text Generation | Code Available | 0 | 5 |
| Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos | Sep 21, 2022 | Action Detection, Action Recognition | Code Available | 0 | 5 |
| Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Sep 9, 2024 | Image Retrieval, Referring Expression | Code Available | 0 | 5 |
| Collecting Visually-Grounded Dialogue with A Game Of Sorts | Sep 10, 2023 | Coreference Resolution, Image Retrieval | Code Available | 0 | 5 |
| Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding | Jul 18, 2022 | Attribute, Referring Expression | Code Available | 0 | 5 |
| Adversarial Robustness for Visual Grounding of Multimodal Large Language Models | May 16, 2024 | Adversarial Attack, Adversarial Robustness | Code Available | 0 | 5 |
| Enriching the WebNLG corpus | Nov 1, 2018 | Machine Translation, Referring Expression | Code Available | 0 | 5 |
| Enriching the E2E dataset | Aug 1, 2021 | Referring Expression, Referring expression generation | Code Available | 0 | 5 |
| Referring Expression Generation Using Entity Profiles | Sep 4, 2019 | Referring Expression, Referring expression generation | Code Available | 0 | 5 |
| NeuralREG: An end-to-end approach to referring expression generation | May 21, 2018 | Form, Referring Expression | Code Available | 0 | 5 |
| CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Jan 3, 2019 | Diagnostic, Image Segmentation | Code Available | 0 | 5 |
| Language-Conditioned Feature Pyramids for Visual Selection Tasks | Nov 1, 2020 | Referring Expression, Referring Expression Comprehension | Code Available | 0 | 5 |
| Language Adaptive Weight Generation for Multi-task Visual Grounding | Jun 6, 2023 | Referring Expression, Referring Expression Comprehension | Code Available | 0 | 5 |
| Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution | Sep 27, 2021 | Coreference Resolution | Code Available | 0 | 5 |
| Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding | Sep 5, 2019 | Object, Referring Expression | Code Available | 0 | 5 |
| CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension | Feb 17, 2023 | Referring Expression, Referring Expression Comprehension | Code Available | 0 | 5 |
| Modeling Context Between Objects for Referring Expression Understanding | Aug 1, 2016 | Multiple Instance Learning, Object | Code Available | 0 | 5 |
| Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples | May 24, 2023 | Diagnostic, Referring Expression | Code Available | 0 | 5 |