Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 364 papers

Title	Date	Tasks	Status	Hype	Score
Large-Scale Adversarial Training for Vision-and-Language Representation Learning	Jun 11, 2020	Image-text RetrievalQuestion Answering	CodeCode Available	1	5
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs	Nov 8, 2023	Question AnsweringReferring Expression	CodeCode Available	1	5
URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark	Aug 1, 2020	ObjectOne-shot visual object segmentation	CodeCode Available	1	5
Zero-shot Referring Image Segmentation with Global-Local Context Features	Mar 31, 2023	Image SegmentationReferring Expression	CodeCode Available	1	5
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning	Mar 9, 2021	Deep Reinforcement LearningReferring Expression	CodeCode Available	1	5
Human-centric Spatio-Temporal Video Grounding With Visual Transformers	Nov 10, 2020	Referring ExpressionSentence	CodeCode Available	1	5
Understanding Synonymous Referring Expressions via Contrastive Features	Apr 20, 2021	ObjectReferring Expression	CodeCode Available	0	5
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework	Feb 7, 2022	Image Captioningimage-classification	CodeCode Available	0	5
Deconfounded Visual Grounding	Dec 31, 2021	Referring ExpressionVisual Grounding	CodeCode Available	0	5
Grounding Language in Multi-Perspective Referential Communication	Oct 4, 2024	Referring ExpressionReferring expression generation	CodeCode Available	0	5
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions	Dec 30, 2016	Referring ExpressionReferring Expression Comprehension	CodeCode Available	0	5
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities	Apr 2, 2025	DescriptiveLarge Language Model	CodeCode Available	0	5
Cross-Modal Self-Attention Network for Referring Image Segmentation	Apr 9, 2019	Image SegmentationReferring Expression	CodeCode Available	0	5
Towards Language-guided Visual Recognition via Dynamic Convolutions	Oct 17, 2021	Question AnsweringReferring Expression	CodeCode Available	0	5
Towards Omni-supervised Referring Expression Segmentation	Nov 1, 2023	Referring ExpressionReferring Expression Segmentation	CodeCode Available	0	5
Single-Stream Multi-Level Alignment for Vision-Language Pretraining	Mar 27, 2022	Image-text RetrievalQuestion Answering	CodeCode Available	0	5
Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations?	Jun 8, 2021	Referring ExpressionSelf-Driving Cars	CodeCode Available	0	5
Scene-Text Oriented Reffering Expression Comprehension	Nov 4, 2022	Object LocalizationReferring Expression	CodeCode Available	0	5
Searching for Ambiguous Objects in Videos using Relational Referring Expressions	Aug 3, 2019	Deep AttentionNatural Language Visual Grounding	CodeCode Available	0	5
Grounding Referring Expressions in Images by Variational Context	Dec 5, 2017	Multiple Instance LearningReferring Expression	CodeCode Available	0	5
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach	Oct 3, 2022	Referring ExpressionRobot Manipulation	CodeCode Available	0	5
Generation and Comprehension of Unambiguous Object Descriptions	Nov 7, 2015	Image CaptioningObject	CodeCode Available	0	5
Continual Referring Expression Comprehension via Dual Modular Memorization	Nov 25, 2023	MemorizationReferring Expression	CodeCode Available	0	5
Resilience through Scene Context in Visual Referring Expression Generation	Apr 18, 2024	Referring ExpressionReferring expression generation	CodeCode Available	0	5
Revisiting Counterfactual Problems in Referring Expression Comprehension	Jan 1, 2024	AttributeContrastive Learning	CodeCode Available	0	5
Improving Quality and Efficiency in Plan-based Neural Data-to-Text Generation	Sep 22, 2019	Data-to-Text GenerationReferring Expression	CodeCode Available	0	5
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments	Apr 23, 2019	Referring ExpressionVision and Language Navigation	CodeCode Available	0	5
Referring Expression Comprehension Using Language Adaptive Inference	Jun 6, 2023	object-detectionObject Detection	CodeCode Available	0	5
A Real-time Global Inference Network for One-stage Referring Expression Comprehension	Dec 7, 2019	Diversityfeature selection	CodeCode Available	0	5
Reasoning About Pragmatics with Neural Listeners and Speakers	Apr 2, 2016	Referring ExpressionText Generation	CodeCode Available	0	5
Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos	Sep 21, 2022	Action DetectionAction Recognition	CodeCode Available	0	5
Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding	Sep 9, 2024	Image RetrievalReferring Expression	CodeCode Available	0	5
Collecting Visually-Grounded Dialogue with A Game Of Sorts	Sep 10, 2023	Coreference ResolutionImage Retrieval	CodeCode Available	0	5
Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding	Jul 18, 2022	AttributeReferring Expression	CodeCode Available	0	5
Adversarial Robustness for Visual Grounding of Multimodal Large Language Models	May 16, 2024	Adversarial AttackAdversarial Robustness	CodeCode Available	0	5
Enriching the WebNLG corpus	Nov 1, 2018	Machine TranslationReferring Expression	CodeCode Available	0	5
Enriching the E2E dataset	Aug 1, 2021	Referring ExpressionReferring expression generation	CodeCode Available	0	5
Referring Expression Generation Using Entity Profiles	Sep 4, 2019	Referring ExpressionReferring expression generation	CodeCode Available	0	5
NeuralREG: An end-to-end approach to referring expression generation	May 21, 2018	FormReferring Expression	CodeCode Available	0	5
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions	Jan 3, 2019	DiagnosticImage Segmentation	CodeCode Available	0	5
Language-Conditioned Feature Pyramids for Visual Selection Tasks	Nov 1, 2020	Referring ExpressionReferring Expression Comprehension	CodeCode Available	0	5
Language Adaptive Weight Generation for Multi-task Visual Grounding	Jun 6, 2023	Referring ExpressionReferring Expression Comprehension	CodeCode Available	0	5
Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution	Sep 27, 2021	coreference-resolutionCoreference Resolution	CodeCode Available	0	5
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding	Sep 5, 2019	ObjectReferring Expression	CodeCode Available	0	5
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension	Feb 17, 2023	Referring ExpressionReferring Expression Comprehension	CodeCode Available	0	5
Modeling Context Between Objects for Referring Expression Understanding	Aug 1, 2016	Multiple Instance LearningObject	CodeCode Available	0	5
Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples	May 24, 2023	DiagnosticReferring Expression	CodeCode Available	0	5
MAttNet: Modular Attention Network for Referring Expression Comprehension	Jan 24, 2018	Generalized Referring Expression SegmentationReferring Expression	CodeCode Available	0	5
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding	Aug 28, 2019	AttributeReferring Expression	CodeCode Available	0	5
MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing	Mar 31, 2025	Objectobject-detection	CodeCode Available	0	5

Show:10 25 50

← PrevPage 3 of 8Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Random	Acc@0.5m	14.6	—	Unverified