SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 101150 of 364 papers

TitleStatusHype
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression ComprehensionCode1
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEsCode1
3D-GRES: Generalized 3D Referring Expression SegmentationCode1
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring ExpressionCode1
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression SegmentationCode1
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
Towards Language-guided Visual Recognition via Dynamic ConvolutionsCode0
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target GranularitiesCode0
Deconfounded Visual GroundingCode0
Grounding Referring Expressions in Images by Variational ContextCode0
Grounding Language in Multi-Perspective Referential CommunicationCode0
A Joint Speaker-Listener-Reinforcer Model for Referring ExpressionsCode0
Griffon: Spelling out All Object Locations at Any Granularity with Large Language ModelsCode0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal ModelsCode0
Cross-Modal Self-Attention Network for Referring Image SegmentationCode0
Understanding Synonymous Referring Expressions via Contrastive FeaturesCode0
Single-Stream Multi-Level Alignment for Vision-Language PretrainingCode0
Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations?Code0
Give Me Something to Eat: Referring Expression Comprehension with Commonsense KnowledgeCode0
Scene-Text Oriented Reffering Expression ComprehensionCode0
Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic ApproachCode0
Searching for Ambiguous Objects in Videos using Relational Referring ExpressionsCode0
Generation and Comprehension of Unambiguous Object DescriptionsCode0
Towards Omni-supervised Referring Expression SegmentationCode0
Continual Referring Expression Comprehension via Dual Modular MemorizationCode0
Resilience through Scene Context in Visual Referring Expression GenerationCode0
Revisiting Counterfactual Problems in Referring Expression ComprehensionCode0
Improving Quality and Efficiency in Plan-based Neural Data-to-Text GenerationCode0
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor EnvironmentsCode0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkCode0
Referring Expression Comprehension Using Language Adaptive InferenceCode0
A Real-time Global Inference Network for One-stage Referring Expression ComprehensionCode0
Reasoning About Pragmatics with Neural Listeners and SpeakersCode0
Exploring Modulated Detection Transformer as a Tool for Action Recognition in VideosCode0
Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension GuidingCode0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression GroundingCode0
Adversarial Robustness for Visual Grounding of Multimodal Large Language ModelsCode0
Enriching the WebNLG corpusCode0
Enriching the E2E datasetCode0
Referring Expression Generation Using Entity ProfilesCode0
NeuralREG: An end-to-end approach to referring expression generationCode0
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring ExpressionsCode0
Language-Conditioned Feature Pyramids for Visual Selection TasksCode0
Language Adaptive Weight Generation for Multi-task Visual GroundingCode0
Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolutionCode0
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression GroundingCode0
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression ComprehensionCode0
Modeling Context Between Objects for Referring Expression UnderstandingCode0
Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from ExamplesCode0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified