SOTAVerified

Referring Expression Comprehension

Papers

Showing 76100 of 167 papers

TitleStatusHype
Exploring Spatial Language Grounding Through Referring Expressions0
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis0
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks0
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension0
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension0
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding0
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal ModelsCode0
Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and Regression0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection TrainingCode0
Revisiting Multi-Modal LLM Evaluation0
MaskInversion: Localized Embeddings via Optimization of Explainability Maps0
Learning Visual Grounding from Generative Vision and Language Model0
The Solution for the 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge0
M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension0
Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO0
ScanFormer: Referring Expression Comprehension by Iteratively Scanning0
Adversarial Robustness for Visual Grounding of Multimodal Large Language ModelsCode0
Text-driven Affordance Learning from Egocentric Vision0
PropTest: Automatic Property Testing for Improved Visual Programming0
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training0
Revisiting Counterfactual Problems in Referring Expression ComprehensionCode0
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction0
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.