SOTAVerified

Referring Expression

Referring expressions places a bounding box around the instance corresponding to the provided description and image.

Papers

Showing 6170 of 364 papers

TitleStatusHype
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object SegmentationCode1
Kosmos-2: Grounding Multimodal Large Language Models to the WorldCode1
Advancing Referring Expression Segmentation Beyond Single ImageCode1
Zero-shot Referring Image Segmentation with Global-Local Context FeaturesCode1
NS3D: Neuro-Symbolic Grounding of 3D Objects and RelationsCode1
Layout-aware Dreamer for Embodied Referring Expression GroundingCode1
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationCode1
SQA3D: Situated Question Answering in 3D ScenesCode1
VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature AlignmentCode1
Learning to Evaluate Performance of Multi-modal Semantic LocalizationCode1
Show:102550
← PrevPage 7 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RandomAcc@0.5m14.6Unverified