SOTAVerified

Referring Expression

Referring expression comprehension takes an image and a natural-language description and places a bounding box around the object instance that the description refers to.
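To make the task's output concrete, here is a minimal Python sketch of scoring a predicted box against an annotated one. It assumes axis-aligned boxes in (x1, y1, x2, y2) pixel coordinates and the common intersection-over-union (IoU) criterion with a 0.5 threshold. Neither the box format nor the threshold is stated on this page, and the names `Box`, `iou`, and `accuracy_at_iou` are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Box:
    """Axis-aligned box; (x1, y1) is the top-left corner (assumed format)."""
    x1: float
    y1: float
    x2: float
    y2: float

    def area(self) -> float:
        return max(0.0, self.x2 - self.x1) * max(0.0, self.y2 - self.y1)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two boxes; 0.0 when they do not overlap."""
    inter_w = max(0.0, min(a.x2, b.x2) - max(a.x1, b.x1))
    inter_h = max(0.0, min(a.y2, b.y2) - max(a.y1, b.y1))
    inter = inter_w * inter_h
    union = a.area() + b.area() - inter
    return inter / union if union > 0 else 0.0


def accuracy_at_iou(preds: Sequence[Box], gts: Sequence[Box],
                    threshold: float = 0.5) -> float:
    """Fraction of predictions whose IoU with the annotation meets the threshold."""
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same length")
    if not preds:
        return 0.0
    hits = sum(1 for p, g in zip(preds, gts) if iou(p, g) >= threshold)
    return hits / len(preds)
```

Under these assumptions, a prediction counts as correct only when it overlaps the annotated instance by at least half of their combined area, so a box that merely touches the right object does not score.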

Papers

Showing 11-20 of 364 papers

Title | Status | Hype
Text4Seg: Reimagining Image Segmentation as Text Generation | Code | 2
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Code | 2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models | Code | 2
F-LMM: Grounding Frozen Large Multimodal Models | Code | 2
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Code | 2
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation | Code | 2
NExT-Chat: An LMM for Chat, Detection and Segmentation | Code | 2
GLaMM: Pixel Grounding Large Multimodal Model | Code | 2
GREC: Generalized Referring Expression Comprehension | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Random | Acc@0.5m | 14.6 | - | Unverified