SOTAVerified

Referring expression generation

Generate referring expressions

Papers

Showing 1–10 of 84 papers

Title | Status | Hype
Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation | Code | 0
Frontiers in Intelligent Colonoscopy | Code | 2
Grounding Language in Multi-Perspective Referential Communication | Code | 0
Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Code | 1
Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Code | 0
Resilience through Scene Context in Visual Referring Expression Generation | Code | 0
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Code | 7
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Code | 1
Efficient Multimodal Learning from Data-centric Perspective | Code | 5

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ColonGPT (w/ LoRA, w/o extra data) | Accuracy | 99.96 |  | Unverified
2 | LLaVA-v1.5 (w/ LoRA, w/ extra data) | Accuracy | 99.32 |  | Unverified
3 | LLaVA-Med-v1.5 (w/ LoRA, w/o extra data) | Accuracy | 99.3 |  | Unverified
4 | MGM-2B (w/o LoRA, w/ extra data) | Accuracy | 98.75 |  | Unverified
5 | LLaVA-v1.5 (w/ LoRA, w/o extra data) | Accuracy | 98.58 |  | Unverified
6 | MGM-2B (w/o LoRA, w/o extra data) | Accuracy | 98.17 |  | Unverified
7 | MobileVLM-1.7B (w/ LoRA, w/ extra data) | Accuracy | 97.87 |  | Unverified
8 | MobileVLM-1.7B (w/o LoRA, w/ extra data) | Accuracy | 97.78 |  | Unverified
9 | LLaVA-Med-v1.0 (w/o LoRA, w/o extra data) | Accuracy | 97.74 |  | Unverified
10 | LLaVA-Med-v1.0 (w/o LoRA, w/ extra data) | Accuracy | 97.35 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaVA-Med-v1.5 (w/ LoRA, w/ extra data) | Accuracy | 70 |  | Unverified
2 | LLaVA-v1 (w/ LoRA, w/ extra data) | Accuracy | 46.85 |  | Unverified