SOTAVerified

Referring expression generation

Generate referring expressions

Papers

Showing 1120 of 84 papers

TitleStatusHype
Intrinsic Task-based Evaluation for Referring Expression Generation0
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language ModelsCode0
GLaMM: Pixel Grounding Large Multimodal ModelCode2
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learningCode7
Improved Baselines with Visual Instruction TuningCode6
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
Whether you can locate or not? Interactive Referring Expression GenerationCode0
Kosmos-2: Grounding Multimodal Large Language Models to the WorldCode1
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One DayCode4
Show:102550
← PrevPage 2 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ColonGPT (w/ LoRA, w/o extra data)Accuray99.96Unverified
2LLaVA-v1.5 (w/ LoRA, w/ extra data)Accuray99.32Unverified
3LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)Accuray99.3Unverified
4MGM-2B (w/o LoRA, w/ extra data)Accuray98.75Unverified
5LLaVA-v1.5 (w/ LoRA, w/o extra data)Accuray98.58Unverified
6MGM-2B (w/o LoRA, w/o extra data)Accuray98.17Unverified
7MobileVLM-1.7B (w/ LoRA, w/ extra data)Accuray97.87Unverified
8MobileVLM-1.7B (w/o LoRA, w/ extra data)Accuray97.78Unverified
9LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)Accuray97.74Unverified
10LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)Accuray97.35Unverified
#ModelMetricClaimedVerifiedStatus
1LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)Accuray70Unverified
2LLaVA-v1 (w/ LoRA, w/ extra data)Accuray46.85Unverified