Phrase Grounding
Given an image and a corresponding caption, the Phrase Grounding task aims to ground each entity mentioned by a noun phrase in the caption to a region in the image.
Source: Phrase Grounding by Soft-Label Chain Conditional Random Field
Papers
Showing 1–10 of 88 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GbS VG | Pointing Game Accuracy | 55.91 | — | Unverified |
| 2 | VG_ELMo_PNASNet | Pointing Game Accuracy | 55.16 | — | Unverified |
| 3 | GbS Ensemble MS-COCO | Pointing Game Accuracy | 54.55 | — | Unverified |