Phrase Grounding
Given an image and a corresponding caption, the Phrase Grounding task aims to ground each entity mentioned by a noun phrase in the caption to a region in the image.
Source: Phrase Grounding by Soft-Label Chain Conditional Random Field
Papers
Showing 1–10 of 88 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GBS Ensemble + 12-in-1 | Pointing Game Accuracy | 85.9 | — | Unverified |
| 2 | GbS Ensemble MS-COCO | Pointing Game Accuracy | 75.6 | — | Unverified |
| 3 | COCO_ELMo_PNASNet | Pointing Game Accuracy | 69.19 | — | Unverified |