Phrase Grounding
Given an image and a corresponding caption, the Phrase Grounding task aims to ground each entity mentioned by a noun phrase in the caption to a region in the image.
Source: Phrase Grounding by Soft-Label Chain Conditional Random Field
Papers
Showing 1–10 of 88 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fiber-B | R@1 | 87.1 | — | Unverified |
| 2 | PEVL | R@1 | 84.1 | — | Unverified |
| 3 | VisualBERT | R@1 | 70.4 | — | Unverified |