SOTAVerified

Grounded Situation Recognition

Grounded Situation Recognition aims to produce the structured image summary which describes the primary activity (verb), its relevant entities (nouns), and their bounding-box groundings.

Papers

No papers found.

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Ours (CoFormer+)Top-1 Verb58.88Unverified
2ClipSituTop-1 Verb58.19Unverified
3CoFormerTop-1 Verb44.66Unverified
4SituFormerTop-1 Verb44.2Unverified
5Kernel GraphNetTop-1 Verb43.27Unverified
6GSRTRTop-1 Verb40.63Unverified
7JSLTop-1 Verb39.94Unverified
8ISLTop-1 Verb39.36Unverified
9CAQ + RE-VGGTop-1 Verb38.19Unverified
10GraphNetTop-1 Verb36.72Unverified