SOTAVerified

Grounded Situation Recognition

Grounded Situation Recognition aims to produce a structured summary of an image that describes the primary activity (verb), the entities involved in it (nouns), and the bounding box that grounds each entity in the image.
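As an illustration of what such a structured summary might look like in code, here is a minimal Python sketch. Every name in it (`Situation`, `GroundedEntity`, the role keys such as `agent`) is hypothetical and not taken from any listed paper or dataset; the box format `(x_min, y_min, x_max, y_max)` and the convention of `None` for an ungrounded entity are likewise assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
BBox = Tuple[float, float, float, float]


@dataclass
class GroundedEntity:
    """One entity (noun) and, if available, its bounding box."""
    noun: str
    bbox: Optional[BBox] = None  # None: entity has no grounding (assumption)


@dataclass
class Situation:
    """Structured summary: one verb plus the entities that take part in it."""
    verb: str
    # Keys are illustrative role names; the source only speaks of "entities".
    roles: Dict[str, GroundedEntity] = field(default_factory=dict)

    def grounded_roles(self) -> List[str]:
        """Return the role names whose entity has a bounding box."""
        return [name for name, ent in self.roles.items() if ent.bbox is not None]


# Hypothetical example values, invented purely for illustration.
example = Situation(
    verb="feeding",
    roles={
        "agent": GroundedEntity("person", (34.0, 20.0, 210.0, 380.0)),
        "food": GroundedEntity("hay"),
        "eater": GroundedEntity("horse", (190.0, 60.0, 470.0, 400.0)),
    },
)
```

Keeping the verb, the nouns, and the boxes in one record mirrors the three parts of the task definition; a real system would likely also attach confidence scores, which are omitted here.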

Papers

Showing 1-10 of 15 papers

Title | Status | Hype
Attention-Based Context Aware Reasoning for Situation Recognition | Code | 1
Collaborative Transformers for Grounded Situation Recognition | Code | 1
Grounded Situation Recognition | Code | 1
Grounded Situation Recognition with Transformers | Code | 1
Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments | Code | 1
Rethinking the Two-Stage Framework for Grounded Situation Recognition | Code | 1
Situation Recognition with Graph Neural Networks | Code | 1
Mixture-Kernel Graph Attention Network for Situation Recognition | | 0
Dynamic Scene Understanding from Vision-Language Representations | | 0
Recurrent Models for Situation Recognition | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Ours (CoFormer+) | Top-1 Verb | 58.88 | | Unverified
2 | ClipSitu | Top-1 Verb | 58.19 | | Unverified
3 | CoFormer | Top-1 Verb | 44.66 | | Unverified
4 | SituFormer | Top-1 Verb | 44.2 | | Unverified
5 | Kernel GraphNet | Top-1 Verb | 43.27 | | Unverified
6 | GSRTR | Top-1 Verb | 40.63 | | Unverified
7 | JSL | Top-1 Verb | 39.94 | | Unverified
8 | ISL | Top-1 Verb | 39.36 | | Unverified
9 | CAQ + RE-VGG | Top-1 Verb | 38.19 | | Unverified
10 | GraphNet | Top-1 Verb | 36.72 | | Unverified