SOTAVerified

Situation Recognition

Situation Recognition aims to produce the structured image summary which describes the primary activity (verb), and its relevant entities (nouns).

Papers

Showing 110 of 12 papers

TitleStatusHype
Dynamic Scene Understanding from Vision-Language Representations0
ClipSitu: Effectively Leveraging CLIP for Conditional Predictions in Situation RecognitionCode0
Collaborative Transformers for Grounded Situation RecognitionCode1
Rethinking the Two-Stage Framework for Grounded Situation RecognitionCode1
Grounded Situation Recognition with TransformersCode1
Attention-Based Context Aware Reasoning for Situation RecognitionCode1
Grounded Situation RecognitionCode1
Mixture-Kernel Graph Attention Network for Situation Recognition0
Situation Recognition with Graph Neural NetworksCode1
Recurrent Models for Situation Recognition0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OursTop-1 Verb58.88Unverified
2ClipSituTop-1 Verb47.23Unverified
3CoFormerTop-1 Verb44.66Unverified
4SituFormerTop-1 Verb44.2Unverified
5Kernel GraphNetTop-1 Verb43.27Unverified
6GSRTRTop-1 Verb40.63Unverified
7JSLTop-1 Verb39.94Unverified
8ISLTop-1 Verb39.36Unverified
9CAQ + RE-VGGTop-1 Verb38.19Unverified
10GraphNetTop-1 Verb36.72Unverified