SOTAVerified

Visual Dialog

Visual Dialog requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content. Specifically, given an image, a dialog history, and a follow-up question about the image, the task is to answer the question.
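
As a rough illustration of the inputs described above, a single query could be represented as follows. This is only a minimal sketch: the class names, field names, and the `build_context` helper are hypothetical and are not taken from any particular dataset or library.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DialogTurn:
    """One earlier question-answer exchange about the image."""
    question: str
    answer: str


@dataclass
class VisualDialogExample:
    """Hypothetical container for one Visual Dialog query."""
    image_path: str                                   # the visual content under discussion
    history: List[DialogTurn] = field(default_factory=list)
    question: str = ""                                # the follow-up question to answer


def build_context(example: VisualDialogExample) -> str:
    """Flatten the dialog history and the follow-up question into one text prompt."""
    lines = []
    for turn in example.history:
        lines.append(f"Q: {turn.question}")
        lines.append(f"A: {turn.answer}")
    lines.append(f"Q: {example.question}")
    return "\n".join(lines)
```

A model for the task would consume the image together with a context like the one `build_context` produces and return an answer to the final question.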

Papers

Showing 101-118 of 118 papers

Title | Status | Hype
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient | Code | 0
Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information | - | 0
Connecting Language and Vision to Actions | - | 0
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 | Code | 1
Ask No More: Deciding when to guess in referential visual dialogue | Code | 0
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog | - | 0
Dialog-based Interactive Image Retrieval | Code | 0
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering | - | 0
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog | Code | 1
FlipDial: A Generative Model for Two-Way Visual Dialogue | - | 0
Examining Cooperation in Visual Dialog Models | Code | 0
Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning | - | 0
Visual Reference Resolution using Attention Memory for Visual Dialog | - | 0
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model | Code | 0
Learning to Reason: End-to-End Module Networks for Visual Question Answering | Code | 0
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning | Code | 1
Visual Dialog | Code | 1
Hierarchical Question-Image Co-Attention for Visual Question Answering | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Single | NDCG (x 100) | 78.7 | - | Unverified
2 | P1P2+Distill+Ensemble | NDCG (x 100) | 77.92 | - | Unverified
3 | Ensemble + Fine-tuning | NDCG (x 100) | 76.43 | - | Unverified
4 | ensemble, finetune | NDCG (x 100) | 76.17 | - | Unverified
5 | VD-PCR | NDCG (x 100) | 76.14 | - | Unverified
6 | Ensemble | NDCG (x 100) | 75.35 | - | Unverified
7 | Ensemble + Finetune | NDCG (x 100) | 74.88 | - | Unverified
8 | bert-double-stream-finetuning | NDCG (x 100) | 74.62 | - | Unverified
9 | CE-finetuned, single model | NDCG (x 100) | 74.47 | - | Unverified
10 | 2 | NDCG (x 100) | 73.36 | - | Unverified
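
# | Model | Metric | Claimed | Verified | Status
1 | 9xFGA (VGG) | MRR | 68.92 | - | Unverified
2 | DAN | MRR | 66.38 | - | Unverified
3 | CorefNMN (ResNet-152) | MRR | 64.1 | - | Unverified
4 | CoAtt | MRR | 63.98 | - | Unverified
5 | CorefNMN | MRR | 63.6 | - | Unverified
6 | DualVD | MRR | 62.94 | - | Unverified
7 | SF-QIH-se-2 | MRR | 62.42 | - | Unverified
8 | HCIAE-NP-ATT | MRR | 62.22 | - | Unverified
9 | HieCoAtt-QI | MRR | 57.88 | - | Unverified
10 | AMEM | R@1 | 48.53 | - | Unverified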

# | Model | Metric | Claimed | Verified | Status
1 | 5xFGA + LS | NDCG | 64.04 | - | Unverified
2 | 5xFGA + LS*+ | MRR | 0.71 | - | Unverified
3 | Two-Step | MRR | 0.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Multi-Modal BlenderBot | BLEU-4 | 1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Multi-Modal BlenderBot | BLEU-4 | 1.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Multi-Modal BlenderBot | BLEU-4 | 1.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Multi-Modal BlenderBot | BLEU-4 | 40 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Multi-Modal BlenderBot | BLEU-4 | 2.2 | - | Unverified
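
Several of the tables above report MRR and R@1. As a hedged illustration, the sketch below computes both from the 1-based rank a model assigns to the correct answer for each question. It assumes the standard textbook definitions; the function names and the sample ranks are hypothetical, and the exact evaluation protocol behind each leaderboard entry is not stated here. Note also that the claimed values above appear on both a 0-1 scale and a 0-100 scale, so the raw fractions computed here may need multiplying by 100 to compare.

```python
from typing import Sequence


def mean_reciprocal_rank(ranks: Sequence[int]) -> float:
    """Average of 1/rank, where rank is the 1-based position of the correct answer."""
    return sum(1.0 / r for r in ranks) / len(ranks)


def recall_at_k(ranks: Sequence[int], k: int) -> float:
    """Fraction of questions whose correct answer is ranked within the top k."""
    return sum(1 for r in ranks if r <= k) / len(ranks)


# Hypothetical ranks of the correct answer for four questions.
ranks = [1, 2, 4, 10]
mrr = mean_reciprocal_rank(ranks)   # (1 + 0.5 + 0.25 + 0.1) / 4
r_at_1 = recall_at_k(ranks, 1)      # only the first question is ranked at position 1
print(f"MRR = {mrr:.4f}, R@1 = {r_at_1:.2f}")
```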