SOTAVerified

Visual Dialog

Visual Dialog requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content. Specifically, given an image, a dialog history, and a follow-up question about the image, the task is to answer the question.
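As a rough illustration of the task's inputs and output, the sketch below models one example as an image reference, a dialog history of question-answer turns, and a follow-up question. All class, field, and function names are hypothetical (this is not a dataset schema), and the flattened text prompt is only one possible way to present the history to a model.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class VisualDialogExample:
    """One task instance (hypothetical structure, for illustration only)."""
    image_path: str  # reference to the image under discussion
    history: List[Tuple[str, str]] = field(default_factory=list)  # earlier (question, answer) turns
    question: str = ""  # the follow-up question the agent must answer


def build_prompt(example: VisualDialogExample) -> str:
    """Flatten the dialog history and the follow-up question into one text context."""
    lines = []
    for q, a in example.history:
        lines.append(f"Q: {q}")
        lines.append(f"A: {a}")
    lines.append(f"Q: {example.question}")
    lines.append("A:")  # the agent's answer would be generated or ranked here
    return "\n".join(lines)


example = VisualDialogExample(
    image_path="images/example.jpg",  # hypothetical path
    history=[("How many people are in the picture?", "Two.")],
    question="What are they doing?",
)
prompt = build_prompt(example)
```

A real agent would combine this text context with features extracted from the image before producing an answer.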

Papers

Showing 51-75 of 118 papers

| Title | Status | Hype |
| --- | --- | --- |
| The Impact of Answers in Referential Visual Dialog | | 0 |
| Variational Disentangled Attention for Regularized Visual Dialog | | 0 |
| GoG: Relation-aware Graph-over-Graph Network for Visual Dialog | | 0 |
| Learning to Ground Visual Objects for Visual Dialog | | 0 |
| Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser | Code | 0 |
| SeqDialN: Sequential Visual Dialog Network in Joint Visual-Linguistic Representation Space | Code | 0 |
| Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation | Code | 0 |
| Visual-Textual Alignment for Graph Inference in Visual Dialog | | 0 |
| Reasoning Over History: Context Aware Visual Dialog | | 0 |
| Multi-Modal Open-Domain Dialogue | | 0 |
| Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue | Code | 0 |
| SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space | Code | 0 |
| Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data | Code | 0 |
| Effective questions in referential visual dialogue | | 0 |
| Towards Visual Dialog for Radiology | | 0 |
| ORD: Object Relationship Discovery for Visual Dialogue Generation | | 0 |
| Modality-Balanced Models for Visual Dialogue | | 0 |
| Ensemble based discriminative models for Visual Dialog Challenge 2018 | | 0 |
| Vision and Language: from Visual Perception to Content Creation | | 0 |
| DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog | Code | 0 |
| TAB-VCR: Tags and Attributes based VCR Baselines | Code | 0 |
| Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | Code | 0 |
| Two Causal Principles for Improving Visual Dialog | Code | 0 |
| DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue | Code | 0 |
| Video Dialog via Progressive Inference and Cross-Transformer | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Single | NDCG (x 100) | 78.7 | | Unverified |
| 2 | P1P2+Distill+Ensemble | NDCG (x 100) | 77.92 | | Unverified |
| 3 | Ensemble + Fine-tuning | NDCG (x 100) | 76.43 | | Unverified |
| 4 | ensemble, finetune | NDCG (x 100) | 76.17 | | Unverified |
| 5 | VD-PCR | NDCG (x 100) | 76.14 | | Unverified |
| 6 | Ensemble | NDCG (x 100) | 75.35 | | Unverified |
| 7 | Ensemble + Finetune | NDCG (x 100) | 74.88 | | Unverified |
| 8 | bert-double-stream-finetuning | NDCG (x 100) | 74.62 | | Unverified |
| 9 | CE-finetuned, single model | NDCG (x 100) | 74.47 | | Unverified |
| 10 | 2 | NDCG (x 100) | 73.36 | | Unverified |
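| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | 9xFGA (VGG) | MRR | 68.92 | | Unverified |
| 2 | DAN | MRR | 66.38 | | Unverified |
| 3 | CorefNMN (ResNet-152) | MRR | 64.1 | | Unverified |
| 4 | CoAtt | MRR | 63.98 | | Unverified |
| 5 | CorefNMN | MRR | 63.6 | | Unverified |
| 6 | DualVD | MRR | 62.94 | | Unverified |
| 7 | SF-QIH-se-2 | MRR | 62.42 | | Unverified |
| 8 | HCIAE-NP-ATT | MRR | 62.22 | | Unverified |
| 9 | HieCoAtt-QI | MRR | 57.88 | | Unverified |
| 10 | AMEM | R@1 | 48.53 | | Unverified |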
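| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | 5xFGA + LS | NDCG | 64.04 | | Unverified |
| 2 | 5xFGA + LS*+ | MRR | 0.71 | | Unverified |
| 3 | Two-Step | MRR | 0.7 | | Unverified |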
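| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-Modal BlenderBot | BLEU-4 | 1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-Modal BlenderBot | BLEU-4 | 1.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-Modal BlenderBot | BLEU-4 | 1.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-Modal BlenderBot | BLEU-4 | 40 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-Modal BlenderBot | BLEU-4 | 2.2 | | Unverified |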
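
Several of the results above are ranking metrics (MRR, R@1, NDCG). For readers unfamiliar with them, here is a minimal sketch of how such scores are commonly computed when a model ranks a list of candidate answers. The function names and toy inputs are hypothetical, and the NDCG variant shown (linear gain, log2 discount, cutoff at the number of relevant candidates) is an assumed formulation; this page does not specify the evaluation protocol behind the reported numbers.

```python
import math
from typing import Sequence


def mean_reciprocal_rank(gt_ranks: Sequence[int]) -> float:
    """gt_ranks[i] is the 1-based rank the model assigned to the ground-truth answer."""
    return sum(1.0 / r for r in gt_ranks) / len(gt_ranks)


def recall_at_k(gt_ranks: Sequence[int], k: int) -> float:
    """Fraction of questions whose ground-truth answer is ranked in the top k."""
    return sum(1 for r in gt_ranks if r <= k) / len(gt_ranks)


def ndcg(ranked_relevance: Sequence[float]) -> float:
    """ranked_relevance holds candidate relevance scores in the model's ranked order."""
    rels = list(ranked_relevance)
    k = sum(1 for rel in rels if rel > 0)  # assumed cutoff: number of relevant candidates
    if k == 0:
        return 0.0

    def dcg(scores):
        # Position i (0-based) is discounted by log2(i + 2).
        return sum(rel / math.log2(i + 2) for i, rel in enumerate(scores[:k]))

    return dcg(rels) / dcg(sorted(rels, reverse=True))


# Toy inputs (made up for illustration, not taken from any benchmark).
toy_ranks = [1, 2, 4]
mrr = mean_reciprocal_rank(toy_ranks)
r_at_1 = recall_at_k(toy_ranks, 1)
perfect = ndcg([1.0, 0.5, 0.0])
imperfect = ndcg([1.0, 0.0, 0.5])
```

These functions return values in the range 0 to 1; multiplying by 100 gives the scaled form used in columns labelled "(x 100)".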