| Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization | Aug 4, 2020 | Graph AttentionSentence | CodeCode Available | 1 |
| AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting | Aug 3, 2020 | Language ModellingSentence | CodeCode Available | 1 |
| Improving One-stage Visual Grounding by Recursive Sub-query Construction | Aug 3, 2020 | SentenceSentence Embedding | CodeCode Available | 1 |
| Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language | Aug 1, 2020 | Sentence | CodeCode Available | 1 |
| Character-Preserving Coherent Story Visualization | Aug 1, 2020 | Representation LearningSentence | CodeCode Available | 1 |
| Learning to Generate Grounded Visual Captions without Localization Supervision | Aug 1, 2020 | Image CaptioningLanguage Modelling | CodeCode Available | 1 |
| Interactive Text Graph Mining with a Prolog-based Dialog Engine | Jul 31, 2020 | Graph MiningSentence | CodeCode Available | 1 |
| What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation | Jul 30, 2020 | Conversational RecommendationLanguage Modelling | CodeCode Available | 1 |
| Comprehensive Image Captioning via Scene Graph Decomposition | Jul 23, 2020 | DiversityImage Captioning | CodeCode Available | 1 |
| Learning to Discretely Compose Reasoning Module Networks for Video Captioning | Jul 17, 2020 | DecoderQuestion Answering | CodeCode Available | 1 |