| PPGN: Phrase-Guided Proposal Generation Network For Referring Expression Comprehension | Dec 20, 2020 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| Modular Graph Attention Network for Complex Visual Relational Reasoning | Nov 22, 2020 | Graph AttentionQuestion Answering | —Unverified | 0 |
| ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments | Nov 15, 2020 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| Language-Conditioned Feature Pyramids for Visual Selection Tasks | Nov 1, 2020 | Referring ExpressionReferring Expression Comprehension | CodeCode Available | 0 |
| Commands 4 Autonomous Vehicles (C4AV) Workshop Summary | Sep 18, 2020 | Autonomous VehiclesReferring Expression Comprehension | —Unverified | 0 |
| Cosine meets Softmax: A tough-to-beat baseline for visual grounding | Sep 13, 2020 | Autonomous DrivingMetric Learning | CodeCode Available | 0 |
| AttnGrounder: Talking to Cars with Attention | Sep 11, 2020 | Referring Expression ComprehensionVisual Grounding | CodeCode Available | 0 |
| Referring Expression Comprehension: A Survey of Methods and Datasets | Jul 19, 2020 | object-detectionObject Detection | —Unverified | 0 |
| ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph | Jun 30, 2020 | AttributePrediction | —Unverified | 0 |
| Large-Scale Adversarial Training for Vision-and-Language Representation Learning | Jun 11, 2020 | Image-text RetrievalQuestion Answering | CodeCode Available | 1 |