| Very Low Resource Sentence Alignment: Luhya and Swahili | Oct 1, 2022 | Machine TranslationSentence | —Unverified | 0 |
| Very Low Resource Sentence Alignment: Luhya and Swahili | Oct 31, 2022 | Machine TranslationSentence | —Unverified | 0 |
| Vicinity-Driven Paragraph and Sentence Alignment for Comparable Corpora | Dec 13, 2016 | SentenceText Simplification | —Unverified | 0 |
| ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos | Apr 17, 2025 | MisinformationSentence | —Unverified | 0 |
| Video-based Sign Language Recognition without Temporal Segmentation | Jan 30, 2018 | SegmentationSentence | —Unverified | 0 |
| Video Captioning Using Weak Annotation | Sep 2, 2020 | SentenceVideo Captioning | —Unverified | 0 |
| Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction | Jul 8, 2018 | DecoderLanguage Modeling | —Unverified | 0 |
| Video Captioning with Multi-Faceted Attention | Dec 1, 2016 | Information RetrievalRetrieval | —Unverified | 0 |
| Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning | Nov 5, 2019 | SentenceVideo Captioning | —Unverified | 0 |
| Video Captioning with Transferred Semantic Attributes | Nov 23, 2016 | SentenceVideo Captioning | —Unverified | 0 |
| VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing | Nov 30, 2022 | Machine TranslationSentence | —Unverified | 0 |
| Video-Grounded Dialogues with Pretrained Generation Language Models | Jun 27, 2020 | Sentence | —Unverified | 0 |
| Video Paragraph Captioning as a Text Summarization Task | Aug 1, 2021 | SentenceText Summarization | —Unverified | 0 |
| Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks | Oct 26, 2015 | SentenceVideo Captioning | —Unverified | 0 |
| Video Question Generation via Cross-Modal Self-Attention Networks Learning | Jul 5, 2019 | DiversityQuestion Answering | —Unverified | 0 |
| Video Referring Expression Comprehension via Transformer with Content-aware Query | Oct 6, 2022 | cross-modal alignmentReferring Expression | —Unverified | 0 |
| Video Referring Expression Comprehension via Transformer with Content-conditioned Query | Oct 25, 2023 | cross-modal alignmentReferring Expression | —Unverified | 0 |
| Video sentence grounding with temporally global textual knowledge | Apr 21, 2024 | Contrastive LearningRetrieval | —Unverified | 0 |
| Video Storytelling: Textual Summaries for Events | Jul 25, 2018 | DiversityReinforcement Learning | —Unverified | 0 |
| Vietnamese Open Information Extraction | Jan 23, 2018 | Dependency ParsingOpen Information Extraction | —Unverified | 0 |
| ViNLI: A Vietnamese Corpus for Studies on Open-Domain Natural Language Inference | Oct 1, 2022 | ArticlesNatural Language Inference | —Unverified | 0 |
| Vis-Eval Metric Viewer: A Visualisation Tool for Inspecting and Evaluating Metric Scores of Machine Translation Output | Jun 1, 2018 | Machine TranslationSentence | —Unverified | 0 |
| Vision Transformer Based Model for Describing a Set of Images as a Story | Oct 6, 2022 | Language ModellingSentence | —Unverified | 0 |
| Visual Agreement Regularized Training for Multi-Modal Machine Translation | Dec 27, 2019 | Machine TranslationSentence | —Unverified | 0 |
| Visual Conceptual Blending with Large-scale Language and Vision Models | Jun 27, 2021 | Image GenerationLanguage Modeling | —Unverified | 0 |