Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Dec 14, 2024 Descriptive Language Modeling
Understanding Action Sequences based on Video Captioning for Learning-from-Observation Dec 9, 2020 Video Captioning Video Understanding
Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
Boosting Video Representation Learning with Multi-Faceted Integration Jan 11, 2022 Action Recognition Representation Learning
Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 Image Classification
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Nov 28, 2016 Event Detection Video Captioning
Variational Stacked Local Attention Networks for Diverse Video Captioning Jan 4, 2022 Decoder Diversity
Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling
Beyond Caption To Narrative: Video Captioning With Multiple Sentences May 18, 2016 Action Localization Image Captioning
VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning Oct 13, 2019 Video Captioning
Vector Learning for Cross Domain Representations Sep 27, 2018 Decoder Image Captioning
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks Jun 10, 2025 Multiple-choice Open-Ended Question Answering
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos Jun 25, 2018 Dense Captioning Optical Flow Estimation
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Dec 12, 2024 Phrase Grounding Question Answering
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding Mar 24, 2024 Dense Video Captioning Temporal Localization
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Apr 7, 2021 Descriptive Text Summarization
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training Jul 5, 2020 Decoder Question Answering
Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information May 7, 2022 Text Generation Video Captioning
Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers Jun 6, 2019 All Dense Video Captioning
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Jan 1, 2025 GPU Question Answering
Video Captioning: a comparative review of where we are and which could be the route Apr 12, 2022 Video Captioning
Video Captioning in Compressed Video Jan 2, 2021 Caption Generation Video Captioning
Video Captioning Using Weak Annotation Sep 2, 2020 Sentence Video Captioning
Video Captioning via Hierarchical Reinforcement Learning Nov 29, 2017 Hierarchical Reinforcement Learning reinforcement-learning
Video Captioning with Aggregated Features Based on Dual Graphs and Gated Fusion Aug 13, 2023 Video Captioning
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction Jul 8, 2018 Decoder Language Modeling
Video Captioning with Guidance of Multimodal Latent Topics Aug 31, 2017 Caption Generation Decoder
Video Captioning with Multi-Faceted Attention Dec 1, 2016 Information Retrieval Retrieval
Attention based video captioning framework for Hindi Jun 17, 2021 Video Captioning
Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning Nov 5, 2019 Sentence Video Captioning
Video Captioning with Transferred Semantic Attributes Nov 23, 2016 Sentence Video Captioning
Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023) Dec 12, 2023 Decoder Video Captioning
Graph Similarities and Dual Approach for Sequential Text-to-Image Retrieval Sep 29, 2021 Graph Embedding Image Retrieval
Grounded Objects and Interactions for Video Captioning Nov 16, 2017 Object Scene Understanding
Global2Local: A Joint-Hierarchical Attention for Video Captioning Mar 13, 2022 Video Captioning
GUI Action Narrator: Where and When Did That Action Take Place? Jun 19, 2024 Optical Character Recognition (OCR) Video Captioning
Guidance Module Network for Video Captioning Dec 20, 2020 Decoder Sentence
Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag Nov 1, 2019 POS TAG
Get In Video: Add Anything You Want to the Video Mar 8, 2025 Object Detection
Generative Adversarial Network Applications in Creating a Meta-Universe Jan 23, 2022 Generative Adversarial Network Image-to-Image Translation
Generating Video Descriptions with Topic Guidance Aug 31, 2017 Decoder Image Captioning
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning Dec 30, 2024 Contrastive Learning Question Answering
Hierarchical Boundary-Aware Neural Encoder for Video Captioning Nov 28, 2016 Decoder Video Captioning
Hierarchical LSTMs with Adaptive Attention for Visual Captioning Dec 26, 2018 Caption Generation Image Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning Jun 5, 2017 Caption Generation Decoder
Hierarchical memory decoder for visual narrating Sep 1, 2020 Decoder Image Captioning
Hierarchical Memory Decoding for Video Captioning Feb 27, 2020 Decoder Video Captioning
Exploiting Auxiliary Caption for Video Grounding Jan 15, 2023 Contrastive Learning Dense Video Captioning
Hierarchical Multimodal Transformer to Summarize Videos Sep 22, 2021 Machine Translation Supervised Video Summarization