Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023) Dec 12, 2023 Decoder Video Captioning
— Unverified 0Attention based video captioning framework for Hindi Jun 17, 2021 Video Captioning
— Unverified 0Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers Jun 6, 2019 All Dense Video Captioning
— Unverified 0Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information May 7, 2022 Text Generation Video Captioning
— Unverified 0Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training Jul 5, 2020 Decoder Question Answering
— Unverified 0Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Apr 7, 2021 Descriptive Text Summarization
— Unverified 0Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding Mar 24, 2024 Dense Video Captioning Temporal Localization
— Unverified 0Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos Jun 25, 2018 Dense Captioning Optical Flow Estimation
— Unverified 0Beyond Caption To Narrative: Video Captioning With Multiple Sentences May 18, 2016 Action Localization Image Captioning
— Unverified 0Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling Language Modelling
— Unverified 0Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Nov 28, 2016 Event Detection Video Captioning
— Unverified 0Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
— Unverified 0Boosting Video Representation Learning with Multi-Faceted Integration Jan 11, 2022 Action Recognition Representation Learning
— Unverified 0Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
— Unverified 0Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
— Unverified 0Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Dec 14, 2024 Descriptive Language Modeling
— Unverified 0FIOVA: A Multi-Annotator Benchmark for Human-Aligned Video Captioning Oct 20, 2024 Diagnostic Video Captioning
— Unverified 0Prediction and Description of Near-Future Activities in Video Aug 2, 2019 Prediction Video Captioning
— Unverified 0Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning Feb 19, 2025 Knowledge Distillation Object
— Unverified 0Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models Nov 22, 2019 Decoder Video Captioning
— Unverified 0Chinese Whispers: Cooperative Paraphrase Acquisition May 1, 2012 Machine Translation Natural Language Inference
— Unverified 0Classifier-Guided Captioning Across Modalities Jan 3, 2025 Audio captioning Video Captioning
— Unverified 0CLIP4Caption: CLIP for Video Caption Oct 13, 2021 Decoder Sentence
— Unverified 0CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising Dec 14, 2021 Cross-Modal Retrieval Decoder
— Unverified 0Collaborative Three-Stream Transformers for Video Captioning Sep 18, 2023 Sentence Video Captioning
— Unverified 0Consensus-based Sequence Training for Video Captioning Dec 27, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Learning Video Representations using Contrastive Bidirectional Transformer Jun 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
— Unverified 0CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Mar 31, 2022 Retrieval Video Captioning
— Unverified 0CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Sep 30, 2021 Contrastive Learning Retrieval
— Unverified 0Crowd Video Captioning Nov 13, 2019 Video Captioning
— Unverified 0CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning Apr 1, 2021 Question Answering Representation Learning
— Unverified 0Deep Reinforcement Learning for NLP Jul 1, 2018 Atari Games coreference-resolution
— Unverified 0Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols Nov 5, 2023 Caption Generation Dense Video Captioning
— Unverified 0Dense Video Captioning using Graph-based Sentence Summarization Jun 25, 2025 Dense Video Captioning Sentence
— Unverified 0Describe Anything: Detailed Localized Image and Video Captioning Apr 22, 2025 Sentence Video Captioning
— Unverified 0DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Apr 3, 2024 Dense Video Captioning Diversity
— Unverified 0Directed Domain Fine-Tuning: Tailoring Separate Modalities for Specific Training Tasks Jun 24, 2024 Question Answering Text Generation
— Unverified 0Discourse Analysis for Evaluating Coherence in Video Paragraph Captions Jan 17, 2022 Video Captioning Visual Dialog
— Unverified 0Diverse Video Captioning Through Latent Variable Expansion Oct 26, 2019 Diversity Generative Adversarial Network
— Unverified 0Dual-Level Decoupled Transformer for Video Captioning May 6, 2022 Descriptive Sentence
— Unverified 0DVCFlow: Modeling Information Flow Towards Human-like Video Captioning Nov 19, 2021 Dense Video Captioning Diversity
— Unverified 0E-MMAD: Multimodal Advertising Caption Generation Based on Structured Information Nov 16, 2021 Caption Generation valid
— Unverified 0Empirical Autopsy of Deep Video Captioning Frameworks Nov 21, 2019 Decoder Language Modelling
— Unverified 0Encoder-Decoder Based Long Short-Term Memory (LSTM) Model for Video Captioning Oct 2, 2023 Decoder Sentence
— Unverified 0End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering Oct 10, 2016 Language Modeling Language Modelling
— Unverified 0End-to-end Dense Video Captioning as Sequence Generation Jan 16, 2022 Dense Video Captioning Descriptive
— Unverified 0End-to-end Dense Video Captioning as Sequence Generation Apr 18, 2022 Dense Video Captioning Descriptive
— Unverified 0End-to-end Generative Pretraining for Multimodal Video Captioning Jan 20, 2022 Action Classification Decoder
— Unverified 0Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization Oct 9, 2024 Audio captioning Large Language Model
— Unverified 0