DVCFlow: Modeling Information Flow Towards Human-like Video Captioning Nov 19, 2021 Dense Video Captioning Diversity
Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
An Integrated Approach for Video Captioning and Applications Jan 23, 2022 Image Captioning Video Captioning
Dual-Level Decoupled Transformer for Video Captioning May 6, 2022 Descriptive Sentence
Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
Less Is More: Picking Informative Frames for Video Captioning Mar 5, 2018 Decoder Diversity
Diverse Video Captioning Through Latent Variable Expansion Oct 26, 2019 Diversity Generative Adversarial Network
Boosting Video Representation Learning with Multi-Faceted Integration Jan 11, 2022 Action Recognition Representation Learning
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Nov 19, 2024 GPU Question Answering
Discourse Analysis for Evaluating Coherence in Video Paragraph Captions Jan 17, 2022 Video Captioning Visual Dialog
Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos Jul 11, 2019 Dense Captioning Dense Video Captioning
Directed Domain Fine-Tuning: Tailoring Separate Modalities for Specific Training Tasks Jun 24, 2024 Question Answering Text Generation
DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Apr 3, 2024 Dense Video Captioning Diversity
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Nov 28, 2016 Event Detection Video Captioning
Describe Anything: Detailed Localized Image and Video Captioning Apr 22, 2025 Sentence Video Captioning
Improving Interpretability of Deep Neural Networks with Semantic Information Mar 12, 2017 Action Recognition Temporal Action Localization
Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling Language Modelling
Beyond Caption To Narrative: Video Captioning With Multiple Sentences May 18, 2016 Action Localization Image Captioning
Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols Nov 5, 2023 Caption Generation Dense Video Captioning
End-to-End Video Captioning Apr 4, 2019 Action Recognition Caption Generation
Image-to-Video Person Re-Identification by Reusing Cross-modal Embeddings Oct 4, 2018 Image Captioning Image-To-Video Person Re-Identification
IcoCap: Improving Video Captioning by Compounding Images Oct 5, 2023 Image Captioning Video Captioning
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation Nov 27, 2024 Graph Generation Question Answering
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning Nov 7, 2018 Mixture-of-Experts Video Captioning
LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models Feb 21, 2025 Caption Generation Video Captioning
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos Jun 25, 2018 Dense Captioning Optical Flow Estimation
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding May 16, 2020 Abstractive Text Summarization Decoder
Learning Actions from Human Demonstration Video for Robotic Manipulation Sep 10, 2019 Video Captioning
HiVLP: Hierarchical Interactive Video-Language Pre-Training Jan 1, 2023 Retrieval Self-Supervised Learning
An Attempt towards Interpretable Audio-Visual Video Captioning Dec 7, 2018 Audio captioning Audio-Visual Video Captioning
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training Dec 30, 2022 cross-modal alignment TGIF-Action
An Efficient Keyframes Selection Based Framework for Video Captioning Dec 1, 2021 Text Generation Video Captioning
Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
Hierarchical Recurrent Neural Network for Video Summarization Apr 28, 2019 Video Captioning Video Summarization
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning Nov 11, 2015 image-classification Image Classification
Hierarchical Multimodal Transformer to Summarize Videos Sep 22, 2021 Machine Translation Supervised Video Summarization
Deep Reinforcement Learning for NLP Jul 1, 2018 Atari Games coreference-resolution
Human Action Sequence Classification Oct 7, 2019 Action Classification Action Localization
Human-centric Behavior Description in Videos: New Benchmark and Model Oct 4, 2023 Video Captioning
CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning Apr 1, 2021 Question Answering Representation Learning
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding Mar 24, 2024 Dense Video Captioning Temporal Localization
Hierarchical Memory Decoding for Video Captioning Feb 27, 2020 Decoder Video Captioning
Hierarchical memory decoder for visual narrating Sep 1, 2020 Decoder Image Captioning
Imperial College London Submission to VATEX Video Captioning Task Oct 16, 2019 Decoder Video Captioning
Implicit and Explicit Commonsense for Multi-sentence Video Captioning Mar 14, 2023 Imitation Learning Sentence
Dense Video Captioning using Graph-based Sentence Summarization Jun 25, 2025 Dense Video Captioning Sentence
Crowd Video Captioning Nov 13, 2019 Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning Jun 5, 2017 Caption Generation Decoder
Hierarchical LSTMs with Adaptive Attention for Visual Captioning Dec 26, 2018 Caption Generation Image Captioning