Active Learning for Video Description With Cluster-Regularized Ensemble Ranking Jul 27, 2020 Active Learning Video Captioning
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos Jul 11, 2019 Dense Captioning Dense Video Captioning
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Nov 19, 2024 GPU Question Answering
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Jan 1, 2025 GPU Question Answering
Adaptive Feature Abstraction for Translating Video to Text Nov 23, 2016 Video Captioning
A Dataset for Telling the Stories of Social Media Videos Oct 1, 2018 Sentence Video Captioning
Agent-based Video Trimming Dec 12, 2024 Highlight Detection Moment Retrieval
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning Nov 22, 2022 Translation Video Captioning
Amortized Context Vector Inference for Sequence-to-Sequence Networks May 23, 2018 Document Summarization Variational Inference
Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks Oct 7, 2023 Action Recognition Multiple-choice
An Attempt towards Interpretable Audio-Visual Video Captioning Dec 7, 2018 Audio captioning Audio-Visual Video Captioning
An Efficient Keyframes Selection Based Framework for Video Captioning Dec 1, 2021 Text Generation Video Captioning
End-to-End Video Captioning Apr 4, 2019 Action Recognition Caption Generation
An Integrated Approach for Video Captioning and Applications Jan 23, 2022 Image Captioning Video Captioning
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Dec 25, 2023 Image Generation Text to Image Generation
A Restricted Visual Turing Test for Deep Scene and Event Understanding Dec 6, 2015 Question Answering Video Captioning
A Review of Deep Learning for Video Captioning Apr 22, 2023 Deep Learning Dense Video Captioning
ARGUS: Hallucination and Omission Evaluation in Video-LLMs Jun 9, 2025 Descriptive Form
A Shared Task on Multimodal Machine Translation and Crosslingual Image Description Aug 1, 2016 Image Description Image Retrieval
A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) May 2, 2024 Acoustic Scene Classification Event Detection
Attend and Interact: Higher-Order Object Interactions for Video Understanding Nov 16, 2017 Action Classification Action Recognition
Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023) Dec 12, 2023 Decoder Video Captioning
Attention based video captioning framework for Hindi Jun 17, 2021 Video Captioning
Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers Jun 6, 2019 All Dense Video Captioning
Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information May 7, 2022 Text Generation Video Captioning
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training Jul 5, 2020 Decoder Question Answering
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Apr 7, 2021 Descriptive Text Summarization
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding Mar 24, 2024 Dense Video Captioning Temporal Localization
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos Jun 25, 2018 Dense Captioning Optical Flow Estimation
Beyond Caption To Narrative: Video Captioning With Multiple Sentences May 18, 2016 Action Localization Image Captioning
Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling Language Modelling
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Nov 28, 2016 Event Detection Video Captioning
Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
Boosting Video Representation Learning with Multi-Faceted Integration Jan 11, 2022 Action Recognition Representation Learning
Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Dec 14, 2024 Descriptive Language Modeling
FIOVA: A Multi-Annotator Benchmark for Human-Aligned Video Captioning Oct 20, 2024 Diagnostic Video Captioning
Prediction and Description of Near-Future Activities in Video Aug 2, 2019 Prediction Video Captioning
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning Feb 19, 2025 Knowledge Distillation Object
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models Nov 22, 2019 Decoder Video Captioning
Chinese Whispers: Cooperative Paraphrase Acquisition May 1, 2012 Machine Translation Natural Language Inference
Classifier-Guided Captioning Across Modalities Jan 3, 2025 Audio captioning Video Captioning
CLIP4Caption: CLIP for Video Caption Oct 13, 2021 Decoder Sentence
CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising Dec 14, 2021 Cross-Modal Retrieval Decoder
Collaborative Three-Stream Transformers for Video Captioning Sep 18, 2023 Sentence Video Captioning
Consensus-based Sequence Training for Video Captioning Dec 27, 2017 Reinforcement Learning Reinforcement Learning (RL)
Learning Video Representations using Contrastive Bidirectional Transformer Jun 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Mar 31, 2022 Retrieval Video Captioning