MAViC: Multimodal Active Learning for Video Captioning Dec 11, 2022 Active Learning Decoder
— Unverified 0VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners Dec 9, 2022 Question Answering Retrieval
— Unverified 0Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning Nov 28, 2022 FAD Video Captioning
Code Code Available 0Aligning Source Visual and Target Language Domains for Unpaired Video Captioning Nov 22, 2022 Translation Video Captioning
— Unverified 0Event and Entity Extraction from Generated Video Captions Nov 5, 2022 Caption Generation Dense Video Captioning
Code Code Available 0Fighting FIRe with FIRE: Assessing the Validity of Text-to-Video Retrieval Benchmarks Oct 10, 2022 Retrieval Text to Video Retrieval
— Unverified 0Recipe Generation from Unsegmented Cooking Videos Sep 21, 2022 Dense Video Captioning Recipe Generation
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation Sep 13, 2022 Image Generation Story Continuation
Code Code Available 0Diverse Video Captioning by Adaptive Spatio-temporal Attention Aug 19, 2022 Decoder Diversity
Code Code Available 0Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
— Unverified 0SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions Jul 24, 2022 Dense Captioning Dense Video Captioning
— Unverified 0Dual-Stream Transformer for Generic Event Boundary Captioning Jul 7, 2022 Boundary Captioning Video Captioning
Code Code Available 0PIC 4th Challenge: Semantic-Assisted Multi-Feature Encoding and Multi-Head Decoding for Dense Video Captioning Jul 6, 2022 Dense Video Captioning Video Captioning
— Unverified 0Modality Alignment between Deep Representations for Effective Video-and-Language Learning Jun 1, 2022 Question Answering Video Captioning
— Unverified 0Support-set based Multi-modal Representation Enhancement for Video Captioning May 19, 2022 Video Captioning
Code Code Available 0Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information May 7, 2022 Text Generation Video Captioning
— Unverified 0Dual-Level Decoupled Transformer for Video Captioning May 6, 2022 Descriptive Sentence
— Unverified 0Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos Apr 28, 2022 Action Understanding Video Captioning
Code Code Available 0End-to-end Dense Video Captioning as Sequence Generation Apr 18, 2022 Dense Video Captioning Descriptive
— Unverified 0Semantic-Aware Pretraining for Dense Video Captioning Apr 13, 2022 Dense Captioning Dense Video Captioning
— Unverified 0Video Captioning: a comparative review of where we are and which could be the route Apr 12, 2022 Video Captioning
— Unverified 0Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
— Unverified 0CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Mar 31, 2022 Retrieval Video Captioning
— Unverified 0Global2Local: A Joint-Hierarchical Attention for Video Captioning Mar 13, 2022 Video Captioning
— Unverified 0Exploiting long-term temporal dynamics for video captioning Feb 22, 2022 Video Captioning
— Unverified 0BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment Jan 25, 2022 Language Modeling Language Modelling
Code Code Available 0Generative Adversarial Network Applications in Creating a Meta-Universe Jan 23, 2022 Generative Adversarial Network Image-to-Image Translation
— Unverified 0An Integrated Approach for Video Captioning and Applications Jan 23, 2022 Image Captioning Video Captioning
— Unverified 0End-to-end Generative Pretraining for Multimodal Video Captioning Jan 20, 2022 Action Classification Decoder
— Unverified 0Discourse Analysis for Evaluating Coherence in Video Paragraph Captions Jan 17, 2022 Video Captioning Visual Dialog
— Unverified 0End-to-end Dense Video Captioning as Sequence Generation Jan 16, 2022 Dense Video Captioning Descriptive
— Unverified 0Boosting Video Representation Learning with Multi-Faceted Integration Jan 11, 2022 Action Recognition Representation Learning
— Unverified 0Variational Stacked Local Attention Networks for Diverse Video Captioning Jan 4, 2022 Decoder Diversity
— Unverified 0Dense Video Captioning Using Unsupervised Semantic Information Dec 15, 2021 Dense Video Captioning Video Captioning
Code Code Available 0CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising Dec 14, 2021 Cross-Modal Retrieval Decoder
— Unverified 0Syntax Customized Video Captioning by Imitating Exemplar Sentences Dec 2, 2021 Decoder Diversity
Code Code Available 0Multi-modal Dependency Tree for Video Captioning Dec 1, 2021 Caption Generation Dependency Parsing
— Unverified 0An Efficient Keyframes Selection Based Framework for Video Captioning Dec 1, 2021 Text Generation Video Captioning
— Unverified 0CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter Nov 30, 2021 Caption Generation Representation Learning
Code Code Available 0DVCFlow: Modeling Information Flow Towards Human-like Video Captioning Nov 19, 2021 Dense Video Captioning Diversity
— Unverified 0Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework Nov 16, 2021 Multiple-choice Question Answering
— Unverified 0CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
— Unverified 0E-MMAD: Multimodal Advertising Caption Generation Based on Structured Information Nov 16, 2021 Caption Generation valid
— Unverified 0Visual-aware Attention Dual-stream Decoder for Video Captioning Oct 16, 2021 Decoder Video Captioning
— Unverified 0CLIP4Caption: CLIP for Video Caption Oct 13, 2021 Decoder Sentence
— Unverified 0CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Sep 30, 2021 Contrastive Learning Retrieval
— Unverified 0Graph Similarities and Dual Approach for Sequential Text-to-Image Retrieval Sep 29, 2021 Graph Embedding Image Retrieval
— Unverified 0OSVidCap: A Framework for the Simultaneous Recognition and Description of Concurrent Actions in Videos in an Open-Set Scenario Sep 29, 2021 Decoder Open Set Video Captioning
Code Code Available 0Hierarchical Multimodal Transformer to Summarize Videos Sep 22, 2021 Machine Translation Supervised Video Summarization
— Unverified 0