DVCFlow: Modeling Information Flow Towards Human-like Video Captioning Nov 19, 2021 Dense Video Captioning Diversity
— Unverified 0EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching Nov 17, 2021 Language Modelling Video Captioning
Code Code Available 1CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
— Unverified 0Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework Nov 16, 2021 Multiple-choice Question Answering
— Unverified 0E-MMAD: Multimodal Advertising Caption Generation Based on Structured Information Nov 16, 2021 Caption Generation valid
— Unverified 0Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks Nov 14, 2021 Action Classification Object
Code Code Available 1Visual-aware Attention Dual-stream Decoder for Video Captioning Oct 16, 2021 Decoder Video Captioning
— Unverified 0CLIP4Caption: CLIP for Video Caption Oct 13, 2021 Decoder Sentence
— Unverified 0CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Sep 30, 2021 Contrastive Learning Retrieval
— Unverified 0OSVidCap: A Framework for the Simultaneous Recognition and Description of Concurrent Actions in Videos in an Open-Set Scenario Sep 29, 2021 Decoder Open Set Video Captioning
Code Code Available 0Graph Similarities and Dual Approach for Sequential Text-to-Image Retrieval Sep 29, 2021 Graph Embedding Image Retrieval
— Unverified 0Hierarchical Multimodal Transformer to Summarize Videos Sep 22, 2021 Machine Translation Supervised Video Summarization
— Unverified 0Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention Sep 7, 2021 Sensor Fusion Video Captioning
Code Code Available 0X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics Aug 18, 2021 Cross-Modal Retrieval Decoder
Code Code Available 1End-to-End Dense Video Captioning with Parallel Decoding Aug 17, 2021 Caption Generation Dense Video Captioning
Code Code Available 1Cross-Modal Graph with Meta Concepts for Video Captioning Aug 14, 2021 object-detection Object Detection
Code Code Available 0Discriminative Latent Semantic Graph for Video Captioning Aug 8, 2021 Decoder Object
Code Code Available 1O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning Aug 5, 2021 Attribute Caption Generation
— Unverified 0Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers Aug 4, 2021 Video Captioning
— Unverified 0Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
— Unverified 0iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability Jun 25, 2021 Bias Detection Question Answering
— Unverified 0Sketch, Ground, and Refine: Top-Down Dense Video Captioning Jun 19, 2021 Dense Video Captioning Sentence
Code Code Available 0Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning Jun 19, 2021 Sentence Video Captioning
— Unverified 0Attention based video captioning framework for Hindi Jun 17, 2021 Video Captioning
— Unverified 0VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation Jun 8, 2021 Multi-Task Learning Question Answering
Code Code Available 1DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization Jun 1, 2021 Question Answering Retrieval
Code Code Available 1VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding May 20, 2021 Action Segmentation Language Modeling
Code Code Available 0Improving Generation and Evaluation of Visual Stories via Semantic Consistency May 20, 2021 Image Generation Story Visualization
Code Code Available 1Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching May 18, 2021 Caption Generation Cross-Modal Retrieval
— Unverified 0FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework Apr 9, 2021 Language Modelling Multiple-choice
Code Code Available 0Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Apr 7, 2021 Descriptive Text Summarization
— Unverified 0The Use of Video Captioning for Fostering Physical Activity Apr 7, 2021 Action Detection object-detection
— Unverified 0CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning Apr 1, 2021 Question Answering Representation Learning
— Unverified 0Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval Apr 1, 2021 Retrieval Text Retrieval
Code Code Available 1A Comprehensive Review of the Video-to-Text Problem Mar 27, 2021 Question Answering Retrieval
Code Code Available 1Open-book Video Captioning with Retrieve-Copy-Generate Network Mar 9, 2021 Decoder Retrieval
— Unverified 0The MSR-Video to Text Dataset with Clean Annotations Feb 12, 2021 Sentence Video Captioning
Code Code Available 1Semantic Grouping Network for Video Captioning Feb 1, 2021 Video Captioning
Code Code Available 1Recent Advances in Video Question Answering: A Review of Datasets and Methods Jan 15, 2021 Information Retrieval Machine Translation
— Unverified 0Exploration of Visual Features and their weighted-additive fusion for Video Captioning Jan 14, 2021 Video Captioning
— Unverified 0A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules Jan 8, 2021 Decoder Deep Reinforcement Learning
Code Code Available 1Video Captioning in Compressed Video Jan 2, 2021 Caption Generation Video Captioning
— Unverified 0Motion Guided Region Message Passing for Video Captioning Jan 1, 2021 Decoder Video Captioning
— Unverified 0Guidance Module Network for Video Captioning Dec 20, 2020 Decoder Sentence
— Unverified 0MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish Dec 13, 2020 Machine Translation Multimodal Machine Translation
— Unverified 0Understanding Action Sequences based on Video Captioning for Learning-from-Observation Dec 9, 2020 Video Captioning Video Understanding
— Unverified 0TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Nov 23, 2020 Action Classification Action Localization
Code Code Available 1Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language Nov 18, 2020 Dictionary Learning Disentanglement
Code Code Available 1iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering Nov 16, 2020 Common Sense Reasoning Dense Video Captioning
— Unverified 0ActBERT: Learning Global-Local Video-Text Representations Nov 14, 2020 Action Segmentation Question Answering
Code Code Available 0