Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention | Sep 7, 2021 | Sensor Fusion, Video Captioning | Code Available
Cross-Modal Graph with Meta Concepts for Video Captioning | Aug 14, 2021 | Object Detection | Code Available
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning | Aug 5, 2021 | Attribute, Caption Generation | Unverified
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers | Aug 4, 2021 | Video Captioning | Unverified
Boosting Video Captioning with Dynamic Loss Network | Jul 25, 2021 | Image Classification | Unverified
iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability | Jun 25, 2021 | Bias Detection, Question Answering | Unverified
Sketch, Ground, and Refine: Top-Down Dense Video Captioning | Jun 19, 2021 | Dense Video Captioning, Sentence | Code Available
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning | Jun 19, 2021 | Sentence, Video Captioning | Unverified
Attention based video captioning framework for Hindi | Jun 17, 2021 | Video Captioning | Unverified
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | May 20, 2021 | Action Segmentation, Language Modeling | Unverified
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching | May 18, 2021 | Caption Generation, Cross-Modal Retrieval | Unverified
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework | Apr 9, 2021 | Language Modelling, Multiple-choice | Code Available
The Use of Video Captioning for Fostering Physical Activity | Apr 7, 2021 | Action Detection, Object Detection | Unverified
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning | Apr 7, 2021 | Descriptive, Text Summarization | Unverified
CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning | Apr 1, 2021 | Question Answering, Representation Learning | Unverified
Open-book Video Captioning with Retrieve-Copy-Generate Network | Mar 9, 2021 | Decoder, Retrieval | Unverified
Recent Advances in Video Question Answering: A Review of Datasets and Methods | Jan 15, 2021 | Information Retrieval, Machine Translation | Unverified
Exploration of Visual Features and their weighted-additive fusion for Video Captioning | Jan 14, 2021 | Video Captioning | Unverified
Video Captioning in Compressed Video | Jan 2, 2021 | Caption Generation, Video Captioning | Unverified
Motion Guided Region Message Passing for Video Captioning | Jan 1, 2021 | Decoder, Video Captioning | Unverified
Guidance Module Network for Video Captioning | Dec 20, 2020 | Decoder, Sentence | Unverified
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish | Dec 13, 2020 | Machine Translation, Multimodal Machine Translation | Unverified
Understanding Action Sequences based on Video Captioning for Learning-from-Observation | Dec 9, 2020 | Video Captioning, Video Understanding | Unverified
iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Nov 16, 2020 | Common Sense Reasoning, Dense Video Captioning | Unverified
ActBERT: Learning Global-Local Video-Text Representations | Nov 14, 2020 | Action Segmentation, Question Answering | Code Available
Semi-Supervised Learning for Video Captioning | Nov 1, 2020 | Video Captioning | Unverified
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications | Oct 27, 2020 | Speech Recognition | Unverified
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval | Sep 21, 2020 | Action Detection, Activity Detection | Unverified
Video captioning with stacked attention and semantic hard pull | Sep 15, 2020 | Decoder, Video Captioning | Code Available
Video Captioning Using Weak Annotation | Sep 2, 2020 | Sentence, Video Captioning | Unverified
Hierarchical memory decoder for visual narrating | Sep 1, 2020 | Decoder, Image Captioning | Unverified
In-Home Daily-Life Captioning Using Radio Signals | Aug 25, 2020 | Privacy Preserving, Video Captioning | Unverified
Enriching Video Captions With Contextual Text | Jul 29, 2020 | Video Captioning | Code Available
Pre-training for Video Captioning Challenge 2020 Summary | Jul 27, 2020 | Video Captioning | Unverified
Active Learning for Video Description With Cluster-Regularized Ensemble Ranking | Jul 27, 2020 | Active Learning, Video Captioning | Unverified
SBAT: Video Captioning with Sparse Boundary-Aware Transformer | Jul 23, 2020 | Machine Translation, multimodal interaction | Unverified
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation | Jul 12, 2020 | Decoder, Graph-to-Sequence | Unverified
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training | Jul 5, 2020 | Decoder, Question Answering | Unverified
SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning | Jun 25, 2020 | Dense Video Captioning, Video Captioning | Unverified
Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning | Jun 14, 2020 | Dense Captioning, Dense Video Captioning | Unverified
NITS-VC System for VATEX Video Captioning Challenge 2020 | Jun 7, 2020 | Decoder, Machine Translation | Unverified
Screencast Tutorial Video Understanding | Jun 1, 2020 | Object Detection | Code Available
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding | May 16, 2020 | Abstractive Text Summarization, Decoder | Unverified
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation | Mar 31, 2020 | Knowledge Distillation, Object | Unverified
Normalized and Geometry-Aware Self-Attention Network for Image Captioning | Mar 19, 2020 | Image Captioning, Machine Translation | Unverified
OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement | Mar 8, 2020 | Object, Sentence | Unverified
Hierarchical Memory Decoding for Video Captioning | Feb 27, 2020 | Decoder, Video Captioning | Unverified
Object Relational Graph with Teacher-Recommended Learning for Video Captioning | Feb 26, 2020 | Language Modeling, Language Modelling | Unverified
Spatio-Temporal Ranked-Attention Networks for Video Captioning | Jan 17, 2020 | Video Captioning | Unverified
Vision and Language: from Visual Perception to Content Creation | Dec 26, 2019 | Decoder, Question Answering | Unverified