Multimodal Pretraining for Dense Video Captioning | Nov 10, 2020 | Dense Video Captioning, Video Captioning | Code Available 1
Semi-Supervised Learning for Video Captioning | Nov 1, 2020 | Video Captioning | Unverified 0
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning | Nov 1, 2020 | Cross-Modal Retrieval, Representation Learning | Code Available 1
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications | Oct 27, 2020 | speech-recognition, Speech Recognition | Unverified 0
Improved Actor Relation Graph based Group Activity Recognition | Oct 24, 2020 | Activity Recognition, Group Activity Recognition | Code Available 1
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval | Sep 21, 2020 | Action Detection, Activity Detection | Unverified 0
Video captioning with stacked attention and semantic hard pull | Sep 15, 2020 | Decoder, Video Captioning | Code Available 0
Video Captioning Using Weak Annotation | Sep 2, 2020 | Sentence, Video Captioning | Unverified 0
Hierarchical memory decoder for visual narrating | Sep 1, 2020 | Decoder, Image Captioning | Unverified 0
In-Home Daily-Life Captioning Using Radio Signals | Aug 25, 2020 | Privacy Preserving, Video Captioning | Unverified 0
Poet: Product-oriented Video Captioner for E-commerce | Aug 16, 2020 | Video Captioning | Code Available 1
SODA: Story Oriented Dense Video Captioning Evaluation Framework | Aug 1, 2020 | Dense Video Captioning, Video Captioning | Code Available 1
Learning to Generate Grounded Visual Captions without Localization Supervision | Aug 1, 2020 | Image Captioning, Language Modelling | Code Available 1
Enriching Video Captions With Contextual Text | Jul 29, 2020 | Video Captioning | Code Available 0
Pre-training for Video Captioning Challenge 2020 Summary | Jul 27, 2020 | Video Captioning | Unverified 0
Active Learning for Video Description With Cluster-Regularized Ensemble Ranking | Jul 27, 2020 | Active Learning, Video Captioning | Unverified 0
SBAT: Video Captioning with Sparse Boundary-Aware Transformer | Jul 23, 2020 | Machine Translation, multimodal interaction | Unverified 0
Learning to Discretely Compose Reasoning Module Networks for Video Captioning | Jul 17, 2020 | Decoder, Question Answering | Code Available 1
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation | Jul 12, 2020 | Decoder, Graph-to-Sequence | Unverified 0
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training | Jul 5, 2020 | Decoder, Question Answering | Unverified 0
SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning | Jun 25, 2020 | Dense Video Captioning, Video Captioning | Unverified 0
Comprehensive Information Integration Modeling Framework for Video Titling | Jun 24, 2020 | Descriptive, Video Captioning | Code Available 1
Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020 | Jun 21, 2020 | Dense Captioning, Dense Video Captioning | Code Available 1
Video Moment Localization using Object Evidence and Reverse Captioning | Jun 18, 2020 | Language-Based Temporal Localization, Language Modelling | Code Available 1
Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning | Jun 14, 2020 | Dense Captioning, Dense Video Captioning | Unverified 0
NITS-VC System for VATEX Video Captioning Challenge 2020 | Jun 7, 2020 | Decoder, Machine Translation | Unverified 0
Syntax-Aware Action Targeting for Video Captioning | Jun 1, 2020 | Video Captioning | Code Available 1
Screencast Tutorial Video Understanding | Jun 1, 2020 | object-detection, Object Detection | Code Available 0
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer | May 17, 2020 | Dense Video Captioning, Temporal Action Proposal Generation | Code Available 1
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding | May 16, 2020 | Abstractive Text Summarization, Decoder | Unverified 0
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning | May 11, 2020 | Sentence, Video Captioning | Code Available 1
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos | May 2, 2020 | Action Detection, Form | Code Available 1
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training | May 1, 2020 | Language Modeling, Language Modelling | Code Available 1
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation | Mar 31, 2020 | Knowledge Distillation, Object | Unverified 0
Normalized and Geometry-Aware Self-Attention Network for Image Captioning | Mar 19, 2020 | Image Captioning, Machine Translation | Unverified 0
Multi-modal Dense Video Captioning | Mar 17, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 1
Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning | Mar 11, 2020 | Question Answering, Video Captioning | Code Available 1
OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement | Mar 8, 2020 | Object, Sentence | Unverified 0
Hierarchical Memory Decoding for Video Captioning | Feb 27, 2020 | Decoder, Video Captioning | Unverified 0
Object Relational Graph with Teacher-Recommended Learning for Video Captioning | Feb 26, 2020 | Language Modeling, Language Modelling | Unverified 0
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation | Feb 15, 2020 | Action Segmentation, Decoder | Code Available 1
Spatio-Temporal Ranked-Attention Networks for Video Captioning | Jan 17, 2020 | Video Captioning | Unverified 0
Delving Deeper into the Decoder for Video Captioning | Jan 16, 2020 | Decoder, Sentence | Code Available 1
Vision and Language: from Visual Perception to Content Creation | Dec 26, 2019 | Decoder, Question Answering | Unverified 0
Meaning guided video captioning | Dec 12, 2019 | Decoder, object-detection | Code Available 0
Multimodal Machine Translation through Visuals and Speech | Nov 28, 2019 | Image Captioning, Machine Translation | Unverified 0
Non-Autoregressive Coarse-to-Fine Video Captioning | Nov 27, 2019 | Sentence, Video Captioning | Code Available 0
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models | Nov 22, 2019 | Decoder, Video Captioning | Unverified 0
Empirical Autopsy of Deep Video Captioning Frameworks | Nov 21, 2019 | Decoder, Language Modelling | Unverified 0
Multi-attention Networks for Temporal Localization of Video-level Labels | Nov 15, 2019 | Action Recognition, Temporal Action Localization | Code Available 0