Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents Aug 18, 2020 Video Description
Code Code Available 0Active Learning for Video Description With Cluster-Regularized Ensemble Ranking Jul 27, 2020 Active Learning Video Captioning
— Unverified 0Delving Deeper into the Decoder for Video Captioning Jan 16, 2020 Decoder Sentence
Code Code Available 1Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering Jan 3, 2020 Question Answering Video Description
— Unverified 0VizSeq: A Visual Analysis Toolkit for Text Generation Tasks Sep 12, 2019 Benchmarking Image Captioning
Code Code Available 0Prediction and Description of Near-Future Activities in Video Aug 2, 2019 Prediction Video Captioning
— Unverified 0VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research Apr 6, 2019 Machine Translation Translation
Code Code Available 1End-to-End Video Captioning Apr 4, 2019 Action Recognition Caption Generation
— Unverified 0Grounded Video Description Dec 17, 2018 Image Description Sentence
Code Code Available 1Adversarial Inference for Multi-Sentence Video Description Dec 13, 2018 Diversity Image Captioning
Code Code Available 0Incorporating Background Knowledge into Video Description Generation Oct 1, 2018 Decoder Text Generation
— Unverified 0A Dataset for Telling the Stories of Social Media Videos Oct 1, 2018 Sentence Video Captioning
— Unverified 0Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions Aug 27, 2018 Translation Video Description
— Unverified 0Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
— Unverified 0Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data Jul 1, 2018 Image Description Machine Translation
— Unverified 0End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features Jun 21, 2018 Question Answering Video Description
Code Code Available 0Interpretable Video Captioning via Trajectory Structured Localization Jun 1, 2018 Decoder Image Captioning
— Unverified 0Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 Jun 1, 2018 Video Description Visual Dialog
Code Code Available 1Video Description: A Survey of Methods, Datasets and Evaluation Metrics Jun 1, 2018 Diversity Language Modeling
— Unverified 0Incorporating Semantic Attention in Video Description Generation May 1, 2018 Image Captioning Image Classification
— Unverified 0Integrating both Visual and Audio Cues for Enhanced Video Caption Nov 22, 2017 Descriptive Sentence
— Unverified 0Attend and Interact: Higher-Order Object Interactions for Video Understanding Nov 16, 2017 Action Classification Action Recognition
— Unverified 0Predicting Visual Features from Text for Image and Video Caption Retrieval Sep 5, 2017 Retrieval Sentence
Code Code Available 0Incorporating Global Visual Features into Attention-based Neural Machine Translation. Sep 1, 2017 Decoder Machine Translation
— Unverified 0Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description Jul 1, 2017 Video Captioning Video Description
— Unverified 0Egocentric Video Description based on Temporally-Linked Sequences Apr 7, 2017 Decoder Video Description
Code Code Available 0Attention-Based Multimodal Fusion for Video Description Jan 11, 2017 Decoder Sentence
— Unverified 0Generating Video Description using Sequence-to-sequence Model with Temporal Attention Dec 1, 2016 Caption Generation Sentence
— Unverified 0Hierarchical Boundary-Aware Neural Encoder for Video Captioning Nov 28, 2016 Decoder Video Captioning
— Unverified 0Memory-augmented Attention Modelling for Videos Nov 7, 2016 Video Description
Code Code Available 0Neural Headline Generation on Abstract Meaning Representation Nov 1, 2016 Abstract Meaning Representation Dependency Parsing
— Unverified 0Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis Aug 1, 2016 Action Classification Object Recognition
— Unverified 0SHEF-Multimodal: Grounding Machine Translation on Images Aug 1, 2016 Machine Translation Multimodal Machine Translation
— Unverified 0VideoMCC: a New Benchmark for Video Comprehension Jun 23, 2016 Multiple-choice Video Description
— Unverified 0Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling Language Modelling
— Unverified 0MSR-VTT: A Large Video Description Dataset for Bridging Video and Language Jun 1, 2016 Image Captioning Sentence
— Unverified 0Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model May 12, 2016 Language Modeling Language Modelling
— Unverified 0A Mid-level Video Representation based on Binary Descriptors: A Case Study for Pornography Detection May 12, 2016 Pornography Detection Video Description
Code Code Available 0Video Description using Bidirectional Recurrent Neural Networks Apr 12, 2016 Decoder Text Generation
Code Code Available 0TGIF: A New Dataset and Benchmark on Animated GIF Description Apr 10, 2016 Image Captioning Machine Translation
Code Code Available 0Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Apr 6, 2016 Descriptive Language Modeling
Code Code Available 0Vectors of Locally Aggregated Centers for Compact Video Representation Sep 13, 2015 Clustering Video Description
— Unverified 0A Multi-scale Multiple Instance Video Description Network May 21, 2015 Image Segmentation Multiple Instance Learning
— Unverified 0Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research Mar 3, 2015 Descriptive Video Description
Code Code Available 1Describing Videos by Exploiting Temporal Structure Feb 27, 2015 Action Recognition Image Description
Code Code Available 0Probabilistic Soft Logic for Semantic Textual Similarity Jun 1, 2014 Semantic Textual Similarity Video Description
— Unverified 0Coherent Multi-Sentence Video Description with Variable Level of Detail Mar 24, 2014 Sentence Video Description
— Unverified 0Semantic Neighborhoods as Hypergraphs Aug 1, 2013 Machine Translation Paraphrase Generation
— Unverified 0HENRY-CORE: Domain Adaptation and Stacking for Text Similarity Jun 1, 2013 Domain Adaptation Machine Translation
— Unverified 0Better Exploiting Motion for Better Action Recognition Jun 1, 2013 Action Recognition Image Retrieval
— Unverified 0