Attention-Based Multimodal Fusion for Video Description Jan 11, 2017 Decoder Sentence
— Unverified 0 Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions Aug 27, 2018 Translation Video Description
— Unverified 0 AVD2: Accident Video Diffusion for Accident Video Description Feb 20, 2025 Autonomous Driving Scene Understanding
— Unverified 0 Relational Graph Learning for Grounded Video Description Generation Dec 2, 2021 Graph Learning Hallucination
— Unverified 0 Saarland: Vector-based models of semantic textual similarity Jul 1, 2012 Semantic Textual Similarity Video Description
— Unverified 0 Semantic Neighborhoods as Hypergraphs Aug 1, 2013 Machine Translation Paraphrase Generation
— Unverified 0 SHEF-Multimodal: Grounding Machine Translation on Images Aug 1, 2016 Machine Translation Multimodal Machine Translation
— Unverified 0 SRIUBC: Simple Similarity Features for Semantic Textual Similarity Jul 1, 2012 Natural Language Inference Paraphrase Identification
— Unverified 0 Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation Dec 28, 2021 Image Captioning Machine Translation
— Unverified 0 Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description Jul 1, 2017 Video Captioning Video Description
— Unverified 0 Technical Report: Competition Solution For Modelscope-Sora Sep 24, 2024 Text-to-Video Generation Video Description
— Unverified 0 The Role of the Input in Natural Language Video Description Feb 9, 2021 Data Augmentation Video Description
— Unverified 0 Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Jan 14, 2025 Object Recognition Text Generation
— Unverified 0 Unbox the Blackbox: Predict and Interpret YouTube Viewership Using Deep Learning Dec 21, 2020 Misinformation Prediction
— Unverified 0 Vectors of Locally Aggregated Centers for Compact Video Representation Sep 13, 2015 Clustering Video Description
— Unverified 0 VideoA11y: Method and Dataset for Accessible Video Description Feb 27, 2025 Video Description
— Unverified 0 VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models Oct 1, 2024 Hallucination text similarity
— Unverified 0 Video Description: A Survey of Methods, Datasets and Evaluation Metrics Jun 1, 2018 Diversity Language Modeling
— Unverified 0 VideoMCC: a New Benchmark for Video Comprehension Jun 23, 2016 Multiple-choice Video Description
— Unverified 0 Visual-aware Attention Dual-stream Decoder for Video Captioning Oct 16, 2021 Decoder Video Captioning
— Unverified 0 A Comprehensive Review on Recent Methods and Challenges of Video Description Nov 30, 2020 Machine Translation Survey
— Unverified 0 JU_CSE_NLP: Multi-grade Classification of Semantic Similarity between Text Pairs Jul 1, 2012 General Classification Semantic Similarity
— Unverified 0 Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Aug 19, 2024 Instruction Following Large Language Model
— Unverified 0 LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living Jun 13, 2024 Benchmarking Human-Object Interaction Detection
— Unverified 0 MSR-VTT: A Large Video Description Dataset for Bridging Video and Language Jun 1, 2016 Image Captioning Sentence
— Unverified 0 MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish Dec 13, 2020 Machine Translation Multimodal Machine Translation
— Unverified 0 Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering Jan 3, 2020 Question Answering Video Description
— Unverified 0 Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data Jul 1, 2018 Image Description Machine Translation
— Unverified 0 Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews) Jan 23, 2024 Miscellaneous Video Description
— Unverified 0 Multi Sentence Description of Complex Manipulation Action Videos Nov 13, 2023 Decoder Sentence
— Unverified 0 NarrationBot and InfoBot: A Hybrid System for Automated Video Description Nov 7, 2021 Video Description
— Unverified 0 Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis Aug 1, 2016 Action Classification Object Recognition
— Unverified 0 Neural Headline Generation on Abstract Meaning Representation Nov 1, 2016 Abstract Meaning Representation Dependency Parsing
— Unverified 0 Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model May 12, 2016 Language Modeling Language Modelling
— Unverified 0 Probabilistic Soft Logic for Semantic Textual Similarity Jun 1, 2014 Semantic Textual Similarity Video Description
— Unverified 0 PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation Oct 30, 2024 Anomaly Detection Descriptive
— Unverified 0 JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models Mar 5, 2024 In-Context Learning Video Description
Code Available 0 Predicting Visual Features from Text for Image and Video Caption Retrieval Sep 5, 2017 Retrieval Sentence
Code Available 0 Describing Videos by Exploiting Temporal Structure Feb 27, 2015 Action Recognition Image Description
Code Available 0 Learn to Understand Negation in Video Retrieval Apr 30, 2022 Natural Language Queries Negation
Code Available 0 Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents Aug 18, 2020 Video Description
Code Available 0 Memory-augmented Attention Modelling for Videos Nov 7, 2016 Video Description
Code Available 0 TGIF: A New Dataset and Benchmark on Animated GIF Description Apr 10, 2016 Image Captioning Machine Translation
Code Available 0 MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian Jun 20, 2023 Cross-Lingual Transfer Retrieval
Code Available 0 Adversarial Inference for Multi-Sentence Video Description Dec 13, 2018 Diversity Image Captioning
Code Available 0 Egocentric Video Description based on Temporally-Linked Sequences Apr 7, 2017 Decoder Video Description
Code Available 0 Video Description using Bidirectional Recurrent Neural Networks Apr 12, 2016 Decoder Text Generation
Code Available 0 Edit As You Wish: Video Caption Editing with Multi-grained User Control May 15, 2023 Attribute Position
Code Available 0 Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Dec 17, 2024 Dense Video Captioning Descriptive
Code Available 0 SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving Jul 18, 2024 Autonomous Driving Image Generation
Code Code Available 0