Semantic Neighborhoods as Hypergraphs Aug 1, 2013 Machine Translation Paraphrase Generation
— Unverified 0SHEF-Multimodal: Grounding Machine Translation on Images Aug 1, 2016 Machine Translation Multimodal Machine Translation
— Unverified 0SRIUBC: Simple Similarity Features for Semantic Textual Similarity Jul 1, 2012 Natural Language Inference Paraphrase Identification
— Unverified 0Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation Dec 28, 2021 Image Captioning Machine Translation
— Unverified 0Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description Jul 1, 2017 Video Captioning Video Description
— Unverified 0Technical Report: Competition Solution For Modelscope-Sora Sep 24, 2024 Text-to-Video Generation Video Description
— Unverified 0The Role of the Input in Natural Language Video Description Feb 9, 2021 Data Augmentation Video Description
— Unverified 0Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Jan 14, 2025 Object Recognition Text Generation
— Unverified 0Unbox the Blackbox: Predict and Interpret YouTube Viewership Using Deep Learning Dec 21, 2020 Misinformation Prediction
— Unverified 0Vectors of Locally Aggregated Centers for Compact Video Representation Sep 13, 2015 Clustering Video Description
— Unverified 0VideoA11y: Method and Dataset for Accessible Video Description Feb 27, 2025 Video Description
— Unverified 0VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models Oct 1, 2024 Hallucination text similarity
— Unverified 0Video Description: A Survey of Methods, Datasets and Evaluation Metrics Jun 1, 2018 Diversity Language Modeling
— Unverified 0VideoMCC: a New Benchmark for Video Comprehension Jun 23, 2016 Multiple-choice Video Description
— Unverified 0Visual-aware Attention Dual-stream Decoder for Video Captioning Oct 16, 2021 Decoder Video Captioning
— Unverified 0A Comprehensive Review on Recent Methods and Challenges of Video Description Nov 30, 2020 Machine Translation Survey
— Unverified 0Incorporating Background Knowledge into Video Description Generation Oct 1, 2018 Decoder Text Generation
— Unverified 0Incorporating Global Visual Features into Attention-based Neural Machine Translation. Sep 1, 2017 Decoder Machine Translation
— Unverified 0Incorporating Semantic Attention in Video Description Generation May 1, 2018 Image Captioning Image Classification
— Unverified 0Integrating both Visual and Audio Cues for Enhanced Video Caption Nov 22, 2017 Descriptive Sentence
— Unverified 0Interpretable Video Captioning via Trajectory Structured Localization Jun 1, 2018 Decoder Image Captioning
— Unverified 0JU\_CSE\_NLP: Multi-grade Classification of Semantic Similarity between Text Pairs Jul 1, 2012 General Classification Semantic Similarity
— Unverified 0Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Aug 19, 2024 Instruction Following Large Language Model
— Unverified 0LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living Jun 13, 2024 Benchmarking Human-Object Interaction Detection
— Unverified 0MSR-VTT: A Large Video Description Dataset for Bridging Video and Language Jun 1, 2016 Image Captioning Sentence
— Unverified 0MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish Dec 13, 2020 Machine Translation Multimodal Machine Translation
— Unverified 0Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering Jan 3, 2020 Question Answering Video Description
— Unverified 0Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data Jul 1, 2018 Image Description Machine Translation
— Unverified 0Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews) Jan 23, 2024 Miscellaneous Video Description
— Unverified 0Multi Sentence Description of Complex Manipulation Action Videos Nov 13, 2023 Decoder Sentence
— Unverified 0NarrationBot and InfoBot: A Hybrid System for Automated Video Description Nov 7, 2021 Video Description
— Unverified 0Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis Aug 1, 2016 Action Classification Object Recognition
— Unverified 0Neural Headline Generation on Abstract Meaning Representation Nov 1, 2016 Abstract Meaning Representation Dependency Parsing
— Unverified 0Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model May 12, 2016 Language Modeling Language Modelling
— Unverified 0Probabilistic Soft Logic for Semantic Textual Similarity Jun 1, 2014 Semantic Textual Similarity Video Description
— Unverified 0Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Apr 6, 2016 Descriptive Language Modeling
Code Code Available 0SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving Jul 18, 2024 Autonomous Driving Image Generation
Code Code Available 0VizSeq: A Visual Analysis Toolkit for Text Generation Tasks Sep 12, 2019 Benchmarking Image Captioning
Code Code Available 0End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features Jun 21, 2018 Question Answering Video Description
Code Code Available 0Egocentric Video Description based on Temporally-Linked Sequences Apr 7, 2017 Decoder Video Description
Code Code Available 0Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Dec 17, 2024 Dense Video Captioning Descriptive
Code Code Available 0JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models Mar 5, 2024 In-Context Learning Video Description
Code Code Available 0Predicting Visual Features from Text for Image and Video Caption Retrieval Sep 5, 2017 Retrieval Sentence
Code Code Available 0Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents Aug 18, 2020 Video Description
Code Code Available 0Learn to Understand Negation in Video Retrieval Apr 30, 2022 Natural Language Queries Negation
Code Code Available 0A Mid-level Video Representation based on Binary Descriptors: A Case Study for Pornography Detection May 12, 2016 Pornography Detection Video Description
Code Code Available 0Memory-augmented Attention Modelling for Videos Nov 7, 2016 Video Description
Code Code Available 0TGIF: A New Dataset and Benchmark on Animated GIF Description Apr 10, 2016 Image Captioning Machine Translation
Code Code Available 0MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian Jun 20, 2023 Cross-Lingual Transfer Retrieval
Code Code Available 0Adversarial Inference for Multi-Sentence Video Description Dec 13, 2018 Diversity Image Captioning
Code Code Available 0