AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 0Generating Video Descriptions with Topic Guidance Aug 31, 2017 Decoder Image Captioning
— Unverified 0Consensus Graph Representation Learning for Better Grounded Image Captioning Dec 2, 2021 Graph Representation Learning Hallucination
— Unverified 0Attr2Style: A Transfer Learning Approach for Inferring Fashion Styles via Apparel Attributes Aug 26, 2020 Attribute Image Captioning
— Unverified 0Attentive Tensor Product Learning Feb 20, 2018 Constituency Parsing Deep Learning
— Unverified 0Altogether: Image Captioning via Re-aligning Alt-text Oct 22, 2024 Image Captioning image-classification
— Unverified 0A Dataset and Benchmarks for Multimedia Social Analysis Jun 5, 2020 Image Captioning image-classification
— Unverified 0Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge Jun 18, 2023 Image Captioning Language Modelling
— Unverified 0Connecting Language and Vision to Actions Jul 1, 2018 Image Captioning Language Modeling
— Unverified 0Trust It or Not: Confidence-Guided Automatic Radiology Report Generation Jun 21, 2021 Decision Making Image Captioning
— Unverified 0Attentive Language Models Nov 1, 2017 Image Captioning Machine Translation
— Unverified 0Attention Strategies for Multi-Source Sequence-to-Sequence Learning Jul 1, 2017 Automatic Post-Editing Image Captioning
— Unverified 0The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models Aug 2, 2024 Image Captioning
— Unverified 0Generating Image Descriptions using Multilingual Data Sep 1, 2017 Image Captioning Language Modeling
— Unverified 0Attention Correctness in Neural Image Captioning May 31, 2016 Image Captioning
— Unverified 0Compressing Visual-linguistic Model via Knowledge Distillation Apr 5, 2021 Image Captioning Knowledge Distillation
— Unverified 0Compressed Image Captioning using CNN-based Encoder-Decoder Framework Apr 28, 2024 Decoder Image Captioning
— Unverified 0Generating image captions with external encyclopedic knowledge Oct 10, 2022 Caption Generation Image Captioning
— Unverified 0Generating Natural Language Descriptions for Semantic Representations of Human Brain Activity Aug 1, 2016 Image Captioning
— Unverified 0Generative Bridging Network for Neural Sequence Prediction Jun 1, 2018 Abstractive Text Summarization Image Captioning
— Unverified 0Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data Dec 11, 2023 Image Captioning Image-text Retrieval
— Unverified 0Attention Beam: An Image Captioning Approach Nov 3, 2020 Decoder Image Captioning
— Unverified 0Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation Jun 3, 2025 Caption Generation Image Captioning
— Unverified 0Generating Diverse and Descriptive Image Captions Using Visual Paraphrases Oct 1, 2019 Descriptive Diversity
— Unverified 0Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Jun 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Attention-based Multimodal Neural Machine Translation Aug 1, 2016 Image Captioning Machine Translation
— Unverified 0Generating captions without looking beyond objects Oct 12, 2016 Caption Generation Image Captioning
— Unverified 0Competence-based Multimodal Curriculum Learning for Medical Report Generation Jun 24, 2022 Image Captioning Medical Report Generation
— Unverified 0Comparing Recurrent and Convolutional Architectures for English-Hindi Neural Machine Translation Nov 1, 2017 Decoder Image Captioning
— Unverified 0All You May Need for VQA are Image Captions Jan 16, 2022 All Image Captioning
— Unverified 0Generating Description for Sequential Images with Local-Object Attention Conditioned on Global Semantic Context Nov 1, 2018 Image Captioning Text Generation
— Unverified 0Generating Diverse and Informative Natural Language Fashion Feedback Jun 15, 2019 Decoder Image Captioning
— Unverified 0Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets Jul 14, 2020 Image Captioning Retrieval
— Unverified 0Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning Mar 5, 2023 Image Captioning
— Unverified 0Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network Jul 9, 2020 Decoder Image Captioning
— Unverified 0Attend More Times for Image Captioning Dec 8, 2018 Image Captioning
— Unverified 0Generalized Visual Relation Detection with Diffusion Models Apr 16, 2025 Graph Generation Human-Object Interaction Detection
— Unverified 0ComicsPAP: understanding comic strips by picking the correct panel Mar 11, 2025 Image Captioning Visual Question Answering (VQA)
— Unverified 0Combine to Describe: Evaluating Compositional Generalization in Image Captioning May 1, 2022 Image Captioning
— Unverified 0Alleviating Noisy Data in Image Captioning with Cooperative Distillation Dec 21, 2020 Image Captioning
— Unverified 0A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) May 2, 2024 Acoustic Scene Classification Event Detection
— Unverified 0A Baseline for Detecting Out-of-Distribution Examples in Image Captioning Jul 12, 2022 Image Captioning Out of Distribution (OOD) Detection
— Unverified 0Generalizing Image Captions for Image-Text Parallel Corpus Aug 1, 2013 Image Captioning Sentence Compression
— Unverified 0Generating Image Captions in Arabic using Root-Word Based Recurrent Neural Networks and Deep Neural Networks Jun 1, 2018 Caption Generation Image Captioning
— Unverified 0Generative Distribution Prediction: A Unified Approach to Multimodal Learning Feb 10, 2025 Domain Adaptation Image Captioning
— Unverified 0Cold Fusion: Training Seq2Seq Models Together with Language Models Aug 21, 2017 Image Captioning Language Modeling
— Unverified 0A Thorough Review on Recent Deep Learning Methodologies for Image Captioning Jul 28, 2021 Caption Generation Descriptive
— Unverified 0AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation Mar 18, 2022 Descriptive Image Captioning
— Unverified 0COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Feb 4, 2025 Image Captioning Panoptic Segmentation
— Unverified 0A TextGCN-Based Decoding Approach for Improving Remote Sensing Image Captioning Sep 27, 2024 Decoder Fairness
— Unverified 0