An Attempt towards Interpretable Audio-Visual Video Captioning | Dec 7, 2018 | Audio captioning, Audio-Visual Video Captioning
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Dec 19, 2024 | Depth Estimation, Image Captioning
Automatic Myanmar Image Captioning using CNN and LSTM-Based Language Model | May 1, 2020 | Image Captioning, Language Modeling
Cooperative image captioning | Jul 26, 2019 | Image Captioning
Automated Report Generation for Lung Cytological Images Using a CNN Vision Classifier and Multiple-Transformer Text Decoders: Preliminary Study | Mar 26, 2024 | Decoder, Image Captioning
Convolutional Prototype Learning for Zero-Shot Recognition | Oct 22, 2019 | Attribute, Image Captioning
Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution | Jan 1, 2025 | Depth Estimation, Image Captioning
Analysis of Convolutional Decoder for Image Caption Generation | Mar 8, 2021 | Caption Generation, Data Augmentation
Controlled Caption Generation for Images Through Adversarial Attacks | Jul 7, 2021 | Caption Generation, Image Captioning
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism | Mar 3, 2022 | Caption Generation, Decoder
Controllable Image Captioning via Prompting | Dec 4, 2022 | controllable image captioning, Image Captioning
Controllable Image Captioning | Apr 28, 2022 | controllable image captioning, Decoder
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments | Jun 4, 2016 | Image Captioning, Word Embeddings
Control Image Captioning Spatially and Temporally | Aug 1, 2021 | Contrastive Learning, Image Captioning
Automated Audio Captioning with Recurrent Neural Networks | Jun 30, 2017 | Audio captioning, Decoder
Fluent and Accurate Image Captioning with a Self-Trained Reward Model | Aug 29, 2024 | Image Captioning, Specificity
Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations | Mar 14, 2022 | Image Captioning, Semantic Textual Similarity
A Multimodal Memes Classification: A Survey and Open Research Issues | Sep 17, 2020 | Classification, General Classification
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder | Jun 29, 2021 | Image Captioning, Representation Learning
Contrastive Learning for Image Captioning | Oct 6, 2017 | Contrastive Learning, Image Captioning
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language | Jun 28, 2024 | Image Captioning
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning | Jun 26, 2019 | Decoder, Image Captioning
AutoCaption: Image Captioning with Neural Architecture Search | Dec 16, 2020 | Decoder, Image Captioning
Continuous multilinguality with language vectors | Apr 1, 2017 | Image Captioning, Language Modeling
A Multimodal Approach for Cross-Domain Image Retrieval | Mar 22, 2024 | Image Captioning, Image Retrieval
Abstractive Document Summarization with a Graph-Based Attentional Neural Model | Jul 1, 2017 | Abstractive Text Summarization, Document Summarization
Fine-Grained Video Captioning through Scene Graph Consolidation | Feb 23, 2025 | Caption Generation, Image Captioning
Contextual Memory Trees | Jul 17, 2018 | General Classification, Image Captioning
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning | Apr 26, 2021 | Avg, Image Captioning
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark | Oct 4, 2024 | Image Captioning, Video Understanding
Contextual Emotion Recognition using Large Vision Language Models | May 14, 2024 | Decision Making, Emotion Recognition
Contextual Emotion Estimation from Image Captions | Sep 22, 2023 | Image Captioning, Language Modelling
A Unified Sequence Interface for Vision Tasks | Jun 15, 2022 | Image Captioning, Instance Segmentation
American == White in Multimodal Language-and-Image AI | Jul 1, 2022 | Image Captioning, Question Answering
Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity | Mar 31, 2025 | Image Captioning, Optical Character Recognition
Augmenting Image Question Answering Dataset by Exploiting Image Captions | May 1, 2018 | Data Augmentation, Image Captioning
A Dataset and Reranking Method for Multimodal MT of User-Generated Image Captions | Mar 1, 2018 | Image Captioning, Machine Translation
Context-Aware Group Captioning via Self-Attention and Contrastive Features | Apr 7, 2020 | Image Captioning
A Medical Semantic-Assisted Transformer for Radiographic Report Generation | Aug 22, 2022 | Image Captioning, Medical Report Generation
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity | Nov 23, 2024 | Attribute, Cross-Modal Retrieval
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing | Feb 23, 2024 | Image Captioning, Image Retrieval
Focused Evaluation for Image Description with Binary Forced-Choice Tasks | Aug 1, 2016 | Image Captioning, Image Description
Generating Triples with Adversarial Networks for Scene Graph Construction | Feb 7, 2018 | Attribute, graph construction
A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction | Dec 19, 2015 | Atari Games, Image Captioning
ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries | Oct 24, 2019 | Image Captioning, Object Recognition
Feature Fusion Effects of Tensor Product Representation on (De)Compositional Network for Caption Generation for Images | Dec 17, 2018 | Caption Generation, Image Captioning
Consistent Multiple Sequence Decoding | Apr 2, 2020 | Decoder, Diversity
Consistency Model is an Effective Posterior Sample Approximation for Diffusion Inverse Solvers | Feb 9, 2024 | Image Captioning, Semantic Segmentation
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms | Nov 9, 2018 | GPU, Image Captioning
Feedback is Needed for Retakes: An Explainable Poor Image Notification Framework for the Visually Impaired | Nov 17, 2022 | Image Captioning