Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning (Dec 5, 2024). Tags: Comment Generation, Decoder
Language-Driven Region Pointer Advancement for Controllable Image Captioning (Nov 30, 2020). Tags: controllable image captioning, Image Captioning
Compositional Generalization in Image Captioning (Sep 10, 2019). Tags: Caption Generation, Image Captioning
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning (Sep 16, 2021). Tags: Decoder, Image Captioning
Kvasir-VQA: A Text-Image Pair GI Tract Dataset (Sep 2, 2024). Tags: Image Captioning, Image Generation
Efficient Modeling of Future Context for Image Captioning (Jul 22, 2022). Tags: Image Captioning, Sentence
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model (May 3, 2024). Tags: Image Captioning, Instruction Following
RONA: Pragmatically Diverse Image Captioning with Coherence Relations (Mar 14, 2025). Tags: Diversity, Image Captioning
LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting (May 31, 2023). Tags: Decoder, Image Captioning
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning (Dec 6, 2016). Tags: Decoder, Image Captioning
RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering (Oct 19, 2023). Tags: Image Captioning, Question Answering
COMIC: Towards A Compact Image Captioning Model with Attention (Mar 4, 2019). Tags: Decoder, Image Captioning
Efficient CNN-LSTM based Image Captioning using Neural Network Compression (Dec 17, 2020). Tags: Decoder, Image Captioning
Look and Modify: Modification Networks for Image Captioning (Sep 7, 2019). Tags: Decoder, Image Captioning
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations (May 15, 2019). Tags: Image Captioning, Question Answering
Token-level and sequence-level loss smoothing for RNN language models (May 14, 2018). Tags: Image Captioning, Machine Translation
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization (Sep 22, 2024). Tags: Hallucination, Hallucination Evaluation
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers (Apr 21, 2024). Tags: Diagnostic, Image Captioning
Topic-Guided Attention for Image Captioning (Jul 10, 2018). Tags: Image Captioning
KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph (Sep 17, 2024). Tags: cross-modal alignment, Image Captioning
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts (Dec 18, 2024). Tags: Action Detection, Descriptive
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning (Aug 5, 2021). Tags: Image Captioning, Object
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images (Sep 19, 2024). Tags: Hallucination, Image Captioning
Does the Performance of Text-to-Image Retrieval Models Generalize Beyond Captions-as-a-Query? (Mar 15, 2024). Tags: Descriptive, Image Captioning
Meshed-Memory Transformer for Image Captioning (Dec 17, 2019). Tags: Image Captioning, Machine Translation
Automated Image Captioning with CNNs and Transformers (Dec 13, 2024). Tags: Descriptive, Hyperparameter Optimization
Journalistic Guidelines Aware News Image Captioning (Sep 7, 2021). Tags: Caption Generation, Descriptive
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models (Nov 7, 2023). Tags: Image Captioning
Machine-in-the-Loop Rewriting for Creative Image Captioning (Nov 7, 2021). Tags: Descriptive, Image Captioning
Aligning Linguistic Words and Visual Semantic Units for Image Captioning (Aug 6, 2019). Tags: Attribute, Image Captioning
Auto-Encoding Scene Graphs for Image Captioning (Dec 6, 2018). Tags: Decoder, Image Captioning
Document Modeling with External Attention for Sentence Extraction (Jul 1, 2018). Tags: Answer Selection, Document Summarization
iPIC-XAI: Improving PIC-XAI for Enhanced Image Captioning Explanation (Sep 23, 2023). Tags: Image Captioning, TAG
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport (May 29, 2025). Tags: Document Level Machine Translation, Image Captioning
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning (Nov 17, 2016). Tags: Image Captioning, Sentence
Diverse Image Captioning with Grounded Style (May 3, 2022). Tags: Attribute, Diversity
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image (Jun 12, 2018). Tags: Image Captioning, Question Answering
Cold-Start Reinforcement Learning with Softmax Policy Gradient (Sep 27, 2017). Tags: Image Captioning, Policy Gradient Methods
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts (Jul 7, 2020). Tags: Image Captioning, Sentence
ZoDIAC: Zoneout Dropout Injection Attention Calculation (Jun 28, 2022). Tags: Image Captioning, image-classification
Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding (Sep 1, 2023). Tags: Graph Generation, Image Captioning
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation (Nov 30, 2023). Tags: Image Captioning, Referring Expression
A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks (Oct 10, 2024). Tags: Fairness, Image Captioning
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs (Jul 13, 2023). Tags: Image Captioning
Improving Reinforcement Learning Based Image Captioning with Natural Language Prior (Sep 13, 2018). Tags: Image Captioning, reinforcement-learning
Wasserstein Barycenter Model Ensembling (Feb 13, 2019). Tags: Attribute, General Classification
Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering (Sep 9, 2023). Tags: Image Captioning, Image-text matching
Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process (Aug 14, 2019). Tags: Diversity, Image Captioning
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval (May 21, 2025). Tags: counterfactual, Graph Generation
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks (Jun 9, 2015). Tags: Constituency Parsing, Image Captioning