Are metrics measuring what they should? An evaluation of image captioning task metrics Jul 4, 2022 Image Captioning
— Unverified 00 A Review of Multi-Modal Large Language and Vision Models Mar 28, 2024 Image Captioning Prompt Engineering
— Unverified 00 ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding May 9, 2025 Image Captioning Object Recognition
— Unverified 00 A Scaled Encoder Decoder Network for Image Captioning in Hindi Dec 1, 2021 Decoder Deep Learning
— Unverified 00 A Self-Boosting Framework for Automated Radiographic Report Generation Jun 19, 2021 Image Captioning Image-text matching
— Unverified 00 A Self-Explainable Stylish Image Captioning Framework via Multi-References Oct 20, 2021 Image Captioning
— Unverified 00 A Self-Guided Framework for Radiology Report Generation Jun 19, 2022 Image Captioning Medical Report Generation
— Unverified 00 A sequential guiding network with attention for image captioning Nov 1, 2018 Decoder Image Captioning
— Unverified 00 As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? Mar 19, 2024 Adversarial Attack Image Captioning
— Unverified 00 Assessing Image Quality Issues for Real-World Problems Mar 27, 2020 Image Captioning Question Answering
— Unverified 00 Assisting Scene Graph Generation with Self-Supervision Aug 8, 2020 Graph Generation Image Captioning
— Unverified 00 Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review Jun 28, 2024 Active Learning Image Captioning
— Unverified 00 Astrea: A MOE-based Visual Understanding Model with Progressive Alignment Mar 12, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 00 AstroLLaVA: towards the unification of astronomical data and natural language Apr 11, 2025 Astronomy Image Captioning
— Unverified 00 A Survey of Evaluation Metrics Used for NLG Systems Aug 27, 2020 Image Captioning nlg evaluation
— Unverified 00 A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jun 12, 2023 Image Captioning Machine Translation
— Unverified 00 A survey on knowledge-enhanced multimodal learning Nov 19, 2022 Conditional Image Generation Factual Visual Question Answering
— Unverified 00 A Survey on Large Language Models from Concept to Implementation Mar 27, 2024 Chatbot Image Captioning
— Unverified 00 Asynchronous Evolution of Deep Neural Network Architectures Aug 8, 2023 Evolutionary Algorithms Image Captioning
— Unverified 00 A TextGCN-Based Decoding Approach for Improving Remote Sensing Image Captioning Sep 27, 2024 Decoder Fairness
— Unverified 00 A Thorough Review on Recent Deep Learning Methodologies for Image Captioning Jul 28, 2021 Caption Generation Descriptive
— Unverified 00 A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) May 2, 2024 Acoustic Scene Classification Event Detection
— Unverified 00 Attend More Times for Image Captioning Dec 8, 2018 Image Captioning
— Unverified 00 Attention-based Multimodal Neural Machine Translation Aug 1, 2016 Image Captioning Machine Translation
— Unverified 00 Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation Jun 3, 2025 Caption Generation Image Captioning
— Unverified 00 Attention Beam: An Image Captioning Approach Nov 3, 2020 Decoder Image Captioning
— Unverified 00 Attention Correctness in Neural Image Captioning May 31, 2016 Image Captioning
— Unverified 00 Attention Strategies for Multi-Source Sequence-to-Sequence Learning Jul 1, 2017 Automatic Post-Editing Image Captioning
— Unverified 00 Attentive Language Models Nov 1, 2017 Image Captioning Machine Translation
— Unverified 00 Attentive Tensor Product Learning Feb 20, 2018 Constituency Parsing Deep Learning
— Unverified 00 Attr2Style: A Transfer Learning Approach for Inferring Fashion Styles via Apparel Attributes Aug 26, 2020 Attribute Image Captioning
— Unverified 00 AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 00 ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries Oct 24, 2019 Image Captioning Object Recognition
— Unverified 00 Augmenting Image Question Answering Dataset by Exploiting Image Captions May 1, 2018 Data Augmentation Image Captioning
— Unverified 00 A Unified Sequence Interface for Vision Tasks Jun 15, 2022 Image Captioning Instance Segmentation
— Unverified 00 AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark Oct 4, 2024 Image Captioning Video Understanding
— Unverified 00 AutoCaption: Image Captioning with Neural Architecture Search Dec 16, 2020 Decoder Image Captioning
— Unverified 00 Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Jun 28, 2024 Image Captioning
— Unverified 00 Automated Audio Captioning with Recurrent Neural Networks Jun 30, 2017 Audio captioning Decoder
— Unverified 00 Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments Jun 4, 2016 Image Captioning Word Embeddings
— Unverified 00 Automated Report Generation for Lung Cytological Images Using a CNN Vision Classifier and Multiple-Transformer Text Decoders: Preliminary Study Mar 26, 2024 Decoder Image Captioning
— Unverified 00 Automatic Myanmar Image Captioning using CNN and LSTM-Based Language Model May 1, 2020 Image Captioning Language Modeling
— Unverified 00 Automatic Radiology Report Generation based on Multi-view Image Fusion and Medical Concept Enrichment Jul 22, 2019 Decoder Descriptive
— Unverified 00 Auto-Parsing Network for Image Captioning and Visual Question Answering Aug 24, 2021 Image Captioning Question Answering
— Unverified 00 A vision-grounded dataset for predicting typical locations for verbs May 1, 2018 Common Sense Reasoning Image Captioning
— Unverified 00 A Visually-Grounded Parallel Corpus with Phrase-to-Region Linking May 1, 2020 Image Captioning Machine Translation
— Unverified 00 A Weighted Multi-Criteria Decision Making Approach for Image Captioning Mar 17, 2019 Decision Making Image Captioning
— Unverified 00 AZMAT: Sentence Similarity Using Associative Matrices Jun 1, 2015 Image Captioning Semantic Textual Similarity
— Unverified 00 Backdooring Vision-Language Models with Out-Of-Distribution Data Oct 2, 2024 Image Captioning Image to text
— Unverified 00 Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing Oct 23, 2024 Adversarial Attack Backdoor Attack
— Unverified 00