Im2Text: Describing Images Using 1 Million Captioned Photographs Dec 1, 2011 Image Captioning Image Description
— Unverified 0Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models Mar 12, 2025 Cross-Lingual Transfer Image Captioning
— Unverified 0Flowing from Words to Pixels: A Framework for Cross-Modality Evolution Dec 19, 2024 Depth Estimation Image Captioning
— Unverified 0Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution Jan 1, 2025 Depth Estimation Image Captioning
— Unverified 0A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jun 12, 2023 Image Captioning Machine Translation
— Unverified 0Fluent and Accurate Image Captioning with a Self-Trained Reward Model Aug 29, 2024 Image Captioning Specificity
— Unverified 0Focused Evaluation for Image Description with Binary Forced-Choice Tasks Aug 1, 2016 Image Captioning Image Description
— Unverified 0Focus! Relevant and Sufficient Context Selection for News Image Captioning Dec 1, 2022 Image Captioning Relation Extraction
— Unverified 0FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes Sep 6, 2024 Domain Adaptation Image Captioning
— Unverified 0Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization Apr 14, 2025 Benchmarking Earth Observation
— Unverified 0Attend More Times for Image Captioning Dec 8, 2018 Image Captioning
— Unverified 0FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images Sep 24, 2023 Attribute Caption Generation
— Unverified 0Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg Dec 28, 2021 Image Captioning
— Unverified 0From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models Mar 8, 2025 Image Captioning Language Modeling
— Unverified 0Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning Mar 5, 2023 Image Captioning
— Unverified 0How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey Dec 11, 2024 Image Captioning Question Answering
— Unverified 0From Pixels to Prose: A Large Dataset of Dense Image Captions Jun 14, 2024 Image Captioning
— Unverified 0Aligning Large Multimodal Models with Factually Augmented RLHF Sep 25, 2023 Hallucination Image Captioning
— Unverified 0From Show to Tell: A Survey on Deep Learning-based Image Captioning Jul 14, 2021 Image Captioning Language Modelling
— Unverified 0Comparing Recurrent and Convolutional Architectures for English-Hindi Neural Machine Translation Nov 1, 2017 Decoder Image Captioning
— Unverified 0How to Bridge the Gap between Modalities: Survey on Multimodal Large Language Model Nov 10, 2023 Image Captioning Language Modeling
— Unverified 0Exposing and Correcting the Gender Bias in Image Captioning Datasets and Models Dec 2, 2019 Gender Classification Image Captioning
— Unverified 0FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs Sep 20, 2024 Image Captioning Image Comprehension
— Unverified 0Exploring Visual Relationship for Image Captioning Sep 19, 2018 Decoder Image Captioning
— Unverified 0Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Jun 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fusion Models for Improved Visual Captioning Oct 28, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Survey of Evaluation Metrics Used for NLG Systems Aug 27, 2020 Image Captioning nlg evaluation
— Unverified 0How Vision Affects Language: Comparing Masked Self-Attention in Uni-Modal and Multi-Modal Transformer Jun 1, 2021 Image Captioning Machine Translation
— Unverified 0Human Action Adverb Recognition: ADHA Dataset and A Three-Stream Hybrid Model Feb 4, 2018 Action Recognition Image Captioning
— Unverified 0GCS-M3VLT: Guided Context Self-Attention based Multi-modal Medical Vision Language Transformer for Retinal Image Captioning Dec 23, 2024 Image Captioning Language Modeling
— Unverified 0Exploring Visual Culture Awareness in GPT-4V: A Comprehensive Probing Feb 8, 2024 Image Captioning TAG
— Unverified 0AstroLLaVA: towards the unification of astronomical data and natural language Apr 11, 2025 Astronomy Image Captioning
— Unverified 0Generalized Visual Relation Detection with Diffusion Models Apr 16, 2025 Graph Generation Human-Object Interaction Detection
— Unverified 03D Spatial Understanding in MLLMs: Disambiguation and Evaluation Dec 9, 2024 3D dense captioning 3D visual grounding
— Unverified 0Exploring Affordance and Situated Meaning in Image Captions: A Multimodal Analysis May 24, 2023 Image Captioning Natural Language Understanding
— Unverified 0Exploring the Functional and Geometric Bias of Spatial Relations Using Neural Language Models Jun 1, 2018 Image Captioning
— Unverified 0Generating Description for Sequential Images with Local-Object Attention Conditioned on Global Semantic Context Nov 1, 2018 Image Captioning Text Generation
— Unverified 0Generating Diverse and Descriptive Image Captions Using Visual Paraphrases Oct 1, 2019 Descriptive Diversity
— Unverified 0CLAMP: Contrastive LAnguage Model Prompt-tuning Dec 4, 2023 Contrastive Learning Image Captioning
— Unverified 0Generating Diverse and Informative Natural Language Fashion Feedback Jun 15, 2019 Decoder Image Captioning
— Unverified 0HOW IMPORTANT ARE NETWORK WEIGHTS? TO WHAT EXTENT DO THEY NEED AN UPDATE? Jan 1, 2020 Image Captioning
— Unverified 0Generating image captions with external encyclopedic knowledge Oct 10, 2022 Caption Generation Image Captioning
— Unverified 0Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions Feb 20, 2024 Image Captioning Question Answering
— Unverified 0Attention Strategies for Multi-Source Sequence-to-Sequence Learning Jul 1, 2017 Automatic Post-Editing Image Captioning
— Unverified 0Exploring Spatial Language Grounding Through Referring Expressions Feb 4, 2025 Image Captioning Negation
— Unverified 0Generating Natural Language Descriptions for Semantic Representations of Human Brain Activity Aug 1, 2016 Image Captioning
— Unverified 0Generating Triples with Adversarial Networks for Scene Graph Construction Feb 7, 2018 Attribute graph construction
— Unverified 0Generating Video Descriptions with Topic Guidance Aug 31, 2017 Decoder Image Captioning
— Unverified 0Connecting Language and Vision to Actions Jul 1, 2018 Image Captioning Language Modeling
— Unverified 0Exploring Semantic Relationships for Unpaired Image Captioning Jun 20, 2021 Image Captioning Sentence
— Unverified 0