PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology Aug 13, 2024 Image Captioning
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention Jun 26, 2017 Image Captioning Saliency Prediction
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis Dec 4, 2024 Image Captioning Image Description
PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training Mar 9, 2025 Hallucination Image Captioning
phi-LSTM: A Phrase-based Hierarchical LSTM Model for Image Captioning Aug 20, 2016 Image Captioning Image Description
Phrase-based Image Captioning Feb 12, 2015 Descriptive Image Captioning
Phrase-based Image Captioning with Hierarchical LSTM Model Nov 11, 2017 Decoder Image Captioning
Physically Grounded Vision-Language Models for Robotic Manipulation Sep 5, 2023 Image Captioning Language Modelling
PICS: Pipeline for Image Captioning and Search Feb 1, 2024 Asset Management Image Captioning
Pixels to Prose: Understanding the art of Image Captioning Aug 28, 2024 Descriptive Image Captioning
Pointing Novel Objects in Image Captioning Apr 25, 2019 Decoder Image Captioning
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation Sep 10, 2024 Image Captioning Image Generation
Pragmatically Informative Image Captioning with Character-Level Inference Apr 15, 2018 Image Captioning Rolling Shutter Correction
Predicting Visual Futures with Image Captioning and Pre-Trained Language Models Jun 16, 2021 Image Captioning
Predicting Word Learning in Children from the Performance of Computer Vision Systems Jul 7, 2022 Image Captioning
Predictive linguistic cues for fake news: a societal artificial intelligence problem Nov 26, 2022 Attribute Image Captioning
Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning Sep 10, 2023 Denoising Diversity
PreSTU: Pre-Training for Scene-Text Understanding Sep 12, 2022 Decoder Image Captioning
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning Mar 15, 2023 Image Captioning
Probing Cross-modal Semantics Alignment Capability from the Textual Perspective Oct 18, 2022 Image Captioning Sentence
Progress-Aware Video Frame Captioning Dec 3, 2024 Image Captioning Video Captioning
Prompt-based Learning for Unpaired Image Captioning May 26, 2022 Image Captioning Image-text Retrieval
PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3 Jan 1, 2023 Image Captioning Question Answering
PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks Jan 30, 2023 Crowd Counting Data Augmentation
Prompt Tuning for Generative Multimodal Pretrained Models Aug 4, 2022 Image Captioning Visual Entailment
Prophet Attention: Predicting Attention with Future Attention Dec 1, 2020 Image Captioning
Prophet Attention: Predicting Attention with Future Attention for Image Captioning Oct 19, 2022 Image Captioning
PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension Dec 16, 2024 Benchmarking Image Captioning
Putting Humans in the Image Captioning Loop Jun 6, 2023 Image Captioning
Quality-agnostic Image Captioning to Safely Assist People with Vision Impairment Apr 28, 2023 Data Augmentation Image Captioning
Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval Oct 2, 2024 Image Captioning Retrieval
RadTex: Learning Efficient Radiograph Representations from Text Reports Aug 5, 2022 Classification Decoder
RAVEN: Multitask Retrieval Augmented Vision-Language Learning Jun 27, 2024 Image Captioning RAG
Reading Radiology Imaging Like The Radiologist Jul 12, 2023 Image Captioning Retrieval
Recurrent Fusion Network for Image Captioning Jul 26, 2018 Decoder Image Captioning
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering Dec 15, 2016 Decoder Image Captioning
Recurrent Models for Situation Recognition Mar 18, 2017 Grounded Situation Recognition Human-Object Interaction Detection
Recurrent Relational Memory Network for Unsupervised Image Captioning Jun 24, 2020 Computational Efficiency Image Captioning
Redemption Score: An Evaluation Framework to Rank Image Captions While Redeeming Image Semantics and Language Pragmatics May 22, 2025 Image Captioning text similarity
Re-evaluating Automatic Metrics for Image Captioning Dec 22, 2016 Image Captioning
RefineCap: Concept-Aware Refinement for Image Captioning Sep 8, 2021 Decoder Descriptive
Reflective Decoding Network for Image Captioning Aug 30, 2019 Decoder Image Captioning
Reinforcing an Image Caption Generator Using Off-Line Human Feedback Nov 21, 2019 Image Captioning Reinforcement Learning
Reinforcing Pre-trained Models Using Counterfactual Images Jun 19, 2024 Classification counterfactual
Relational Reasoning using Prior Knowledge for Visual Captioning Jun 4, 2019 Image Captioning object-detection
Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping Jan 7, 2022 Image Captioning Image Cropping
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes Jul 4, 2024 Image Captioning image-classification
Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization May 24, 2022 Image Captioning Out-of-Distribution Generalization
Rethinking the Form of Latent States in Image Captioning Jul 26, 2018 Caption Generation Form
VrR-VG: Refocusing Visually-Relevant Relationships Feb 1, 2019 Image Captioning Question Answering