Hand1000: Generating Realistic Hands from Text with Only 1,000 Images Aug 28, 2024 Anatomy Gesture Recognition
— Unverified 0CLAIR: Evaluating Image Captions with Large Language Models Oct 19, 2023 Diversity Image Captioning
— Unverified 0Improving Image Captioning with Better Use of Caption Jul 1, 2020 Caption Generation Image Captioning
— Unverified 0Harvesting Information from Captions for Weakly Supervised Semantic Segmentation May 16, 2019 Image Captioning Image Segmentation
— Unverified 0Controllable Image Captioning via Prompting Dec 4, 2022 controllable image captioning Image Captioning
— Unverified 0Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder Oct 31, 2019 Decoder Image Captioning
— Unverified 0Aligning Attention Distribution to Information Flow for Hallucination Mitigation in Large Vision-Language Models May 20, 2025 Hallucination Image Captioning
— Unverified 0Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images Oct 16, 2024 Image Captioning Object
— Unverified 0Astrea: A MOE-based Visual Understanding Model with Progressive Alignment Mar 12, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Hierarchical LSTMs with Adaptive Attention for Visual Captioning Dec 26, 2018 Caption Generation Image Captioning
— Unverified 0Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image Captioning Dec 14, 2023 cross-modal alignment Decoder
— Unverified 0Hierarchical Prototype Learning for Zero-Shot Recognition Oct 24, 2019 Attribute Image Captioning
— Unverified 0Exploring External Knowledge for Accurate modeling of Visual and Language Problems Jan 27, 2023 Image Captioning Machine Translation
— Unverified 0Towards Explainable Neural-Symbolic Visual Reasoning Sep 19, 2019 Explainable artificial intelligence Explainable Artificial Intelligence (XAI)
— Unverified 0Exploring Explicit and Implicit Visual Relationships for Image Captioning May 6, 2021 Decoder Image Captioning
— Unverified 0HMGIE: Hierarchical and Multi-Grained Inconsistency Evaluation for Vision-Language Data Cleansing Dec 7, 2024 Answer Generation Graph Generation
— Unverified 0CIC: A Framework for Culturally-Aware Image Captioning Feb 8, 2024 Descriptive Image Captioning
— Unverified 0HorNet: A Hierarchical Offshoot Recurrent Network for Improving Person Re-ID via Image Captioning Aug 14, 2019 Generative Adversarial Network Image Captioning
— Unverified 0How Can Objects Help Video-Language Understanding? Apr 10, 2025 Image Captioning Object
— Unverified 0How Culturally Aware are Vision-Language Models? May 24, 2024 Image Captioning
— Unverified 0HOW IMPORTANT ARE NETWORK WEIGHTS? TO WHAT EXTENT DO THEY NEED AN UPDATE? Jan 1, 2020 Image Captioning
— Unverified 0Improving Diversity and Reducing Redundancy in Paragraph Captions Jul 19, 2020 Decoder Dense Captioning
— Unverified 0Chittron: An Automatic Bangla Image Captioning System Sep 2, 2018 Caption Generation Image Captioning
— Unverified 0Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models Feb 24, 2025 Hallucination Image Captioning
— Unverified 0How Vision Affects Language: Comparing Masked Self-Attention in Uni-Modal and Multi-Modal Transformer Jun 1, 2021 Image Captioning Machine Translation
— Unverified 0How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey Dec 11, 2024 Image Captioning Question Answering
— Unverified 0Cheap-fake Detection with LLM using Prompt Engineering Jun 5, 2023 Image Captioning Image Generation
— Unverified 0Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review Jun 28, 2024 Active Learning Image Captioning
— Unverified 0Can Machines Imitate Humans? Integrative Turing Tests for Vision and Language Demonstrate a Narrowing Gap Nov 23, 2022 Image Captioning object-detection
— Unverified 0Hyperparameter Analysis for Image Captioning Jun 19, 2020 Image Captioning Sensitivity
— Unverified 0AdaDARE-gamma: Balancing Stability and Plasticity in Multi-modal LLMs through Efficient Adaptation Jan 1, 2025 Image Captioning Question Answering
— Unverified 0I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation Mar 20, 2017 Caption Generation Data Augmentation
— Unverified 0Improving Generalization of Image Captioning with Unsupervised Prompt Learning Aug 5, 2023 Attribute Image Captioning
— Unverified 0Exploring and Distilling Cross-Modal Information for Image Captioning Feb 28, 2020 Attribute Decoder
— Unverified 0Explore and Tell: Embodied Visual Captioning in 3D Environments Aug 21, 2023 Image Captioning Navigate
— Unverified 0A Novel Technique for Evidence based Conditional Inference in Deep Neural Networks via Latent Feature Perturbation Nov 24, 2018 Image Captioning Instance Segmentation
— Unverified 0CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning Oct 27, 2023 Image Captioning
— Unverified 0IcoCap: Improving Video Captioning by Compounding Images Oct 5, 2023 Image Captioning Video Captioning
— Unverified 0Exploiting Pseudo Image Captions for Multimodal Summarization May 9, 2023 Common Sense Reasoning Contrastive Learning
— Unverified 0Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment May 20, 2023 Image Captioning Translation
— Unverified 0Assisting Scene Graph Generation with Self-Supervision Aug 8, 2020 Graph Generation Image Captioning
— Unverified 0IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer Aug 10, 2023 Image Captioning Machine Translation
— Unverified 0Exploiting Image–Text Synergy for Contextual Image Captioning Apr 1, 2021 Articles Image Captioning
— Unverified 0Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning Oct 22, 2021 Image Captioning Informativeness
— Unverified 0Explainable Image Captioning using CNN- CNN architecture and Hierarchical Attention Jun 28, 2024 Caption Generation Decoder
— Unverified 0Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models Nov 8, 2024 Image Captioning Image Generation
— Unverified 0Assessing Image Quality Issues for Real-World Problems Mar 27, 2020 Image Captioning Question Answering
— Unverified 0Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning Sep 20, 2023 Audio captioning Caption Generation
— Unverified 0Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks Jan 1, 2023 Cross-Modal Retrieval Image Captioning
— Unverified 0Improving cognitive diagnostics in pathology: a deep learning approach for augmenting perceptional understanding of histopathology images Mar 10, 2025 Diagnostic Image Captioning
— Unverified 0