Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness Jul 2, 2024 Image Captioning Question Answering
— Unverified 00 Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks Nov 24, 2024 Image Captioning Natural Language Understanding
— Unverified 00 Challenges in Region-Specific Image Captioning: A Deep Learning Approach Nov 16, 2021 Deep Learning Image Captioning
— Unverified 00 Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity Jun 28, 2023 Benchmarking Image Captioning
— Unverified 00 CHAM: action recognition using convolutional hierarchical attention model May 9, 2017 Action Recognition Image Captioning
— Unverified 00 Cheap-fake Detection with LLM using Prompt Engineering Jun 5, 2023 Image Captioning Image Generation
— Unverified 00 Chittron: An Automatic Bangla Image Captioning System Sep 2, 2018 Caption Generation Image Captioning
— Unverified 00 CIC: A Framework for Culturally-Aware Image Captioning Feb 8, 2024 Descriptive Image Captioning
— Unverified 00 CLAIR: Evaluating Image Captions with Large Language Models Oct 19, 2023 Diversity Image Captioning
— Unverified 00 CLAMP: Contrastive LAnguage Model Prompt-tuning Dec 4, 2023 Contrastive Learning Image Captioning
— Unverified 00 CLIP-SCGI: Synthesized Caption-Guided Inversion for Person Re-Identification Oct 12, 2024 Image Captioning Person Re-Identification
— Unverified 00 Clue: Cross-modal Coherence Modeling for Caption Generation May 2, 2020 Caption Generation controllable image captioning
— Unverified 00 COCO is "ALL'' You Need for Visual Instruction Fine-tuning Jan 17, 2024 All Image Captioning
— Unverified 00 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Feb 4, 2025 Image Captioning Panoptic Segmentation
— Unverified 00 Cold Fusion: Training Seq2Seq Models Together with Language Models Aug 21, 2017 Image Captioning Language Modeling
— Unverified 00 Combine to Describe: Evaluating Compositional Generalization in Image Captioning May 1, 2022 Image Captioning
— Unverified 00 ComicsPAP: understanding comic strips by picking the correct panel Mar 11, 2025 Image Captioning Visual Question Answering (VQA)
— Unverified 00 Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning Mar 5, 2023 Image Captioning
— Unverified 00 Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets Jul 14, 2020 Image Captioning Retrieval
— Unverified 00 Comparing Recurrent and Convolutional Architectures for English-Hindi Neural Machine Translation Nov 1, 2017 Decoder Image Captioning
— Unverified 00 Competence-based Multimodal Curriculum Learning for Medical Report Generation Jun 24, 2022 Image Captioning Medical Report Generation
— Unverified 00 Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data Dec 11, 2023 Image Captioning Image-text Retrieval
— Unverified 00 Compressed Image Captioning using CNN-based Encoder-Decoder Framework Apr 28, 2024 Decoder Image Captioning
— Unverified 00 Compressing Visual-linguistic Model via Knowledge Distillation Apr 5, 2021 Image Captioning Knowledge Distillation
— Unverified 00 Trust It or Not: Confidence-Guided Automatic Radiology Report Generation Jun 21, 2021 Decision Making Image Captioning
— Unverified 00 Connecting Language and Vision to Actions Jul 1, 2018 Image Captioning Language Modeling
— Unverified 00 Consensus Graph Representation Learning for Better Grounded Image Captioning Dec 2, 2021 Graph Representation Learning Hallucination
— Unverified 00 Consistency Model is an Effective Posterior Sample Approximation for Diffusion Inverse Solvers Feb 9, 2024 Image Captioning Semantic Segmentation
— Unverified 00 Consistent Multiple Sequence Decoding Apr 2, 2020 Decoder Diversity
— Unverified 00 Context-Aware Group Captioning via Self-Attention and Contrastive Features Apr 7, 2020 Image Captioning
— Unverified 00 Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity Mar 31, 2025 Image Captioning Optical Character Recognition
— Unverified 00 Contextual Emotion Estimation from Image Captions Sep 22, 2023 Image Captioning Language Modelling
— Unverified 00 Contextual Emotion Recognition using Large Vision Language Models May 14, 2024 Decision Making Emotion Recognition
— Unverified 00 Contextualized Keyword Representations for Multi-modal Retinal Image Captioning Apr 26, 2021 Avg Image Captioning
— Unverified 00 Contextual Memory Trees Jul 17, 2018 General Classification Image Captioning
— Unverified 00 Continuous multilinguality with language vectors Apr 1, 2017 Image Captioning Language Modeling
— Unverified 00 Contrastive Learning for Image Captioning Oct 6, 2017 Contrastive Learning Image Captioning
— Unverified 00 Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder Jun 29, 2021 Image Captioning Representation Learning
— Unverified 00 Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations Mar 14, 2022 Image Captioning Semantic Textual Similarity
— Unverified 00 Control Image Captioning Spatially and Temporally Aug 1, 2021 Contrastive Learning Image Captioning
— Unverified 00 Controllable Image Captioning Apr 28, 2022 controllable image captioning Decoder
— Unverified 00 Controllable Image Captioning via Prompting Dec 4, 2022 controllable image captioning Image Captioning
— Unverified 00 Controlled Caption Generation for Images Through Adversarial Attacks Jul 7, 2021 Caption Generation Image Captioning
— Unverified 00 Convolutional Prototype Learning for Zero-Shot Recognition Oct 22, 2019 Attribute Image Captioning
— Unverified 00 Cooperative image captioning Jul 26, 2019 Image Captioning
— Unverified 00 Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles May 1, 2012 Articles Image Captioning
— Unverified 00 CPTR: Full Transformer Network for Image Captioning Jan 26, 2021 Decoder Image Captioning
— Unverified 00 CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning Oct 27, 2023 Image Captioning
— Unverified 00 Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment May 20, 2023 Image Captioning Translation
— Unverified 00 CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models May 22, 2024 Benchmarking Hallucination
— Unverified 00