Can you even tell left from right? Presenting a new challenge for VQA Mar 15, 2022 Question Answering Visual Question Answering
— Unverified 00 CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making Jun 15, 2025 Answer Generation Decision Making
— Unverified 00 CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns Oct 2, 2020 Image Captioning object-detection
— Unverified 00 Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment Jul 31, 2023 Action Recognition Blocking
— Unverified 00 CapWAP: Captioning with a Purpose Nov 9, 2020 Image Captioning Question Answering
— Unverified 00 CapWAP: Image Captioning with a Purpose Nov 1, 2020 Image Captioning Question Answering
— Unverified 00 Categorizing Concepts With Basic Level for Vision-to-Language Jun 1, 2018 Clustering Image Captioning
— Unverified 00 Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models Dec 9, 2023 Question Answering Visual Question Answering
— Unverified 00 Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering Oct 9, 2023 Answer Generation Question Answering
— Unverified 00 CAVL: Learning Contrastive and Adaptive Representations of Vision and Language Apr 10, 2023 Image Retrieval Phrase Grounding
— Unverified 00 CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs Dec 3, 2024 Image Captioning Quantization
— Unverified 00 Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness Jul 2, 2024 Image Captioning Question Answering
— Unverified 00 Chain of Reasoning for Visual Question Answering Dec 1, 2018 Object Question Answering
— Unverified 00 Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset Oct 8, 2020 Question Answering Visual Question Answering
— Unverified 00 Characterizing Misclassifications of Deep NLP Models Mar 12, 2021 named-entity-recognition Named Entity Recognition
— Unverified 00 Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations Sep 27, 2024 Chart Question Answering Question Answering
— Unverified 00 ChatBEV: A Visual Language Model that Understands BEV Maps Mar 18, 2025 Autonomous Driving Language Modeling
— Unverified 00 ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models Feb 27, 2025 Person Re-Identification Person Retrieval
— Unverified 00 ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla Oct 19, 2024 Question Answering Visual Question Answering
— Unverified 00 Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads Apr 30, 2021 Question Answering Visual Question Answering
— Unverified 00 CIC: A Framework for Culturally-Aware Image Captioning Feb 8, 2024 Descriptive Image Captioning
— Unverified 00 CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering Nov 19, 2022 Continual Learning Question Answering
— Unverified 00 CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments Mar 5, 2024 Language Modelling Large Language Model
— Unverified 00 CLiF-VQA: Enhancing Video Quality Assessment by Incorporating High-Level Semantic Information related to Human Feelings Nov 13, 2023 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment Mar 14, 2022 parameter-efficient fine-tuning Question Answering
— Unverified 00 CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks Jan 15, 2022 Question Answering Visual Commonsense Reasoning
— Unverified 00 CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering Jan 2, 2025 Multiple-choice Question Answering
— Unverified 00 CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering Mar 1, 2025 Continual Learning Language Modeling
— Unverified 00 CoBIT: A Contrastive Bi-directional Image-Text Generation Model Mar 23, 2023 Decoder Image Generation
— Unverified 00 COCO is "ALL'' You Need for Visual Instruction Fine-tuning Jan 17, 2024 All Image Captioning
— Unverified 00 COIN: Counterfactual Image Generation for VQA Interpretation Jan 10, 2022 counterfactual Image Generation
— Unverified 00 Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering Jan 22, 2025 Knowledge Graphs Question Answering
— Unverified 00 Evaluating and Improving Interactions with Hazy Oracles Oct 19, 2021 Object Tracking Referring Expression
— Unverified 00 ComicsPAP: understanding comic strips by picking the correct panel Mar 11, 2025 Image Captioning Visual Question Answering (VQA)
— Unverified 00 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Jan 9, 2025 Benchmarking Question Answering
— Unverified 00 Compact Tensor Pooling for Visual Question Answering Jun 20, 2017 Question Answering Visual Question Answering
— Unverified 00 Component Analysis for Visual Question Answering Architectures Feb 12, 2020 Question Answering Representation Learning
— Unverified 00 Compositional Attention Networks for Interpretability in Natural Language Question Answering Oct 30, 2018 Logical Reasoning Question Answering
— Unverified 00 Compositional Memory for Visual Question Answering Nov 18, 2015 Question Answering Visual Question Answering
— Unverified 00 Compound Tokens: Channel Fusion for Vision-Language Representation Learning Dec 2, 2022 Decoder Language Modeling
— Unverified 00 Compressing Visual-linguistic Model via Knowledge Distillation Apr 5, 2021 Image Captioning Knowledge Distillation
— Unverified 00 Connecting Language and Vision to Actions Jul 1, 2018 Image Captioning Language Modeling
— Unverified 00 Connecting phases of matter to the flatness of the loss landscape in analog variational quantum algorithms Jun 16, 2025 Visual Question Answering (VQA)
— Unverified 00 Convolutional Neural Networks for Video Quality Assessment Sep 26, 2018 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG Jun 3, 2025 Answer Generation RAG
— Unverified 00 Cost Function Dependent Barren Plateaus in Shallow Parametrized Quantum Circuits Jan 2, 2020 Visual Question Answering (VQA)
— Unverified 00 Counterfactual Vision and Language Learning Jun 1, 2020 counterfactual Question Answering
— Unverified 00 Co-VQA : Answering by Interactive Sub Question Sequence Nov 16, 2021 Question Answering Visual Question Answering
— Unverified 00 Co-VQA : Answering by Interactive Sub Question Sequence Apr 2, 2022 Question Answering Visual Question Answering
— Unverified 00 CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment May 21, 2025 Language Modeling Language Modelling
— Unverified 00