ManyModalQA: Modality Disambiguation and QA over Diverse Inputs Jan 22, 2020 Question Answering Transfer Learning
Code Code Available 15 Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture Nov 22, 2021 Handwritten Text Recognition object-detection
Code Code Available 15 Data Mining in Clinical Trial Text: Transformers for Classification and Question Answering Tasks Jan 30, 2020 Entity Extraction using GAN General Classification
Code Code Available 15 Map-based Modular Approach for Zero-shot Embodied Question Answering May 26, 2024 Embodied Question Answering Navigate
Code Code Available 15 Compositional Semantic Parsing on Semi-Structured Tables Aug 3, 2015 Question Answering Semantic Parsing
Code Code Available 15 MapQA: A Dataset for Question Answering on Choropleth Maps Nov 15, 2022 Articles Question Answering
Code Code Available 15 MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset Nov 1, 2021 Question Answering
Code Code Available 15 Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment Feb 21, 2024 Language Modelling Question Answering
Code Code Available 15 Making Retrieval-Augmented Language Models Robust to Irrelevant Context Oct 2, 2023 Language Modelling Natural Language Inference
Code Code Available 15 Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models Sep 5, 2024 Answer Generation Graph Question Answering
Code Code Available 15 Context Awareness Gate For Retrieval Augmented Generation Nov 25, 2024 Open-Domain Question Answering Question Answering
Code Code Available 15 Making Neural QA as Simple as Possible but not Simpler Mar 14, 2017 Question Answering Reading Comprehension
Code Code Available 15 MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding May 26, 2025 Question Answering Visual Question Answering
Code Code Available 15 DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization Jun 1, 2021 Question Answering Retrieval
Code Code Available 15 Declaration-based Prompt Tuning for Visual Question Answering May 5, 2022 Image-text matching Language Modeling
Code Code Available 15 MarkQA: A large scale KBQA dataset with numerical reasoning Oct 24, 2023 Question Answering
Code Code Available 15 DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations Oct 24, 2024 Instruction Following Question Answering
Code Code Available 15 Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder Jun 28, 2025 Image Segmentation Large Language Model
Code Code Available 15 CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph Jun 25, 2024 Knowledge Graph Completion Knowledge Graphs
Code Code Available 15 Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs Apr 15, 2024 Hallucination Language Modeling
Code Code Available 15 Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps Nov 2, 2020 Multi-hop Question Answering Question Answering
Code Code Available 15 Deep Learning Based Text Classification: A Comprehensive Review Apr 6, 2020 BIG-bench Machine Learning Classification
Code Code Available 15 MMXU: A Multi-Modal and Multi-X-ray Understanding Dataset for Disease Progression Feb 17, 2025 Diagnostic Question Answering
Code Code Available 15 Deep Multimodal Neural Architecture Search Apr 25, 2020 Decoder Image-text matching
Code Code Available 15 Learning Video Context as Interleaved Multimodal Sequences Jul 31, 2024 Language Modeling Language Modelling
Code Code Available 15 Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training Jan 1, 2023 3D dense captioning 3D visual grounding
Code Code Available 15 DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering May 2, 2020 Question Answering
Code Code Available 15 ComQA:Compositional Question Answering via Hierarchical Graph Neural Networks Jan 16, 2021 Answer Selection Machine Reading Comprehension
Code Code Available 15 Maintaining Reasoning Consistency in Compositional Visual Question Answering Jan 1, 2022 Question Answering Visual Question Answering
Code Code Available 15 ConceptBert: Concept-Aware Representation for Visual Question Answering Nov 1, 2020 Common Sense Reasoning Question Answering
Code Code Available 15 DELIFT: Data Efficient Language model Instruction Fine Tuning Nov 7, 2024 Language Modeling Language Modelling
Code Code Available 15 Consistency-preserving Visual Question Answering in Medical Imaging Jun 27, 2022 Question Answering Visual Question Answering
Code Code Available 15 Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts Feb 17, 2021 Caption Generation Diversity
Code Code Available 15 A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers May 7, 2021 Evidence Selection Question Answering
Code Code Available 15 Consistency Regularization for Cross-Lingual Fine-Tuning Jun 15, 2021 Machine Translation Question Answering
Code Code Available 15 Controllable Generation from Pre-trained Language Models via Inverse Prompting Mar 19, 2021 Language Modeling Language Modelling
Code Code Available 15 ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers Oct 13, 2021 Logical Reasoning Question Answering
Code Code Available 15 Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA May 13, 2020 Image Captioning Multi-Label Classification
Code Code Available 15 Densely Connected Attention Propagation for Reading Comprehension Nov 10, 2018 All Open-Domain Question Answering
Code Code Available 15 Dense Hierarchical Retrieval for Open-Domain Question Answering Oct 28, 2021 Open-Domain Question Answering Question Answering
Code Code Available 15 Code-Style In-Context Learning for Knowledge-Based Question Answering Sep 9, 2023 Code Generation In-Context Learning
Code Code Available 15 Conformal Language Modeling Jun 16, 2023 Conformal Prediction Language Modeling
Code Code Available 15 Context-Aware Answer Extraction in Question Answering Nov 5, 2020 Multi-Task Learning Prediction
Code Code Available 15 Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner May 19, 2023 Dense Captioning Image Captioning
Code Code Available 15 Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering May 2, 2020 Knowledge Graphs Language Modeling
Code Code Available 15 Connecting Vision and Language with Video Localized Narratives Feb 22, 2023 Question Answering Video Narrative Grounding
Code Code Available 15 Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering Apr 15, 2021 Open-Domain Question Answering Question Answering
Code Code Available 15 Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning Jun 1, 2023 image-classification Image Classification
Code Code Available 15 Detecting Hate Speech in Multi-modal Memes Dec 29, 2020 Binary Classification Hate Speech Detection
Code Code Available 15 MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Oct 3, 2024 Code Generation Dialogue Generation
Code Code Available 15