Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies Jan 6, 2021 Question Answering StrategyQA
Code Code Available 15 Multimodal fusion of imaging and genomics for lung cancer recurrence prediction Feb 5, 2020 Computed Tomography (CT) Question Answering
Code Code Available 15 Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications Feb 1, 2023 Question Answering Representation Learning
Code Code Available 15 Consistency-preserving Visual Question Answering in Medical Imaging Jun 27, 2022 Question Answering Visual Question Answering
Code Code Available 15 Consistency Regularization for Cross-Lingual Fine-Tuning Jun 15, 2021 Machine Translation Question Answering
Code Code Available 15 MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding Apr 26, 2021 Generalized Referring Expression Comprehension Phrase Grounding
Code Code Available 15 Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise May 2, 2023 counterfactual Few-Shot Learning
Code Code Available 15 Contextualized Sparse Representations for Real-Time Open-Domain Question Answering Nov 7, 2019 Information Retrieval Open-Domain Question Answering
Code Code Available 15 Learning Video Context as Interleaved Multimodal Sequences Jul 31, 2024 Language Modeling Language Modelling
Code Code Available 15 Discourse Analysis via Questions and Answers: Parsing Dependency Structures of Questions Under Discussion Oct 12, 2022 Dependency Parsing Question Answering
Code Code Available 15 Discovering Spatio-Temporal Rationales for Video Question Answering Jul 22, 2023 Question Answering Video Question Answering
Code Code Available 15 ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax Mar 2, 2023 Descriptive Image Captioning
Code Code Available 15 Multi-Relational Embedding for Knowledge Graph Representation and Analysis Sep 28, 2020 Computational Efficiency Graph Embedding
Code Code Available 15 Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps Nov 2, 2020 Multi-hop Question Answering Question Answering
Code Code Available 15 Disentangling 3D Prototypical Networks For Few-Shot Concept Learning Nov 6, 2020 3D geometry 3D Object Detection
Code Code Available 15 MultiSpanQA: A Dataset for Multi-Span Question Answering Jul 1, 2022 Natural Questions Question Answering
Code Code Available 15 DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering Nov 10, 2022 counterfactual Data Augmentation
Code Code Available 15 Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs Apr 15, 2024 Hallucination Language Modeling
Code Code Available 15 Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation Nov 1, 2021 Open-Domain Question Answering Question Answering
Code Code Available 15 Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence Annotation Oct 10, 2021 Open-Domain Question Answering Question Answering
Code Code Available 15 Distilled Dual-Encoder Model for Vision-Language Understanding Dec 16, 2021 Image to text model
Code Code Available 15 Distilling Knowledge from Reader to Retriever for Question Answering Dec 8, 2020 Information Retrieval Knowledge Distillation
Code Code Available 15 Contrast and Classify: Training Robust VQA Models Oct 13, 2020 Contrastive Learning Data Augmentation
Code Code Available 15 Does Vision-and-Language Pretraining Improve Lexical Grounding? Sep 21, 2021 Question Answering Visual Question Answering
Code Code Available 15 A Survey of Medical Vision-and-Language Applications and Their Techniques Nov 19, 2024 Decision Making Diagnostic
Code Code Available 15 Diversify Question Generation with Retrieval-Augmented Style Transfer Oct 23, 2023 Diversity Question Answering
Code Code Available 15 Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training Jan 1, 2023 3D dense captioning 3D visual grounding
Code Code Available 15 Context-Aware Answer Extraction in Question Answering Nov 5, 2020 Multi-Task Learning Prediction
Code Code Available 15 Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA May 13, 2020 Image Captioning Multi-Label Classification
Code Code Available 15 Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Sep 19, 2023 Instruction Following Language Modeling
Code Code Available 15 Code-Style In-Context Learning for Knowledge-Based Question Answering Sep 9, 2023 Code Generation In-Context Learning
Code Code Available 15 DocNLI: A Large-scale Dataset for Document-level Natural Language Inference Jun 17, 2021 Natural Language Inference Question Answering
Code Code Available 15 Hierarchical multimodal transformers for Multi-Page DocVQA Dec 7, 2022 Decoder Question Answering
Code Code Available 15 MedChatZH: a Better Medical Adviser Learns from Better Instructions Sep 3, 2023 Question Answering
Code Code Available 15 MemeCap: A Dataset for Captioning and Interpreting Memes May 23, 2023 Image Captioning Meme Captioning
Code Code Available 15 DocVQA: A Dataset for VQA on Document Images Jul 1, 2020 Question Answering Reading Comprehension
Code Code Available 15 Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression May 22, 2025 Hallucination Image Description
Code Code Available 15 OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Apr 11, 2024 Bias Detection Fairness
Code Code Available 15 Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge Jan 19, 2024 Question Answering Question Generation
Code Code Available 15 SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models Oct 12, 2022 Object Question Answering
Code Code Available 15 Ask Me Anything: Dynamic Memory Networks for Natural Language Processing Jun 24, 2015 General Classification Part-Of-Speech Tagging
Code Code Available 05 MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning Feb 27, 2024 8k Language Modeling
Code Code Available 05 CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense Jun 1, 2019 Common Sense Reasoning Question Answering
Code Code Available 05 Matching Article Pairs with Graphical Decomposition and Convolutions Feb 21, 2018 Articles document understanding
Code Code Available 05 MatchZoo: A Learning, Practicing, and Developing System for Neural Text Matching May 24, 2019 Information Retrieval Question Answering
Code Code Available 05 Alloprof: a new French question-answer education dataset and its use in an information retrieval case study Feb 10, 2023 Information Retrieval Question Answering
Code Code Available 05 MatchZoo: A Toolkit for Deep Text Matching Jul 23, 2017 Ad-Hoc Information Retrieval Information Retrieval
Code Code Available 05 Co-attending Regions and Detections with Multi-modal Multiplicative Embedding for VQA Nov 18, 2017 Form Question Answering
Code Code Available 05 Masking Orchestration: Multi-task Pretraining for Multi-role Dialogue Representation Learning Feb 27, 2020 Dialogue Understanding Question Answering
Code Code Available 05 Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding Mar 18, 2025 document understanding Question Answering
Code Code Available 05