CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes Apr 12, 2023 Question Answering Visual Question Answering
Code Code Available 15 Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education Jul 15, 2024 graph construction Knowledge Graphs
Code Code Available 15 Greedy Gradient Ensemble for Robust Visual Question Answering Jul 27, 2021 Question Answering Visual Question Answering
Code Code Available 15 CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical Reasoning Aug 10, 2022 Math Mathematical Reasoning
Code Code Available 15 Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective Oct 23, 2024 graph construction Knowledge Graphs
Code Code Available 15 Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment Feb 21, 2024 Language Modelling Question Answering
Code Code Available 15 Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation Dec 22, 2021 Common Sense Reasoning Question Answering
Code Code Available 15 Graph Optimal Transport for Cross-Domain Alignment Jun 26, 2020 Graph Matching Image Captioning
Code Code Available 15 Classification-Regression for Chart Comprehension Nov 29, 2021 Chart Question Answering Classification
Code Code Available 15 A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports Sep 3, 2020 Image-text Retrieval Medical Visual Question Answering
Code Code Available 15 CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning Dec 20, 2016 Diagnostic Question Answering
Code Code Available 15 GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question Answering Dec 2, 2024 Question Answering
Code Code Available 15 GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants Feb 12, 2024 Code Generation Management
Code Code Available 15 HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Dec 18, 2023 Question Answering Visual Question Answering
Code Code Available 15 HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation Aug 15, 2021 Descriptive Entity Alignment
Code Code Available 15 Citekit: A Modular Toolkit for Large Language Model Citation Generation Aug 6, 2024 Language Modeling Language Modelling
Code Code Available 15 CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space Feb 18, 2025 Embodied Question Answering Question Answering
Code Code Available 15 Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering Oct 6, 2022 Entity Embeddings Graph Neural Network
Code Code Available 15 ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Mar 26, 2024 Machine Reading Comprehension Optical Character Recognition (OCR)
Code Code Available 15 T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack Dec 22, 2019 Adversarial Attack Adversarial Text
Code Code Available 15 CKBP v2: Better Annotation and Reasoning for Commonsense Knowledge Base Population Apr 20, 2023 Knowledge Base Population Question Answering
Code Code Available 15 Graph Attention Networks Oct 30, 2017 Document Classification Graph Attention
Code Code Available 15 ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding Aug 5, 2022 Image Retrieval Question Answering
Code Code Available 15 Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining Dec 3, 2024 backdoor defense Computational Efficiency
Code Code Available 15 ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models Feb 27, 2025 Question Answering RAG
Code Code Available 15 3D-Aware Visual Question Answering about Parts, Poses and Occlusions Oct 27, 2023 Question Answering Visual Question Answering
Code Code Available 15 CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems Apr 2, 2024 Form Long Form Question Answering
Code Code Available 15 GraghVQA: Language-Guided Graph Neural Networks for Graph-based Visual Question Answering Apr 20, 2021 Graph Neural Network Graph Question Answering
Code Code Available 15 ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification Apr 29, 2025 Diagnostic Question Answering
Code Code Available 15 Check It Again: Progressive Visual Question Answering via Visual Entailment Jun 8, 2021 Question Answering Visual Entailment
Code Code Available 15 DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models Aug 4, 2024 Diagnostic Medical Question Answering
Code Code Available 15 Check It Again:Progressive Visual Question Answering via Visual Entailment Aug 1, 2021 Question Answering Visual Entailment
Code Code Available 15 ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model Feb 20, 2025 Mixture-of-Experts Question Answering
Code Code Available 15 ClarQ: A large-scale and diverse dataset for Clarification Question Generation Jun 10, 2020 Question Answering Question Generation
Code Code Available 15 A Comparative Study of Pretrained Language Models for Long Clinical Text Jan 27, 2023 Clinical Knowledge Document Classification
Code Code Available 15 ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences Nov 10, 2023 Dialogue Generation Language Modeling
Code Code Available 15 Good Questions Help Zero-Shot Image Reasoning Dec 4, 2023 Fine-Grained Image Classification Question Answering
Code Code Available 15 GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering Feb 25, 2019 Question Answering Visual Question Answering (VQA)
Code Code Available 15 Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering Jul 13, 2021 Navigate Question Answering
Code Code Available 15 An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA Sep 10, 2021 Image Captioning Question Answering
Code Code Available 15 GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning Apr 2, 2025 Decision Making Diagnostic
Code Code Available 15 An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling Sep 4, 2022 Fill Mask Optical Flow Estimation
Code Code Available 15 ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning Mar 14, 2024 Chart Understanding Instruction Following
Code Code Available 15 ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering Apr 7, 2025 Chart Question Answering Chart Understanding
Code Code Available 15 CharBERT: Character-aware Pre-trained Language Model Nov 3, 2020 Language Modeling Language Modelling
Code Code Available 15 Advancing High Resolution Vision-Language Models in Biomedicine Jun 12, 2024 Language Modeling Language Modelling
Code Code Available 15 Glance and Focus: Memory Prompting for Multi-Event Video Question Answering Jan 3, 2024 Action Detection Human-Object Interaction Detection
Code Code Available 15 CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds Dec 7, 2024 Question Answering
Code Code Available 15 Change Detection Meets Visual Question Answering Dec 12, 2021 Answer Generation Change Detection
Code Code Available 15 Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension Feb 28, 2024 Language Modeling Language Modelling
Code Code Available 15