SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs Apr 17, 2024 Question Answering Retrieval
Code Code Available 1Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering Apr 16, 2024 Information Retrieval Language Modeling
Code Code Available 1ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence Apr 16, 2024 Question Answering RAG
Code Code Available 1Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs Apr 15, 2024 Hallucination Language Modeling
Code Code Available 1TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition Apr 15, 2024 Natural Language Understanding Question Answering
Code Code Available 1TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding Apr 15, 2024 Question Answering Visual Question Answering (VQA)
Code Code Available 1CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting Apr 13, 2024 Hallucination Knowledge Graphs
Code Code Available 1Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts Apr 12, 2024 Image Captioning Question Answering
Code Code Available 1OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Apr 11, 2024 Bias Detection Fairness
Code Code Available 1CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering Apr 4, 2024 Language Modeling Language Modelling
Code Code Available 1Multi-Granularity Guided Fusion-in-Decoder Apr 3, 2024 Decoder Multi-Task Learning
Code Code Available 1CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems Apr 2, 2024 Form Long Form Question Answering
Code Code Available 1CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes Apr 1, 2024 Causal Discovery Causal Discovery in Video Reasoning
Code Code Available 1TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering Apr 1, 2024 Question Answering Video Question Answering
Code Code Available 1Linguistic Calibration of Long-Form Generations Mar 30, 2024 Decision Making Form
Code Code Available 1Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering Mar 28, 2024 Hallucination In-Context Learning
Code Code Available 1JDocQA: Japanese Document Question Answering Dataset for Generative Language Models Mar 28, 2024 Hallucination Question Answering
Code Code Available 1TriviaHG: A Dataset for Automatic Hint Generation from Factoid Questions Mar 27, 2024 Hint Generation Information Retrieval
Code Code Available 1Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective Mar 27, 2024 Question Answering Visual Question Answering
Code Code Available 1Non-Linear Inference Time Intervention: Improving LLM Truthfulness Mar 27, 2024 Large Language Model Multiple-choice
Code Code Available 1ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages Mar 26, 2024 Machine Reading Comprehension Optical Character Recognition (OCR)
Code Code Available 1ArabicaQA: A Comprehensive Dataset for Arabic Question Answering Mar 26, 2024 Benchmarking Machine Reading Comprehension
Code Code Available 1Attribute First, then Generate: Locally-attributable Grounded Text Generation Mar 25, 2024 Attribute Document Summarization
Code Code Available 1Language Repository for Long Video Understanding Mar 21, 2024 EgoSchema Question Answering
Code Code Available 1Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering Mar 21, 2024 object-detection Object Detection
Code Code Available 1NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens Mar 18, 2024 Benchmarking Question Answering
Code Code Available 1SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant Mar 17, 2024 Language Modelling Question Answering
Code Code Available 1Forward Learning of Graph Neural Networks Mar 16, 2024 Drug Discovery Graph Learning
Code Code Available 1ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning Mar 14, 2024 Chart Understanding Instruction Following
Code Code Available 1Can We Talk Models Into Seeing the World Differently? Mar 14, 2024 Image Captioning Image Classification
Code Code Available 1Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models Mar 14, 2024 Decoder image-classification
Code Code Available 1Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records Mar 14, 2024 Question Answering RAG
Code Code Available 1DAM: Dynamic Adapter Merging for Continual Video QA Learning Mar 13, 2024 Continual Learning image-classification
Code Code Available 1Beyond Memorization: The Challenge of Random Memory Access in Language Models Mar 12, 2024 Memorization Open-Domain Question Answering
Code Code Available 1Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs Mar 12, 2024 Knowledge Graphs Multiple-choice
Code Code Available 1ALaRM: Align Language Models via Hierarchical Rewards Modeling Mar 11, 2024 Long Form Question Answering Machine Translation
Code Code Available 1InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models Mar 11, 2024 Code Generation HumanEval
Code Code Available 1Calibrating Large Language Models Using Their Generations Only Mar 9, 2024 Question Answering Text Generation
Code Code Available 1Can't Remember Details in Long Documents? You Need Some R&R Mar 8, 2024 Question Answering
Code Code Available 1Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought Mar 8, 2024 Language Modeling Language Modelling
Code Code Available 1Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering Mar 8, 2024 Answer Generation Open-Domain Question Answering
Code Code Available 1To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering Mar 4, 2024 MedQA MMLU
Code Code Available 1Brilla AI: AI Contestant for the National Science and Maths Quiz Mar 4, 2024 Math Question Answering
Code Code Available 1CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge Mar 3, 2024 Claim Verification Graph Question Answering
Code Code Available 1Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark Feb 29, 2024 Question Answering
Code Code Available 1Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions Feb 28, 2024 Benchmarking Multiple-choice
Code Code Available 1Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension Feb 28, 2024 Language Modeling Language Modelling
Code Code Available 1Evaluating Very Long-Term Conversational Memory of LLM Agents Feb 27, 2024 Avg Dialogue Generation
Code Code Available 1NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents Feb 27, 2024 Document Classification Language Modeling
Code Code Available 1Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese Feb 27, 2024 General Knowledge Question Answering
Code Code Available 1