Paper | Date | Tasks | Status
ArabianGPT: Native Arabic GPT-based Large Language Model | Feb 23, 2024 | Language Modeling, Language Modelling | Unverified
Evaluating the Performance of ChatGPT for Spam Email Detection | Feb 23, 2024 | In-Context Learning, Question Answering | Unverified
VISREAS: Complex Visual Reasoning with Unanswerable Questions | Feb 23, 2024 | Question Answering, Visual Question Answering | Unverified
Biomedical Entity Linking as Multiple Choice Question Answering | Feb 23, 2024 | Entity Linking, Multiple-choice | Code Available
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures | Feb 23, 2024 | Question Answering, Text Generation | Unverified
Multimodal Transformer With a Low-Computational-Cost Guarantee | Feb 23, 2024 | Action Recognition, Question Answering | Unverified
Cost-Adaptive Recourse Recommendation by Adaptive Preference Elicitation | Feb 23, 2024 | Question Answering | Unverified
Do LLMs Implicitly Determine the Suitable Text Difficulty for Users? | Feb 22, 2024 | Question Answering | Code Available
Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer | Feb 22, 2024 | Generative Question Answering, Hallucination | Unverified
CommVQA: Situating Visual Question Answering in Communicative Contexts | Feb 22, 2024 | Question Answering, Visual Question Answering | Code Available
Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond | Feb 22, 2024 | Form, Medical Question Answering | Unverified
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions | Feb 21, 2024 | Binary Classification, Open-Domain Question Answering | Unverified
LLMs Meet Long Video: Advancing Long Video Question Answering with An Interactive Visual Adapter in LLMs | Feb 21, 2024 | Question Answering, Video Question Answering | Unverified
RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models | Feb 21, 2024 | Instruction Following, Machine Translation | Code Available
PQA: Zero-shot Protein Question Answering for Free-form Scientific Enquiry with Large Language Models | Feb 21, 2024 | Benchmarking, Form | Code Available
Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language Models | Feb 21, 2024 | Memorization, Question Answering | Code Available
FormulaReasoning: A Dataset for Formula-Based Numerical Reasoning | Feb 20, 2024 | Data Augmentation, High School Physics | Code Available
Question Calibration and Multi-Hop Modeling for Temporal Question Answering | Feb 20, 2024 | Knowledge Graphs, Multi-hop Question Answering | Unverified
Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data | Feb 20, 2024 | Question Answering, RAG | Unverified
Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions | Feb 20, 2024 | Image Captioning, Question Answering | Unverified
VideoPrism: A Foundational Visual Encoder for Video Understanding | Feb 20, 2024 | Question Answering, Video Question Answering | Unverified
Slot-VLM: SlowFast Slots for Video-Language Modeling | Feb 20, 2024 | Language Modeling, Language Modelling | Unverified
Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering | Feb 20, 2024 | Knowledge Graphs, Question Answering | Unverified
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness | Feb 19, 2024 | Fact Checking, Question Answering | Code Available
Cofca: A Step-Wise Counterfactual Multi-hop QA benchmark | Feb 19, 2024 | counterfactual, Multi-hop Question Answering | Unverified
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? | Feb 19, 2024 | Decision Making, Memorization | Code Available
Training Table Question Answering via SQL Query Decomposition | Feb 19, 2024 | Question Answering, Semantic Parsing | Unverified
RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning | Feb 19, 2024 | document understanding, Medical Diagnosis | Unverified
BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence | Feb 19, 2024 | Question Answering, Retrieval | Unverified
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models | Feb 19, 2024 | Image Captioning, Question Answering | Unverified
Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs | Feb 19, 2024 | Fact Checking, Question Answering | Unverified
Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge | Feb 19, 2024 | Information Retrieval, Question Answering | Unverified
Question Answering Over Spatio-Temporal Knowledge Graph | Feb 18, 2024 | Graph Question Answering, Knowledge Graphs | Unverified
Large Language Models Can Better Understand Knowledge Graphs Than We Thought | Feb 18, 2024 | Knowledge Graphs, Prompt Engineering | Unverified
CliqueParcel: An Approach For Batching LLM Prompts That Jointly Optimizes Efficiency And Faithfulness | Feb 17, 2024 | Question Answering, Reading Comprehension | Unverified
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction | Feb 17, 2024 | Question Answering, Transfer Learning | Unverified
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering | Feb 17, 2024 | Arithmetic Reasoning, Mathematical Reasoning | Unverified
GenDec: A robust generative Question-decomposition method for Multi-hop reasoning | Feb 17, 2024 | Multi-hop Question Answering, Question Answering | Unverified
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL | Feb 16, 2024 | Open-Domain Question Answering, Question Answering | Code Available
Exploring Hybrid Question Answering via Program-based Prompting | Feb 16, 2024 | Code Generation, Question Answering | Unverified
Where is the answer? Investigating Positional Bias in Language Model Knowledge Extraction | Feb 16, 2024 | Denoising, Language Modeling | Code Available
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering | Feb 16, 2024 | Open-Domain Question Answering, Question Answering | Unverified
Assessing biomedical knowledge robustness in large language models by query-efficient sampling attacks | Feb 16, 2024 | Distractor Generation, Question Answering | Unverified
Inference to the Best Explanation in Large Language Models | Feb 16, 2024 | Question Answering | Unverified
PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter | Feb 16, 2024 | Language Modeling, Language Modelling | Unverified
Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering | Feb 16, 2024 | Language Modeling, Language Modelling | Code Available
II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering | Feb 16, 2024 | Question Answering, Triplet | Code Available
Construction of a Syntactic Analysis Map for Yi Shui School through Text Mining and Natural Language Processing Research | Feb 16, 2024 | graph construction, Information Retrieval | Unverified
VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models | Feb 16, 2024 | Adversarial Robustness, Language Modelling | Unverified
PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering | Feb 16, 2024 | Question Answering, RAG | Unverified