VISREAS: Complex Visual Reasoning with Unanswerable Questions Feb 23, 2024 Question Answering Visual Question Answering
— Unverified 0Evaluating the Performance of ChatGPT for Spam Email Detection Feb 23, 2024 In-Context Learning Question Answering
— Unverified 0ArabianGPT: Native Arabic GPT-based Large Language Model Feb 23, 2024 Language Modeling Language Modelling
— Unverified 0Multimodal Transformer With a Low-Computational-Cost Guarantee Feb 23, 2024 Action Recognition Question Answering
— Unverified 0Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models Feb 23, 2024 In-Context Learning Knowledge Base Question Answering
Code Code Available 1Biomedical Entity Linking as Multiple Choice Question Answering Feb 23, 2024 Entity Linking Multiple-choice
Code Code Available 0Cost-Adaptive Recourse Recommendation by Adaptive Preference Elicitation Feb 23, 2024 Question Answering
— Unverified 0Faithful Temporal Question Answering over Heterogeneous Sources Feb 23, 2024 Question Answering
— Unverified 0SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials Feb 22, 2024 Chart Question Answering Language Modeling
Code Code Available 1Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education Feb 22, 2024 Question Answering Text Generation
Code Code Available 1CommVQA: Situating Visual Question Answering in Communicative Contexts Feb 22, 2024 Question Answering Visual Question Answering
Code Code Available 0Visual Hallucinations of Multi-modal Large Language Models Feb 22, 2024 Diversity Hallucination
Code Code Available 1Do LLMs Implicitly Determine the Suitable Text Difficulty for Users? Feb 22, 2024 Question Answering
Code Code Available 0Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer Feb 22, 2024 Generative Question Answering Hallucination
— Unverified 0Uncertainty-Aware Evaluation for Vision-Language Models Feb 22, 2024 Conformal Prediction Language Modeling
Code Code Available 1Data Science with LLMs and Interpretable Models Feb 22, 2024 Additive models Question Answering
Code Code Available 2Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond Feb 22, 2024 Form Medical Question Answering
— Unverified 0Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering Feb 22, 2024 Knowledge Base Question Answering Question Answering
Code Code Available 1ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents Feb 21, 2024 Active Learning Position
Code Code Available 2Learning to Poison Large Language Models for Downstream Manipulation Feb 21, 2024 Data Poisoning In-Context Learning
Code Code Available 1FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models Feb 21, 2024 Question Answering
Code Code Available 2Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language Models Feb 21, 2024 Memorization Question Answering
Code Code Available 0RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models Feb 21, 2024 Instruction Following Machine Translation
Code Code Available 0Towards Building Multilingual Language Model for Medicine Feb 21, 2024 Domain Adaptation Language Modeling
Code Code Available 3PQA: Zero-shot Protein Question Answering for Free-form Scientific Enquiry with Large Language Models Feb 21, 2024 Benchmarking Form
Code Code Available 0Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions Feb 21, 2024 Binary Classification Open-Domain Question Answering
— Unverified 0LLMs Meet Long Video: Advancing Long Video Question Answering with An Interactive Visual Adapter in LLMs Feb 21, 2024 Question Answering Video Question Answering
— Unverified 0Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment Feb 21, 2024 Language Modelling Question Answering
Code Code Available 1Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions Feb 20, 2024 Image Captioning Question Answering
— Unverified 0DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain Feb 20, 2024 named-entity-recognition Named Entity Recognition
Code Code Available 1Question Calibration and Multi-Hop Modeling for Temporal Question Answering Feb 20, 2024 Knowledge Graphs Multi-hop Question Answering
— Unverified 0BiMediX: Bilingual Medical Mixture of Experts LLM Feb 20, 2024 Mixture-of-Experts Multiple-choice
Code Code Available 1Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data Feb 20, 2024 Question Answering RAG
— Unverified 0Slot-VLM: SlowFast Slots for Video-Language Modeling Feb 20, 2024 Language Modeling Language Modelling
— Unverified 0FormulaReasoning: A Dataset for Formula-Based Numerical Reasoning Feb 20, 2024 Data Augmentation High School Physics
Code Code Available 0FinBen: A Holistic Financial Benchmark for Large Language Models Feb 20, 2024 Question Answering RAG
Code Code Available 4Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering Feb 20, 2024 Knowledge Graphs Question Answering
— Unverified 0VideoPrism: A Foundational Visual Encoder for Video Understanding Feb 20, 2024 Question Answering Video Question Answering
— Unverified 0Benchmarking Retrieval-Augmented Generation for Medicine Feb 20, 2024 Benchmarking Information Retrieval
Code Code Available 4RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning Feb 19, 2024 document understanding Medical Diagnosis
— Unverified 0Training Table Question Answering via SQL Query Decomposition Feb 19, 2024 Question Answering Semantic Parsing
— Unverified 0Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? Feb 19, 2024 Decision Making Memorization
Code Code Available 0TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness Feb 19, 2024 Fact Checking Question Answering
Code Code Available 0Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs Feb 19, 2024 Fact Checking Question Answering
— Unverified 0Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs Feb 19, 2024 Question Answering
Code Code Available 2Cofca: A Step-Wise Counterfactual Multi-hop QA benchmark Feb 19, 2024 counterfactual Multi-hop Question Answering
— Unverified 0Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models Feb 19, 2024 Image Captioning Question Answering
— Unverified 0BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence Feb 19, 2024 Question Answering Retrieval
— Unverified 0Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Feb 19, 2024 Information Retrieval Question Answering
— Unverified 0MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs Feb 19, 2024 Question Answering
Code Code Available 1