Evaluating the Retrieval Component in LLM-Based Question Answering Systems Jun 10, 2024 Information Retrieval Question Answering
— Unverified 00 Evaluating the Representational Hub of Language and Vision Models Apr 12, 2019 Diagnostic Question Answering
— Unverified 00 Can Transformers Reason About Effects of Actions? Dec 17, 2020 Common Sense Reasoning Question Answering
— Unverified 00 Evaluating the Performance of ChatGPT for Spam Email Detection Feb 23, 2024 In-Context Learning Question Answering
— Unverified 00 Evaluating the Performance and Robustness of LLMs in Materials Science Q&A and Property Predictions Sep 22, 2024 Band Gap In-Context Learning
— Unverified 00 Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Apr 4, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 00 Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering Feb 14, 2025 Mathematical Reasoning Object
— Unverified 00 Evaluating the Effect of Retrieval Augmentation on Social Biases Feb 24, 2025 Large Language Model Question Answering
— Unverified 00 Do Question Answering Modeling Improvements Hold Across Benchmarks? Feb 1, 2021 Question Answering
— Unverified 00 Can SAR improve RSVQA performance? Aug 28, 2024 Question Answering Visual Question Answering
— Unverified 00 Evaluating the Capabilities of Multi-modal Reasoning Models with Synthetic Task Data Jun 1, 2023 Anomaly Detection Image Generation
— Unverified 00 Evaluating Text Segmentation using Boundary Edit Distance Aug 1, 2013 Information Retrieval Question Answering
— Unverified 00 ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering May 11, 2024 Chart Question Answering Question Answering
— Unverified 00 Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis Apr 17, 2024 Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)
— Unverified 00 Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models Oct 17, 2024 Language Modelling Large Language Model
— Unverified 00 Evaluating Recognizing Question Entailment Methods for a Portuguese Community Question-Answering System about Diabetes Mellitus Sep 1, 2021 Community Question Answering Information Retrieval
— Unverified 00 Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap Nov 1, 2021 Data Augmentation Question Answering
— Unverified 00 A Procedural Definition of Multi-word Lexical Units Sep 1, 2015 Machine Translation Question Answering
— Unverified 00 AiFu at SemEval-2019 Task 10: A Symbolic and Sub-symbolic Integrated System for SAT Math Question Answering Jun 1, 2019 Math Question Answering
— Unverified 00 Abductive Matching in Question Answering Sep 10, 2017 BIG-bench Machine Learning Question Answering
— Unverified 00 Evaluating Question Answering Evaluation Nov 1, 2019 Answer Generation Multiple-choice
— Unverified 00 Can Pre-training help VQA with Lexical Variations? Nov 1, 2020 Question Answering Visual Question Answering
— Unverified 00 A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images Oct 1, 2018 Coreference Resolution Image Classification
— Unverified 00 Evaluating Multi-focus Natural Language Queries over Data Services May 1, 2012 Natural Language Queries Question Answering
— Unverified 00 Can predicate-argument relationships be extracted from UD trees? Nov 1, 2021 Question Answering Semantic Role Labeling
— Unverified 00 A Probabilistic-Logic based Commonsense Representation Framework for Modelling Inferences with Multiple Antecedents and Varying Likelihoods Nov 30, 2022 Knowledge Graphs Question Answering
— Unverified 00 AIDA: Artificial Intelligent Dialogue Agent Aug 1, 2013 Dialogue Management Question Answering
— Unverified 00 Evaluating Machine Reading Systems through Comprehension Tests May 1, 2012 Answer Selection Multiple-choice
— Unverified 00 Can Open Domain Question Answering Systems Answer Visual Knowledge Questions? Feb 9, 2022 Open-Domain Question Answering Question Answering
— Unverified 00 Evaluating Machine Common Sense via Cloze Testing Jan 19, 2022 Common Sense Reasoning Open-Ended Question Answering
— Unverified 00 Evaluating LLMs on Document-Based QA: Exact Answer Selection and Numerical Extraction using Cogtale dataset Nov 14, 2023 Answer Selection Information Retrieval
— Unverified 00 A Probabilistic Lexical Model for Ranking Textual Inferences Jul 1, 2012 model Natural Language Inference
— Unverified 00 Evaluating LLMs Capabilities Towards Understanding Social Dynamics Nov 20, 2024 Prompt Engineering Question Answering
— Unverified 00 A Probabilistic Annotation Model for Crowdsourcing Coreference Oct 1, 2018 Coreference Resolution model
— Unverified 00 AIA-BDE: A Corpus of FAQs in Portuguese and their Variations May 1, 2020 Information Retrieval Natural Language Inference
— Unverified 00 Can Multimodal LLMs do Visual Temporal Understanding and Reasoning? The answer is No! Jan 18, 2025 Multiple-choice Question Answering
— Unverified 00 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness Apr 7, 2025 Knowledge Graphs Language Modeling
— Unverified 00 A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base Nov 1, 2021 Knowledge Base Question Answering Question Answering
— Unverified 00 Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis Mar 18, 2024 In-Context Learning Question Answering
— Unverified 00 Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent Dec 7, 2024 Hallucination Question Answering
— Unverified 00 Evaluating Feature Extraction Methods for Knowledge-based Biomedical Word Sense Disambiguation Aug 1, 2017 Dimensionality Reduction Information Retrieval
— Unverified 00 Can LLMs assist with Ambiguity? A Quantitative Evaluation of various Large Language Models on Word Sense Disambiguation Nov 27, 2024 Information Retrieval Part-Of-Speech Tagging
— Unverified 00 A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? Sep 23, 2024 Hallucination MedQA
— Unverified 00 Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios Feb 27, 2025 Data Integration Question Answering
— Unverified 00 Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering Oct 20, 2024 Language Modelling Large Language Model
— Unverified 00 Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? May 27, 2024 Question Answering
— Unverified 00 A Practical Entity Linking System for Tables in Scientific Literature Jun 12, 2023 Entity Linking Knowledge Graphs
— Unverified 00 A Practical 2-step Approach to Assist Enterprise Question-Answering Live Chat Jul 1, 2021 Community Question Answering Question Answering
— Unverified 00 Actively Seeking and Learning from Live Data Apr 5, 2019 Domain Adaptation Meta-Learning
— Unverified 00 Who Taught You That? Tracing Teachers in Model Distillation Feb 10, 2025 Instruction Following POS
— Unverified 00