Investigating Data Contamination in Modern Benchmarks for Large Language Models Nov 16, 2023 Common Sense Reasoning MMLU
— Unverified 00 Question-to-Question Retrieval for Hallucination-Free Knowledge Access: An Approach for Wikipedia and Wikidata Question Answering Jan 20, 2025 Answer Generation Computational Efficiency
— Unverified 00 CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart Oct 28, 2024 Question Answering
— Unverified 00 Quick and (not so) Dirty: Unsupervised Selection of Justification Sentences for Multi-hop Question Answering Nov 17, 2019 ARC Information Retrieval
— Unverified 00 Investigating Biases in Textual Entailment Datasets Jun 23, 2019 BIG-bench Machine Learning Natural Language Inference
— Unverified 00 Investigating Answerability of LLMs for Long-Form Question Answering Sep 15, 2023 Form Long Form Question Answering
— Unverified 00 CS-VQA: Visual Question Answering with Compressively Sensed Images Jun 8, 2018 Question Answering Visual Question Answering
— Unverified 00 QUINT: Interpretable Question Answering over Knowledge Bases Sep 1, 2017 Named Entity Recognition (NER) Question Answering
— Unverified 00 Automated assessment of knowledge hierarchy evolution: comparing directed acyclic graphs Jun 1, 2019 Knowledge Graph Completion Knowledge Graphs
— Unverified 00 An Augmented Benchmark Dataset for Geometric Question Answering through Dual Parallel Text Encoding Oct 1, 2022 Data Augmentation Math
— Unverified 00 Adopting the Word-Pair-Dependency-Triplets with Individual Comparison for Natural Language Inference Aug 1, 2018 Decision Making Machine Translation
— Unverified 00 Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation Jun 8, 2024 Abstractive Text Summarization Dialogue Generation
— Unverified 00 Inverse Visual Question Answering with Multi-Level Attentions Sep 17, 2019 Question Answering Visual Question Answering
— Unverified 00 QurAna: Corpus of the Quran annotated with Pronominal Anaphora May 1, 2012 Coreference Resolution Information Retrieval
— Unverified 00 Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool Mar 16, 2018 Question Answering Reinforcement Learning
— Unverified 00 Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation Nov 11, 2024 Hallucination Information Retrieval
— Unverified 00 Automated Answer Validation using Text Similarity Jan 13, 2024 Information Retrieval Multiple-choice
— Unverified 00 CSS: Combining Self-training and Self-supervised Learning for Few-shot Dialogue State Tracking Oct 11, 2022 Dialogue State Tracking Machine Reading Comprehension
— Unverified 00 Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs Jul 22, 2019 Knowledge Graphs Question Answering
— Unverified 00 R3: A Reading Comprehension Benchmark Requiring Reasoning Processes Apr 2, 2020 Question Answering Reading Comprehension
— Unverified 00 R3 : Refined Retriever-Reader pipeline for Multidoc2dial May 1, 2022 Conversational Question Answering Decoder
— Unverified 00 Introduction of a Probabilistic Language Model to Non-Factoid Question Answering Using Example Q\&A Pairs Nov 1, 2012 Language Modeling Language Modelling
— Unverified 00 R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models May 4, 2024 Graph Attention Hallucination
— Unverified 00 RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training Oct 18, 2024 Denoising Question Answering
— Unverified 00 CSReader at SemEval-2018 Task 11: Multiple Choice Question Answering as Textual Entailment Jun 1, 2018 Common Sense Reasoning Language Modelling
— Unverified 00 AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types Jun 24, 2020 Anomaly Detection Knowledge Graphs
— Unverified 00 An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Introduction method for argumentative dialogue using paired question-answering interchange about personality Jul 1, 2018 Decision Making Question Answering
— Unverified 00 Introducing Semantics into Speech Encoders Nov 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Introducing RezoJDM16k: a French KnowledgeGraph DataSet for Link Prediction Jun 1, 2022 16k Benchmarking
— Unverified 00 CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task May 17, 2020 Multiple-choice Natural Language Inference
— Unverified 00 RAG based Question-Answering for Contextual Response Prediction System Sep 5, 2024 Prediction Question Answering
— Unverified 00 Introducing "Forecast Utterance" for Conversational Data Science Sep 7, 2023 Prediction Question Answering
— Unverified 00 CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass May 1, 2025 Contrastive Learning Information Retrieval
— Unverified 00 Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis Jul 21, 2024 Question Answering Text Generation
— Unverified 00 CSAT‑FTCN: A Fuzzy‑Oriented Model with Contextual Self‑attention Network for Multimodal Emotion Recognition Jan 31, 2023 Emotion Recognition Multimodal Emotion Recognition
— Unverified 00 AUTOHOME-ORCA at SemEval-2019 Task 8: Application of BERT for Fact-Checking in Community Forums Jun 1, 2019 Community Question Answering Fact Checking
— Unverified 00 A Natural Language Instructor for pedestrian navigation based in generation by selection Apr 1, 2014 Question Answering Text Generation
— Unverified 00 RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning Mar 17, 2025 Answer Generation Multi-hop Question Answering
— Unverified 00 A Domain and Language Independent Named Entity Classification Approach Based on Profiles and Local Information Sep 1, 2017 General Classification Named Entity Recognition (NER)
— Unverified 00 A Coarse to Fine Question Answering System based on Reinforcement Learning Jun 1, 2021 Deep Reinforcement Learning Question Answering
— Unverified 00 Examining Long-Context Large Language Models for Environmental Review Document Comprehension Jul 10, 2024 Question Answering RAG
— Unverified 00 VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos Jun 12, 2025 Question Answering
— Unverified 00 Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts Feb 26, 2024 Diversity Question Answering
— Unverified 00 In-the-Wild Video Question Answering Oct 1, 2022 Evidence Selection Question Answering
— Unverified 00 Inter-Weighted Alignment Network for Sentence Pair Modeling Sep 1, 2017 Machine Translation Natural Language Inference
— Unverified 00 Interpreting Questions with a Log-Linear Ranking Model in a Virtual Patient Dialogue System Jun 1, 2015 Question Answering Semantic Parsing
— Unverified 00 CS563-QA: A Collection for Evaluating Question Answering Systems Jul 2, 2019 Natural Language Understanding Question Answering
— Unverified 00 Interpreting Consumer Health Questions: The Role of Anaphora and Ellipsis Aug 1, 2013 Information Retrieval Question Answering
— Unverified 00 Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension Jun 3, 2020 Machine Reading Comprehension Question Answering
— Unverified 00