SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 54515500 of 10817 papers

TitleStatusHype
Cooperative Self-training of Machine Reading Comprehension0
Cooperative Reasoning on Knowledge Graph and Corpus: A Multi-agentReinforcement Learning Approach0
DHP Benchmark: Are LLMs Good NLG Evaluators?0
LargePiG: Your Large Language Model is Secretly a Pointer Generator0
Large-Scale Acquisition of Commonsense Knowledge via a Quiz Game on a Dialogue System0
Large-Scale Acquisition of Entailment Pattern Pairs by Exploiting Transitivity0
Image Position Prediction in Multimodal Documents0
Large-scale CCG Induction from the Groningen Meaning Bank0
A topic-aware graph neural network model for knowledge base updating0
Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models0
Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes0
Large-Scale Goodness Polarity Lexicons for Community Question Answering0
Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach0
A Deep Learning Approach for Expert Identification in Question Answering Communities0
Large-Scale Paraphrasing for Natural Language Understanding0
Large Scale Question Answering using Tourism Data0
Large Scale Question Paraphrase Retrieval with Smoothed Deep Metric Learning0
Image Captioning with Compositional Neural Module Networks0
Large Scale Scene Text Verification with Guided Attention0
Large-Scale Semantic Indexing and Question Answering in Biomedicine0
Cooperative Denoising for Distantly Supervised Relation Extraction0
A tool suite for creating question answering benchmarks0
Large Vision-Language Models for Remote Sensing Visual Question Answering0
LARSA22 at Qur’an QA 2022: Text-to-Text Transformer for Finding Answers to Questions from Qur’an0
Leveraging Frequent Query Substructures to Generate Formal Queries for Complex Question Answering0
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge0
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks0
COOL, a Context Outlooker, and its Application to Question Answering and other Natural Language Processing Tasks0
CLIPPO: Image-and-Language Understanding from Pixels Only0
Bayesian Attention Belief Networks0
Latent Question Reformulation and Information Accumulation for Multi-Hop Machine Reading0
Latent Semantic Tensor Indexing for Community-based Question Answering0
Cooking with Semantics0
Latent Trees for Coreference Resolution0
Latent Variable Models for Visual Question Answering0
Latent-Variable PCFGs: Background and Applications0
Analysis of Temporal Expressions Annotated in Clinical Notes0
Leveraging Graph Retrieval-Augmented Generation to Support Learners' Understanding of Knowledge Concepts in MOOCs0
Leveraging Linguistic Structure For Open Domain Information Extraction0
Leveraging Pre-trained Models for Failure Analysis Triplets Generation0
LAVIS: A Library for Language-Vision Intelligence0
Lexical Substitution Dataset for German0
Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data0
Convolutional Neural Network: Text Classification Model for Open Domain Question Answering System0
Bayesian Supervised Domain Adaptation for Short Text Similarity0
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference0
LB-KBQA: Large-language-model and BERT based Knowledge-Based Question and Answering System0
LCQMC:A Large-scale Chinese Question Matching Corpus0
IJCNLP-2017 Task 5: Multi-choice Question Answering in Examinations0
Atomic Fact Decomposition Helps Attributed Question Answering0
Show:102550
← PrevPage 110 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified