SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, and WikiQA, among many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
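As a rough illustration of the EM and F1 metrics mentioned above, the following is a minimal sketch modeled on the SQuAD-style convention: answers are lowercased, stripped of punctuation and English articles, then compared either as whole strings (EM) or by token overlap (F1). The function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative choices, not an API from any listed paper or benchmark, and individual datasets may normalize answers differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # Assumption: if either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("in Paris France", "Paris"))
```

When a question has several acceptable reference answers, a common choice is to take the maximum score over the references and then average over all questions; that aggregation step is left out of this sketch.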

Papers

Showing 9976-10000 of 10817 papers

Title | Status | Hype
Semantic Parsing to Probabilistic Programs for Situated Question Answering | | 0
Semantic Parsing using Distributional Semantics and Probabilistic Logic | | 0
Semantic Parsing via Paraphrasing | | 0
Semantic Relations in Discourse: The Current State of ISO 24617-8 | | 0
Semantic Role Labeling of Implicit Arguments for Nominal Predicates | | 0
Semantic role labeling tools for biomedical question answering: a study of selected tools on the BioASQ datasets | | 0
Semantic Roles in Grammar Engineering | | 0
Semantics-based Graph Approach to Complex Question-Answering | | 0
Semantic Search as Extractive Paraphrase Span Detection | | 0
Semantic Parsing for Question Answering over Knowledge Graphs | | 0
Semantic Sentence Composition Reasoning for Multi-Hop Question Answering | | 0
Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information | | 0
Semantic similarity estimation for domain specific data using BERT and other techniques | | 0
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods | | 0
Semantic Structure Analysis of Noun Phrases using Abstract Meaning Representation | | 0
Semantic Structure based Query Graph Prediction for Question Answering over Knowledge Graph | | 0
Semantic Technologies in IBM Watson | | 0
Semantic Textual Similarity: past present and future | | 0
Semantic Textual Similarity with Siamese Neural Networks | | 0
Semantic Tokens in Retrieval Augmented Generation | | 0
Semantic Web Enabled Geographic Question Answering Framework: GeoTR | | 0
SemantiKLUE: Robust Semantic Similarity at Multiple Levels Using Maximum Weight Matching | | 0
SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity | | 0
Semeval-2012 Task 8: Cross-lingual Textual Entailment for Content Synchronization | | 0
SemEval-2013 Task 10: Cross-lingual Word Sense Disambiguation | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified