SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, and WikiQA, among many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
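As a rough illustration of the EM and F1 metrics mentioned above, here is a minimal sketch of how they are commonly computed for extractive QA. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace). The function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative and do not come from any particular library or from this page.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style evaluation; other benchmarks may normalize differently.
    """
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

A dataset-level score is usually the mean of these per-example values, often reported as a percentage. When a question has several reference answers, a common convention is to take the maximum score over the references.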


Papers

Showing 9376-9400 of 10817 papers

| Title | Status | Hype |
|---|---|---|
| Mining Social Science Publications for Survey Variables | | 0 |
| Evaluating Natural Language Understanding Services for Conversational Question Answering Systems | Code | 0 |
| Modeling Large-Scale Structured Relationships with Shared Memory for Knowledge Base Completion | | 0 |
| LearningToQuestion at SemEval 2017 Task 3: Ranking Similar Questions by Learning to Rank Using Rich Features | | 0 |
| Redundancy Localization for the Conversationalization of Unstructured Responses | | 0 |
| Delexicalized transfer parsing for low-resource languages using transformed and combined treebanks | | 0 |
| ECNU at SemEval-2017 Task 3: Using Traditional and Deep Learning Methods to Address Community Question Answering Task | | 0 |
| MoRS at SemEval-2017 Task 3: Easy to use SVM in Ranking Tasks | | 0 |
| Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers | | 0 |
| A Multi-strategy Query Processing Approach for Biomedical Question Answering: USTB_PRIR at BioASQ 2017 Task 5B | | 0 |
| Learned in Translation: Contextualized Word Vectors | Code | 0 |
| HCTI at SemEval-2017 Task 1: Use convolutional neural network to evaluate Semantic Textual Similarity | | 0 |
| EICA Team at SemEval-2017 Task 3: Semantic and Metadata-based Features for Community Question Answering | | 0 |
| Learning to Solve Geometry Problems from Natural Language Demonstrations in Textbooks | | 0 |
| Beihang-MSRA at SemEval-2017 Task 3: A Ranking System with Neural Matching Features for Community Question Answering | | 0 |
| BIT at SemEval-2017 Task 1: Using Semantic Information Space to Evaluate Semantic Textual Similarity | | 0 |
| GW_QA at SemEval-2017 Task 3: Question Answer Re-ranking on Arabic Fora | | 0 |
| Evaluating Feature Extraction Methods for Knowledge-based Biomedical Word Sense Disambiguation | | 0 |
| Assessing the performance of Olelo, a real-time biomedical question answering application | | 0 |
| Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts | | 0 |
| FA3L at SemEval-2017 Task 3: A ThRee Embeddings Recurrent Neural Network for Question Answering | | 0 |
| bunji at SemEval-2017 Task 3: Combination of Neural Similarity Features and Comment Plausibility Features | | 0 |
| Learning What is Essential in Questions | Code | 0 |
| Detecting Asymmetric Semantic Relations in Context: A Case-Study on Hypernymy Detection | | 0 |
| Acquiring Predicate Paraphrases from News Tweets | | 0 |
Page 376 of 433

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |