SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 87268750 of 10817 papers

TitleStatusHype
Semantic Linking in Convolutional Neural Networks for Answer Sentence Selection0
Structured Alignment Networks for Matching Sentences0
Similarity-Based Reconstruction Loss for Meaning Representation0
The BQ Corpus: A Large-scale Domain-specific Chinese Corpus For Sentence Semantic Equivalence Identification0
Transfer and Multi-Task Learning for Noun--Noun Compound Interpretation0
Supervised and Unsupervised Methods for Robust Separation of Section Titles and Prose Text in Web Documents0
Speed Reading: Learning to Read ForBackward via ShuttleCode0
Uncovering Code-Mixed Challenges: A Framework for Linguistically Driven Question Generation and Neural Based Question Answering0
Sentiment Classification towards Question-Answering with Hierarchical Matching Network0
Spot the Odd Man Out: Exploring the Associative Power of Lexical Resources0
Ranking Paragraphs for Improving Answer Recall in Open-Domain Question AnsweringCode0
Direct optimization of F-measure for retrieval-based personal question answering0
Denoise while Aggregating: Collaborative Learning in Open-Domain Question Answering0
Learning Corresponded Rationales for Text Matching0
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation0
A Qualitative Comparison of CoQA, SQuAD 2.0 and QuACCode0
No One is Perfect: Analysing the Performance of Question Answering Components over the DBpedia Knowledge GraphCode0
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question AnsweringCode1
ComQA: A Community-sourced Dataset for Complex Factoid Question Answering with Paraphrase Clusters0
Stochastic Answer Networks for SQuAD 2.0Code0
Joint Multitask Learning for Community Question Answering Using Task-Specific Embeddings0
Textually Enriched Neural Module Networks for Visual Question Answering0
Neural Approaches to Conversational AI0
Multimodal Dual Attention Memory for Video Story Question Answering0
A Quantitative Evaluation of Natural Language Question Interpretation for Question Answering Systems0
Show:102550
← PrevPage 350 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified