SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
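As a rough illustration of the two metrics named above, here is a minimal Python sketch of exact match and token-level F1 for extractive question answering. It assumes SQuAD-style answer normalization (lowercasing, removing punctuation and English articles, collapsing whitespace); the function names are hypothetical, and this page does not state which evaluation script produced the scores it lists.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: SQuAD-style normalization; other benchmarks may differ.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Leaderboards usually report these scores averaged over all questions and scaled to a percentage. When a question has several reference answers, a common convention is to take the maximum score over the references.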


Papers

Showing 9401–9425 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| NLM\_NIH at SemEval-2017 Task 3: from Question Entailment to Question Similarity for Community Question Answering | | 0 |
| Hitachi at SemEval-2017 Task 12: System for temporal information extraction from clinical notes | | 0 |
| Detecting Asymmetric Semantic Relations in Context: A Case-Study on Hypernymy Detection | | 0 |
| EICA Team at SemEval-2017 Task 3: Semantic and Metadata-based Features for Community Question Answering | | 0 |
| ECNU at SemEval-2017 Task 3: Using Traditional and Deep Learning Methods to Address Community Question Answering Task | | 0 |
| MoRS at SemEval-2017 Task 3: Easy to use SVM in Ranking Tasks | | 0 |
| TakeLab-QA at SemEval-2017 Task 3: Classification Experiments for Answer Retrieval in Community QA | | 0 |
| SwissAlps at SemEval-2017 Task 3: Attention-based Convolutional Neural Network for Community Question Answering | Code | 0 |
| SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity | | 0 |
| SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation | | 0 |
| SimBow at SemEval-2017 Task 3: Soft-Cosine Semantic Similarity between Questions for Community Question Answering | | 0 |
| Talla at SemEval-2017 Task 3: Identifying Similar Questions Through Paraphrase Detection | | 0 |
| UINSUSKA-TiTech at SemEval-2017 Task 3: Exploiting Word Importance Levels for Similarity Features for CQA | | 0 |
| Ways of Asking and Replying in Duplicate Question Detection | | 0 |
| SCIR-QA at SemEval-2017 Task 3: CNN Model Based on Similar and Dissimilar Information between Keywords for Question Similarity | | 0 |
| UPC-USMBA at SemEval-2017 Task 3: Combining multiple approaches for CQA for Arabic | | 0 |
| Learned in Translation: Contextualized Word Vectors | Code | 0 |
| SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation | Code | 1 |
| MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension | | 0 |
| Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering | | 0 |
| Question Dependent Recurrent Entity Network for Question Answering | Code | 0 |
| Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering | Code | 0 |
| MatchZoo: A Toolkit for Deep Text Matching | Code | 0 |
| Adversarial Examples for Evaluating Reading Comprehension Systems | Code | 0 |
| Improved Bilinear Pooling with CNNs | | 0 |

Benchmark Results

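| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |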