SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 84018425 of 10817 papers

TitleStatusHype
Towards Scalable and Reliable Capsule Networks for Challenging NLP ApplicationsCode0
Cross-Lingual Training for Automatic Question GenerationCode0
Towards Interpretable Reinforcement Learning Using Attention Augmented AgentsCode0
A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text MiningCode0
KERMIT: Generative Insertion-Based Modeling for Sequences0
Episodic Memory in Lifelong Language LearningCode0
Generating Question Relevant Captions to Aid Visual Question Answering0
Question Answering as an Automatic Evaluation Metric for News Article SummarizationCode0
CODAH: An Adversarially-Authored Question Answering Dataset for Common SenseCode0
CodeForTheChange at SemEval-2019 Task 8: Skip-Thoughts for Fact Checking in Community Question Answering0
FreebaseQA: A New Factoid QA Data Set Matching Trivia-Style Question-Answer Pairs with FreebaseCode0
Alignment over Heterogeneous Embeddings for Question AnsweringCode0
Fermi at SemEval-2019 Task 8: An elementary but effective approach to Question Discernment in Community QA Forums0
ProblemSolver at SemEval-2019 Task 10: Sequence-to-Sequence Learning and Expression Trees0
Predicting Helpful Posts in Open-Ended Discussion Forums: A Neural Architecture0
Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence0
On Knowledge distillation from complex networks for response prediction0
AiFu at SemEval-2019 Task 10: A Symbolic and Sub-symbolic Integrated System for SAT Math Question Answering0
Natural Questions: a Benchmark for Question Answering Research0
Promotion of Answer Value Measurement with Domain Effects in Community Question Answering Systems0
Enhancing Key-Value Memory Neural Networks for Knowledge Based Question Answering0
DiffQue: Estimating Relative Difficulty of Questions in Community Question Answering ServicesCode0
BLCU\_NLP at SemEval-2019 Task 8: A Contextual Knowledge-enhanced GPT Model for Fact Checking0
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering0
DUTH at SemEval-2019 Task 8: Part-Of-Speech Features for Question Classification0
Show:102550
← PrevPage 337 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified