SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
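As a rough illustration of the EM and F1 metrics mentioned above, here is a minimal Python sketch modeled on the answer normalization commonly used for SQuAD-style evaluation (lowercasing, stripping punctuation and English articles, collapsing whitespace). The function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative assumptions, not an official API, and individual benchmarks may normalize answers differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the common SQuAD-style normalization; other
    benchmarks may use different rules.
    """
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    if not pred_tokens or not ref_tokens:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

In practice, per-example scores like these are usually averaged over the dataset and reported as percentages; when a question has several reference answers, a common convention is to take the maximum score over the references.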


Papers

Showing 7526-7550 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages | Code | 0 |
| Analyzing Sustainability Reports Using Natural Language Processing | Code | 0 |
| Reasoning Over History: Context Aware Visual Dialog | | 0 |
| The Devil is in the Details: Evaluating Limitations of Transformer-based Methods for Granular Tasks | Code | 0 |
| COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense Question Answering | Code | 0 |
| Coarse-to-Fine Query Focused Multi-Document Summarization | | 0 |
| CHIME: Cross-passage Hierarchical Memory Network for Generative Review Question Answering | Code | 0 |
| Unsupervised Adaptation of Question Answering Systems via Generative Self-training | | 0 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | | 0 |
| Open Domain Question Answering based on Text Enhanced Knowledge Graph with Hyperedge Infusion | | 0 |
| CapWAP: Image Captioning with a Purpose | | 0 |
| On the effectiveness of small, discriminatively pre-trained language representation models for biomedical text mining | | 0 |
| Event Extraction as Multi-turn Question Answering | | 0 |
| Event Extraction as Machine Reading Comprehension | | 0 |
| Can Pre-training help VQA with Lexical Variations? | | 0 |
| Towards Zero-Shot Conditional Summarization with Adaptive Multi-Task Fine-Tuning | Code | 0 |
| Multi-Task Learning using Dynamic Task Weighting for Conversational Question Answering | | 0 |
| Enhancing Question Answering by Injecting Ontological Knowledge through Regularization | | 0 |
| Methods for Numeracy-Preserving Word Embeddings | | 0 |
| AnswerFact: Fact Checking in Product Question Answering | | 0 |
| Biomedical Event Extraction as Multi-turn Question Answering | Code | 0 |
| DSNDM: Deep Siamese Neural Discourse Model with Attention for Text Pairs Categorization and Ranking | | 0 |
| Do Transformers Dream of Inference, or Can Pretrained Generative Models Learn Implicit Inferential Rules? | | 0 |
| Don’t Read Too Much Into It: Adaptive Computation for Open-Domain Question Answering | | 0 |
| Deeply Embedded Knowledge Representation & Reasoning For Natural Language Question Answering: A Practitioner’s Perspective | | 0 |

Benchmark Results

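| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |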