SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.

(Image credit: SQuAD)
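
The EM and F1 metrics mentioned above can be illustrated with a minimal sketch. It assumes the SQuAD-style convention for extractive QA: answers are lowercased, stripped of punctuation and English articles, and then compared either as whole strings (EM) or as bags of tokens (F1). The function names `normalize`, `exact_match`, and `f1_score` are illustrative and not taken from any particular library. Individual benchmarks may normalize differently, and they often take the maximum score over several reference answers.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("in Paris, France", "Paris"))
```

EM rewards only fully correct spans, while F1 gives partial credit for overlapping tokens, which is why leaderboards usually report both.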

Papers

Showing 9301 to 9325 of 10817 papers

| Title | Status | Hype |
|---|---|---|
| Data Augmentation for Visual Question Answering | | 0 |
| Recognizing Textual Entailment in Twitter Using Word Embeddings | | 0 |
| deepCybErNet at EmoInt-2017: Deep Emotion Intensities in Tweets | | 0 |
| Constructing an Alias List for Named Entities during an Event | | 0 |
| Multimedia Summary Generation from Online Conversations: Current Approaches and Future Directions | | 0 |
| Deception Detection in News Reports in the Russian Language: Lexics and Discourse | | 0 |
| Generating titles for millions of browse pages on an e-Commerce site | | 0 |
| Spatial Language Understanding with Multimodal Graphs using Declarative Learning based Programming | | 0 |
| Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation | | 0 |
| Using Question-Answering Techniques to Implement a Knowledge-Driven Argument Mining Approach | | 0 |
| Rhetorical relations markers in Russian RST Treebank | | 0 |
| Sparsity and Noise: Where Knowledge Graph Embeddings Fall Short | Code | 0 |
| Story Comprehension for Predicting What Happens Next | | 0 |
| Latent Space Embedding for Retrieval in Question-Answer Archives | | 0 |
| From Textbooks to Knowledge: A Case Study in Harvesting Axiomatic Knowledge from Textbooks to Solve Geometry Problems | | 0 |
| No Need to Pay Attention: Simple Recurrent Neural Networks Work! | | 0 |
| Neural Semantic Parsing with Type Constraints for Semi-Structured Tables | Code | 0 |
| Neural Net Models of Open-domain Discourse Coherence | | 0 |
| A Joint Sequential and Relational Model for Frame-Semantic Parsing | | 0 |
| A Question Answering Approach for Emotion Cause Extraction | | 0 |
| Learning Contextually Informed Representations for Linear-Time Discourse Parsing | | 0 |
| CROWD-IN-THE-LOOP: A Hybrid Approach for Annotating Semantic Roles | | 0 |
| Exploiting Morphological Regularities in Distributional Word Representations | | 0 |
| Importance sampling for unbiased on-demand evaluation of knowledge base population | | 0 |
| Classification of telicity using cross-linguistic annotation projection | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |