SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
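As an illustration of the two metrics named above, here is a minimal sketch of how exact match and token-level F1 are commonly computed for extractive question answering. It assumes a simplified SQuAD-style answer normalization (lowercasing, removing punctuation and English articles, collapsing whitespace). The function names `normalize`, `exact_match`, and `f1_score` are hypothetical and chosen for this example; they are not taken from any benchmark's official evaluation script.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: a simplified SQuAD-style normalization, for illustration only.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

For a dataset with several reference answers per question, a typical approach is to take the maximum score over the references for each question and then average over all questions; the reported EM and F1 are usually those averages expressed as percentages.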


Papers

Showing 10676-10700 of 10817 papers

Title | Status | Hype
Reinforcement Learning of Question-Answering Dialogue Policies for Virtual Museum Guides | | 0
Integrating Location, Visibility, and Question-Answering in a Spoken Dialogue System for Pedestrian City Exploration | | 0
Contingency and Comparison Relation Labeling and Structure Prediction in Chinese Sentences | | 0
A Reranking Model for Discourse Segmentation using Subtree Features | | 0
Improving Implicit Discourse Relation Recognition Through Feature Set Optimization | | 0
Learning to Model Multilingual Unrestricted Coreference in OntoNotes | | 0
A New Minimally-Supervised Framework for Domain Word Sense Disambiguation | | 0
Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web | | 0
Learning Constraints for Consistent Timeline Extraction | | 0
Learning Verb Inference Rules from Linguistically-Motivated Evidence | | 0
Collocation Polarity Disambiguation Using Web-based Pseudo Contexts | | 0
Answering Opinion Questions on Products by Exploiting Hierarchical Organization of Consumer Reviews | | 0
Identifying Constant and Unique Relations by using Time-Series Text | | 0
Entity based Q&A Retrieval | | 0
PATTY: A Taxonomy of Relational Patterns with Semantic Types | | 0
Explore Person Specific Evidence in Web Person Name Disambiguation | | 0
Extracting Opinion Expressions with semi-Markov Conditional Random Fields | | 0
A Novel Discriminative Framework for Sentence-Level Discourse Analysis | | 0
No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities | | 0
Mixed Membership Markov Models for Unsupervised Conversation Modeling | | 0
Why Question Answering using Sentiment Analysis and Word Classes | | 0
SAGAN: An approach to Semantic Textual Similarity based on Textual Entailment | | 0
SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity | | 0
SAGAN: A Machine Translation Approach for Cross-Lingual Textual Entailment | | 0
Semeval-2012 Task 8: Cross-lingual Textual Entailment for Content Synchronization | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified