SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1065110700 of 10817 papers

TitleStatusHype
Towards Two-step Multi-document Summarisation for Evidence Based Medicine: A Quantitative Analysis0
The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval0
Simple or Complex? Classifying Questions by Answering Complexity0
Towards a thematic role based target identification model for question answering0
Structured and Logical Representations of Assamese Text for Question-Answering System0
WikiTalk: A Spoken Wikipedia-based Open-Domain Knowledge Access System0
Thread Specific Features are Helpful for Identifying Subjectivity Orientation of Online Forum Threads0
Thai Sentence Paraphrasing from the Lexical Resource0
Anaphora Annotation in Hindi Dependency TreeBank0
Combining Social Cognitive Theories with Linguistic Features for Multi-genre Sentiment Analysis0
Answering Questions Requiring Cross-passage Evidence0
Annotation Scheme for Constructing Sentiment Corpus in Korean0
A Model of Vietnamese Person Named Entity Question Answering System0
Predicting Answer Location Using Shallow Semantic Analogical Reasoning in a Factoid Question Answering System0
Language Independent Sentence-Level Subjectivity Analysis with Feature Selection0
Introduction of a Probabilistic Language Model to Non-Factoid Question Answering Using Example Q\&A Pairs0
Explore Person Specific Evidence in Web Person Name Disambiguation0
Joint Learning of a Dual SMT System for Paraphrase Generation0
Graph Based Similarity Measures for Synonym Extraction from Parsed Text0
Learning Constraints for Consistent Timeline Extraction0
Crowdsourcing Inference-Rule Evaluation0
Improving Word Representations via Global Context and Multiple Word Prototypes0
No Noun Phrase Left Behind: Detecting and Typing Unlinkable Entities0
Community Answer Summarization for Multi-Sentence Question with Group L1 Regularization0
Qualitative Modeling of Spatial Prepositions and Motion Expressions0
An Unsupervised Ranking Model for Noun-Noun Compositionality0
Reinforcement Learning of Question-Answering Dialogue Policies for Virtual Museum Guides0
Collocation Polarity Disambiguation Using Web-based Pseudo Contexts0
DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets0
Efficient Search for Transformation-based Inference0
Crosslingual Induction of Semantic Roles0
An End-to-End Evaluation of Two Situated Dialog Systems0
Identifying Constant and Unique Relations by using Time-Series Text0
How to Evaluate Opinionated Keyphrase Extraction?0
Improving Implicit Discourse Relation Recognition Through Feature Set Optimization0
Efficient Tree-based Approximation for Entailment Graph Learning0
PolyUCOMP: Combining Semantic Vectors with Skip bigrams for Semantic Textual Similarity0
Prior versus Contextual Emotion of a Word in a Sentence0
Adaptive Clustering for Coreference Resolution with Deterministic Rules and Web-Based Language Models0
DERI\&UPM: Pushing Corpus Based Relatedness to Similarity: Shared Task System Description0
A Probabilistic Lexical Model for Ranking Textual Inferences0
How do Negation and Modality Impact on Opinions?0
Exploring Temporal Vagueness with Mechanical Turk0
English-Korean Named Entity Transliteration Using Substring Alignment and Re-ranking Methods0
ETS: Discriminative Edit Models for Paraphrase Scoring0
Annotating Coordination in the Penn Treebank0
A Reranking Model for Discourse Segmentation using Subtree Features0
Integrating Location, Visibility, and Question-Answering in a Spoken Dialogue System for Pedestrian City Exploration0
Learning to Temporally Order Medical Events in Clinical Text0
Mixed Membership Markov Models for Unsupervised Conversation Modeling0
Show:102550
← PrevPage 214 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified