SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1070110750 of 10817 papers

TitleStatusHype
A New Minimally-Supervised Framework for Domain Word Sense Disambiguation0
Learning to Model Multilingual Unrestricted Coreference in OntoNotes0
SAGAN: A Machine Translation Approach for Cross-Lingual Textual Entailment0
SAGAN: An approach to Semantic Textual Similarity based on Textual Entailment0
Computational Approaches to Sentence Completion0
Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web0
Movie-DiC: a Movie Dialogue Corpus for Research and Development0
Learning Verb Inference Rules from Linguistically-Motivated Evidence0
Monolingual Distributional Similarity for Text-to-Text Generation0
A Novel Discriminative Framework for Sentence-Level Discourse Analysis0
Answering Opinion Questions on Products by Exploiting Hierarchical Organization of Consumer Reviews0
Pattern Learning for Relation Extraction with a Hierarchical Topic Model0
A Discriminative Hierarchical Model for Fast Coreference at Large Scale0
Extracting Opinion Expressions with semi-Markov Conditional Random Fields0
PATTY: A Taxonomy of Relational Patterns with Semantic Types0
Extracting Context-Rich Entailment Rules from Wikipedia Revision History0
IRIS: a Chat-oriented Dialogue System based on the Vector Space Model0
Entity based Q\&A Retrieval0
Contingency and Comparison Relation Labeling and Structure Prediction in Chinese Sentences0
Bayesian Symbol-Refined Tree Substitution Grammars for Syntactic Parsing0
sranjans : Semantic Textual Similarity using Maximal Weighted Bipartite Graph Matching0
UMCC\_DLSI: Multidimensional Lexical-Semantic Textual Similarity0
Tiantianzhu7:System Description of Semantic Textual Similarity (STS) in the SemEval-2012 (Task 6)0
Unsupervised Relation Discovery with Sense Disambiguation0
SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity0
Semeval-2012 Task 8: Cross-lingual Textual Entailment for Content Synchronization0
UAlacant: Using Online Machine Translation for Cross-Lingual Textual Entailment0
Sentence Dependency Tagging in Online Question Answering Forums0
UMichigan: A Conditional Random Field Model for Resolving the Scope of Negation0
Text-level Discourse Parsing with Rich Linguistic Features0
UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures0
University\_Of\_Sheffield: Two Approaches to Semantic Text Similarity0
UWN: A Large Multilingual Lexical Knowledge Base0
Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia0
Why Question Answering using Sentiment Analysis and Word Classes0
Taxonomy Induction Using Hierarchical Random Graphs0
Typologie des questions \`a r\'eponses multiples pour un syst\`eme de question-r\'eponse (Typology of Multiple Answer Questions for a Question-answering System) [in French]0
Topical Segmentation: a Study of Human Performance and a New Measure of Quality.0
Structured Event Retrieval over Microblog Archives0
Using paraphrases for improving first story detection in news and Twitter0
Natural Language Processing in Watson0
A Study of Heterogeneous Similarity Measures for Semantic Relation Extraction0
On-Demand Distributional Semantic Distance and Paraphrasing0
Grammatical structures for word-level sentiment detection0
Knowledge Extraction and Joint Inference Using Tractable Markov Logic0
100 Things You Always Wanted to Know about Linguistics But Were Afraid to Ask*0
Predicting Structures in NLP: Constrained Conditional Models and Integer Linear Programming in NLP0
Analyzing Patient Records to Establish If and When a Patient Suffered from a Medical Condition0
Probabilistic Databases of Universal Schema0
Constructing a Textual KB from a Biology TextBook0
Show:102550
← PrevPage 215 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified