SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1070110750 of 10817 papers

TitleStatusHype
UMichigan: A Conditional Random Field Model for Resolving the Scope of Negation0
UMCC\_DLSI: Multidimensional Lexical-Semantic Textual Similarity0
UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures0
SAGAN: A Machine Translation Approach for Cross-Lingual Textual Entailment0
UAlacant: Using Online Machine Translation for Cross-Lingual Textual Entailment0
Tiantianzhu7:System Description of Semantic Textual Similarity (STS) in the SemEval-2012 (Task 6)0
SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity0
DERI\&UPM: Pushing Corpus Based Relatedness to Similarity: Shared Task System Description0
DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets0
ETS: Discriminative Edit Models for Paraphrase Scoring0
PolyUCOMP: Combining Semantic Vectors with Skip bigrams for Semantic Textual Similarity0
An Unsupervised Ranking Model for Noun-Noun Compositionality0
Monolingual Distributional Similarity for Text-to-Text Generation0
Adaptive Clustering for Coreference Resolution with Deterministic Rules and Web-Based Language Models0
A Probabilistic Lexical Model for Ranking Textual Inferences0
Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia0
UWN: A Large Multilingual Lexical Knowledge Base0
Qualitative Modeling of Spatial Prepositions and Motion Expressions0
IRIS: a Chat-oriented Dialogue System based on the Vector Space Model0
Joint Learning of a Dual SMT System for Paraphrase Generation0
Computational Approaches to Sentence Completion0
Crosslingual Induction of Semantic Roles0
Community Answer Summarization for Multi-Sentence Question with Group L1 Regularization0
A Discriminative Hierarchical Model for Fast Coreference at Large Scale0
Improving Word Representations via Global Context and Multiple Word Prototypes0
Crowdsourcing Inference-Rule Evaluation0
Movie-DiC: a Movie Dialogue Corpus for Research and Development0
Learning to Temporally Order Medical Events in Clinical Text0
Bayesian Symbol-Refined Tree Substitution Grammars for Syntactic Parsing0
Efficient Search for Transformation-based Inference0
Efficient Tree-based Approximation for Entailment Graph Learning0
Pattern Learning for Relation Extraction with a Hierarchical Topic Model0
Text-level Discourse Parsing with Rich Linguistic Features0
Unsupervised Relation Discovery with Sense Disambiguation0
Sentence Dependency Tagging in Online Question Answering Forums0
Typologie des questions \`a r\'eponses multiples pour un syst\`eme de question-r\'eponse (Typology of Multiple Answer Questions for a Question-answering System) [in French]0
A Study of Heterogeneous Similarity Measures for Semantic Relation Extraction0
Constructing a Textual KB from a Biology TextBook0
Knowledge Extraction and Joint Inference Using Tractable Markov Logic0
Probabilistic Databases of Universal Schema0
Analyzing Patient Records to Establish If and When a Patient Suffered from a Medical Condition0
PREFER: Using a Graph-Based Approach to Generate Paraphrases for Language Learning0
Nudging the Envelope of Direct Transfer Methods for Multilingual Named Entity Recognition0
Natural Language Processing in Watson0
Predicting Structures in NLP: Constrained Conditional Models and Integer Linear Programming in NLP0
100 Things You Always Wanted to Know about Linguistics But Were Afraid to Ask*0
On-Demand Distributional Semantic Distance and Paraphrasing0
Grammatical structures for word-level sentiment detection0
Taxonomy Induction Using Hierarchical Random Graphs0
Using paraphrases for improving first story detection in news and Twitter0
Show:102550
← PrevPage 215 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified