SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1055110600 of 10817 papers

TitleStatusHype
AIDA: Artificial Intelligent Dialogue Agent0
A Lightweight and High Performance Monolingual Word Aligner0
Paraphrase-Driven Learning for Open Question Answering0
Automatic Coupling of Answer Extraction and Information Retrieval0
Minimum Bayes Risk based Answer Re-ranking for Question Answering0
Joint Inference for Fine-grained Opinion Extraction0
Philosophers are Mortal: Inferring the Truth of Unseen Facts0
Automatic Interpretation of the English Possessive0
Reinforcement Learning of Two-Issue Negotiation Dialogue Policies0
Answer Extraction by Recursive Parse Tree Descent0
Latent Semantic Tensor Indexing for Community-based Question Answering0
Question Analysis for Polish Question Answering0
Interpreting Consumer Health Questions: The Role of Anaphora and Ellipsis0
Cross-lingual Transfer of Semantic Role Labeling Models0
Joint Inference for Heterogeneous Dependency Parsing0
Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis0
Evaluating Text Segmentation using Boundary Edit Distance0
Feature-Based Selection of Dependency Paths in Ad Hoc Information Retrieval0
Aggregated Word Pair Features for Implicit Discourse Relation Disambiguation0
FrameNet on the Way to Babel: Creating a Bilingual FrameNet Using Wiktionary as Interlingual Connection0
Decipherment0
Name-aware Machine Translation0
Learning Adaptable Patterns for Passage Reranking0
Deceptive Answer Prediction with User Preference Graph0
Evaluating a City Exploration Dialogue System with Integrated Question-Answering and Pedestrian Navigation0
Leveraging Synthetic Discourse Data via Multi-task Learning for Implicit Discourse Relation Recognition0
PLIS: a Probabilistic Lexical Inference System0
A Case Study Towards Turkish Paraphrase Alignment0
Meet EDGAR, a tutoring agent at MONSERRATE0
Frequently Asked Questions Retrieval for Croatian Based on Semantic Textual Similarity0
MT Quality Estimation: The CMU System for WMT’130
A Multithreaded Conversational Interface for Pedestrian Navigation and Question Answering0
Exploiting Qualitative Information from Automatic Word Alignment for Cross-lingual NLP Tasks0
A Novel Architecture For Question Classification Based Indexing Scheme For Efficient Question Answering0
ECNUCS: Measuring Short Text Semantic Equivalence Using Multiple Similarity Measurements0
DLS@CU-CORE: A Simple Machine Learning Model of Semantic Textual Similarity0
Finding What Matters in Questions0
HENRY-CORE: Domain Adaptation and Stacking for Text Similarity0
KLUE-CORE: A regression model of semantic textual similarity0
INAOE\_UPV-CORE: Extracting Word Associations from Document Corpora to estimate Semantic Textual Similarity0
CNGL: Grading Student Answers by Acts of Translation0
Coarse to Fine Grained Sense Disambiguation in Wikipedia0
JU\_CSE: A CRF Based Approach to Annotation of Temporal Expression, Event and Temporal Relations0
Large-Scale Paraphrasing for Natural Language Understanding0
DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation0
Combining Heterogeneous Models for Measuring Relational Similarity0
LIPN-CORE: Semantic Text Similarity using n-grams, WordNet, Syntactic Analysis, ESA and Information Retrieval based Features0
Answer Extraction as Sequence Tagging with Tree Edit Distance0
LIMSIILES: Basic English Substitution for Student Answer Assessment at SemEval 20130
ECNUCS: Recognizing Cross-lingual Textual Entailment Using Multiple Text Similarity and Text Difference Measures0
Show:102550
← PrevPage 212 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified