SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
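As a rough illustration of how EM and F1 are usually computed for extractive question answering, here is a minimal Python sketch. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles) and a single reference answer per question; the function names `normalize`, `exact_match`, and `f1_score` are chosen for this example and are not taken from any particular benchmark's official script.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    """Lowercase, drop punctuation and English articles, split on whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style evaluation; other benchmarks may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return text.split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized answers are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred, ref = normalize(prediction), normalize(reference)
    if not pred or not ref:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))  # 1.0
    print(f1_score("in Paris France", "Paris"))              # 0.5
```

When a question has several reference answers, evaluation scripts typically take the maximum score over the references and then average over all questions; that aggregation step is left out of this sketch.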


Papers

Showing 10276-10300 of 10817 papers

| Title | Status | Hype |
|---|---|---|
| Generating Supplementary Travel Guides from Social Media | | 0 |
| An Analysis of Causality between Events and its Relation to Temporal Information | | 0 |
| A Hybrid Approach to Features Representation for Fine-grained Arabic Named Entity Recognition | | 0 |
| Knowledge Sharing via Social Login: Exploiting Microblogging Service for Warming up Social Question Answering Websites | | 0 |
| Nerdle: Topic-Specific Question Answering Using Wikia Seeds | | 0 |
| Argument structure of adverbial derivatives in Russian | | 0 |
| Exploring Fine-grained Entity Type Constraints for Distantly Supervised Relation Extraction | | 0 |
| Learning to Summarise Related Sentences | | 0 |
| Biomedical/Clinical NLP | | 0 |
| Dependency Parsing: Past, Present, and Future | | 0 |
| HARPY: Hypernyms and Alignment of Relational Paraphrases | | 0 |
| Converting Phrase Structures to Dependency Structures in Sanskrit | | 0 |
| Unsupervised Verb Inference from Nouns Crossing Root Boundary | | 0 |
| Towards an open-domain conversational system fully based on natural language processing | | 0 |
| What or Who is Multilingual Watson? | | 0 |
| Hierarchical Topical Segmentation with Affinity Propagation | | 0 |
| A Three-Step Transition-Based System for Non-Projective Dependency Parsing | | 0 |
| Group Non-negative Matrix Factorization with Natural Categories for Question Retrieval in Community Question Answer Archives | | 0 |
| Building English-Vietnamese Named Entity Corpus with Aligned Bilingual News Articles | | 0 |
| Automatic Compound Processing: Compound Splitting and Semantic Analysis for Afrikaans and Dutch | | 0 |
| Experiments with Easy-first nonprojective constituent parsing | | 0 |
| Annotate and Identify Modalities, Speech Acts and Finer-Grained Event Types in Chinese Text | | 0 |
| Towards Identifying Hindi/Urdu Noun Templates in Support of a Large-Scale LFG Grammar | | 0 |
| Word Clustering Based on Un-LP Algorithm | | 0 |
| Towards Model Driven Architectures for Human Language Technologies | | 0 |
Page 412 of 433

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |