SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Recent top-performing models include T5 and XLNet.
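As a rough illustration of the two metrics named above, the following is a minimal sketch of EM and token-level F1 for a single predicted answer against a single reference. It assumes the normalization conventionally used for SQuAD-style evaluation (lowercasing, removing punctuation and English articles, collapsing whitespace). The function names are illustrative, and this is not the scoring code used by this page or by any particular benchmark.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: SQuAD-style normalization; other benchmarks may differ.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions. Leaderboard values are usually reported as percentages.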


Papers

Showing 10751–10800 of 10817 papers

Title | Status | Hype
Structured Event Retrieval over Microblog Archives |  | 0
Topical Segmentation: a Study of Human Performance and a New Measure of Quality. |  | 0
Tools for plWordNet Development. Presentation and Perspectives |  | 0
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information |  | 0
The FLaReNet Strategic Language Resource Agenda |  | 0
Turkish Paraphrase Corpus |  | 0
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World |  | 0
Clause-based Discourse Segmentation of Arabic Texts |  | 0
Learning Sentiment Lexicons in Spanish |  | 0
Constructing a Question Corpus for Textual Semantic Relations |  | 0
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. |  | 0
Parsing Any Domain English text to CoNLL dependencies |  | 0
Chinese Whispers: Cooperative Paraphrase Acquisition |  | 0
Constraint Based Description of Polish Multiword Expressions |  | 0
Evaluating Multi-focus Natural Language Queries over Data Services |  | 0
Collecting humorous expressions from a community-based question-answering-service corpus |  | 0
Págico: Evaluating Wikipedia-based information retrieval in Portuguese |  | 0
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual |  | 0
Kitten: a tool for normalizing HTML and extracting its textual content |  | 0
Applying Random Indexing to Structured Data to Find Contextually Similar Words |  | 0
A corpus of general and specific sentences from news |  | 0
Evaluating Machine Reading Systems through Comprehension Tests |  | 0
Creation and use of Language Resources in a Question-Answering eHealth System |  | 0
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing |  | 0
Building and Exploring Semantic Equivalences Resources |  | 0
Constructing Large Proposition Databases |  | 0
Adding Morpho-semantic Relations to the Romanian Wordnet |  | 0
Polaris: Lymba's Semantic Parser |  | 0
Annotating Opinions in German Political News |  | 0
Identifying Nuggets of Information in GALE Distillation Evaluation |  | 0
Propbank-Br: a Brazilian Treebank annotated with semantic role labels |  | 0
Assessing Crowdsourcing Quality through Objective Tasks |  | 0
DBpedia: A Multilingual Cross-domain Knowledge Base |  | 0
Automatic lexical semantic classification of nouns |  | 0
MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis |  | 0
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT |  | 0
DISLOG: A logic-based language for processing discourse structures |  | 0
QurAna: Corpus of the Quran annotated with Pronominal Anaphora |  | 0
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese |  | 0
SUTime: A library for recognizing and normalizing time expressions |  | 0
Visualizing Sentiment Analysis on a User Forum |  | 0
The TARSQI Toolkit |  | 0
Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution |  | 0
TIMEN: An Open Temporal Expression Normalisation Resource | Code | 0
KBGen – Text Generation from Knowledge Bases as a New Shared Task |  | 0
Interactive Natural Language Query Construction for Report Generation |  | 0
Methods Combination and ML-based Re-ranking of Multiple Hypothesis for Question-Answering Systems |  | 0
Experiments on Hybrid Corpus-Based Sentiment Lexicon Acquisition |  | 0
Looking at word meaning. An interactive visualization of Semantic Vector Spaces for Dutch synsets |  | 0
Coupling Knowledge-Based and Data-Driven Systems for Named Entity Recognition |  | 0
Page 216 of 217

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 |  | Unverified
2 | FPNet (ensemble) | EM | 90.87 |  | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 |  | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 |  | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 |  | Unverified
6 | FPNet (ensemble) | EM | 90.6 |  | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 |  | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 |  | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 |  | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 |  | Unverified