SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
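As a rough illustration of the two metrics named above, the sketch below computes EM and token-level F1 for a single prediction against a single reference answer. It assumes the SQuAD-style convention of normalizing answers (lowercasing, stripping punctuation and English articles) before comparison; other benchmarks may normalize differently, and the function names here are illustrative rather than taken from any particular evaluation script.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("Eiffel Tower in Paris", "the Eiffel Tower"))
```

When a question has several reference answers, a common choice is to take the maximum score over the references and then average over all questions; that aggregation step is left out here.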


Papers

Showing 6801–6850 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Zero-Shot Clinical Questionnaire Filling From Human-Machine Interactions | | 0 |
| The Global Banking Standards QA Dataset (GBS-QA) | | 0 |
| A Free Format Legal Question Answering System | | 0 |
| SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map | | 0 |
| A Fact Checking and Verification System for FEVEROUS Using a Zero-Shot Learning Approach | | 0 |
| Diversity and Consistency: Exploring Visual Question-Answer Pair Generation | | 0 |
| Toward Deconfounding the Effect of Entity Demographics for Question Answering Accuracy | | 0 |
| Discourse Comprehension: A Question Answering Framework to Represent Sentence Connections | Code | 0 |
| Learning from Limited Labels for Long Legal Dialogue | | 0 |
| Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation | | 0 |
| KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals | Code | 0 |
| AutoEQA: Auto-Encoding Questions for Extractive Question Answering | | 0 |
| CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization | | 0 |
| A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question Answering | | 0 |
| Incorporating medical knowledge in BERT for clinical relation extraction | | 0 |
| Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading Comprehension | Code | 0 |
| Coupling Context Modeling with Zero Pronoun Recovering for Document-Level Natural Language Generation | Code | 0 |
| Improving Query Graph Generation for Complex Question Answering over Knowledge Base | | 0 |
| Understanding the Extent to which Content Quality Metrics Measure the Information Quality of Summaries | | 0 |
| A Transformer Based Approach towards Identification of Discourse Unit Segments and Connectives | | 0 |
| Q^2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering | | 0 |
| Winnowing Knowledge for Multi-choice Question Answering | | 0 |
| Neural Natural Logic Inference for Interpretable Question Answering | Code | 0 |
| Have You Seen That Number? Investigating Extrapolation in Question Answering Models | | 0 |
| A Multi-label Multi-hop Relation Detection Model based on Relation-aware Sequence Generation | | 0 |
| ConQuest: Contextual Question Paraphrasing through Answer-Aware Synthetic Question Generation | | 0 |
| Textual Time Travel: A Temporally Informed Approach to Theory of Mind | | 0 |
| Using Question Answering Rewards to Improve Abstractive Summarization | Code | 0 |
| Aspect-based Sentiment Analysis in Question Answering Forums | Code | 0 |
| GANDALF: a General Character Name Description Dataset for Long Fiction | | 0 |
| Text Classification for Task-based Source Code Related Questions | | 0 |
| DSC-IITISM at FinCausal 2021: Combining POS tagging with Attention-based Contextual Representations for Identifying Causal Relationships in Financial Documents | | 0 |
| Path-Enhanced Multi-Relational Question Answering with Knowledge Graph Embeddings | | 0 |
| On the Feasibility of Predicting Questions being Forgotten in Stack Overflow | | 0 |
| Learning Representations for Zero-Shot Retrieval over Structured Data | | 0 |
| What makes us curious? analysis of a corpus of open-domain questions | | 0 |
| Multi-stage Clarification in Conversational AI: The case of Question-Answering Dialogue Systems | | 0 |
| Perceptual Score: What Data Modalities Does Your Model Perceive? | Code | 0 |
| SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning | | 0 |
| Ask me in your own words: paraphrasing for multitask question answering | Code | 0 |
| Transferring Domain-Agnostic Knowledge in Video Question Answering | | 0 |
| Alignment Attention by Matching Key and Query Distributions | Code | 0 |
| EDG-Based Question Decomposition for Complex Question Answering over Knowledge Bases | | 0 |
| ListReader: Extracting List-form Answers for Opinion Questions | | 0 |
| Single-Modal Entropy based Active Learning for Visual Question Answering | | 0 |
| Why Settle for Just One? Extending EL++ Ontology Embeddings with Many-to-Many Relationships | | 0 |
| Ensemble ALBERT on SQuAD 2.0 | Code | 0 |
| DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment | Code | 0 |
| Ranking Facts for Explaining Answers to Elementary Science Questions | | 0 |
| Towards Language-guided Visual Recognition via Dynamic Convolutions | Code | 0 |
Page 137 of 217

Benchmark Results

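| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |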