SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 61516200 of 10817 papers

TitleStatusHype
Grounded Graph Decoding Improves Compositional Generalization in Question AnsweringCode0
Medicines Question Answering System, MeQA0
Reducing the impact of out of vocabulary words in the translation of natural language questions into SPARQL queries0
SERC: Syntactic and Semantic Sequence based Event Relation Classification0
UQuAD1.0: Development of an Urdu Question Answering Training Data for Machine Reading Comprehension0
Callee: Recovering Call Graphs for Binaries with Transfer and Contrastive LearningCode1
ViVQA: Vietnamese Visual Question AnsweringCode1
Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures0
Multi-Domain Multilingual Question AnsweringCode1
ConQuest: Contextual Question Paraphrasing through Answer-Aware Synthetic Question Generation0
Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization0
A Fact Checking and Verification System for FEVEROUS Using a Zero-Shot Learning Approach0
The Global Banking Standards QA Dataset (GBS-QA)0
Evaluation Paradigms in Question Answering0
Q^2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering0
What’s in a Name? Answer Equivalence For Open-Domain Question AnsweringCode0
Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence AnnotationCode1
MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering DatasetCode1
Have You Seen That Number? Investigating Extrapolation in Question Answering Models0
What’s in Your Head? Emergent Behaviour in Multi-Task Transformer Models0
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval0
Toward Deconfounding the Effect of Entity Demographics for Question Answering Accuracy0
Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap0
SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map0
Eliciting Bias in Question Answering Models through AmbiguityCode0
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak DecoderCode1
Narrative Embedding: Re-Contextualization Through Attention0
Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language ModelsCode1
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization0
Incorporating medical knowledge in BERT for clinical relation extraction0
Coupling Context Modeling with Zero Pronoun Recovering for Document-Level Natural Language GenerationCode0
Improving Query Graph Generation for Complex Question Answering over Knowledge Base0
GANDALF: a General Character Name Description Dataset for Long Fiction0
Neural Natural Logic Inference for Interpretable Question AnsweringCode0
Zero-Shot Clinical Questionnaire Filling From Human-Machine Interactions0
Adapting Entities across Languages and Cultures0
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question AnsweringCode0
Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading ComprehensionCode0
Can NLI Models Verify QA Systems’ Predictions?Code1
A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base0
Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation0
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringCode0
Diversity and Consistency: Exploring Visual Question-Answer Pair Generation0
KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple SubgoalsCode0
AutoEQA: Auto-Encoding Questions for Extractive Question Answering0
A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question Answering0
Using Question Answering Rewards to Improve Abstractive SummarizationCode0
A Multi-label Multi-hop Relation Detection Model based on Relation-aware Sequence Generation0
Textual Time Travel: A Temporally Informed Approach to Theory of Mind0
Aspect-based Sentiment Analysis in Question Answering ForumsCode0
Show:102550
← PrevPage 124 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified