SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
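As a rough illustration of the two metrics named above, here is a minimal sketch of EM and token-level F1 for a single prediction against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, dropping punctuation and English articles); the function names are illustrative and not taken from any particular evaluation script, and individual benchmarks may normalize differently or take the maximum over several reference answers.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, strip punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the common SQuAD-style normalization.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

For example, `exact_match("The Eiffel Tower", "eiffel tower.")` scores a match because articles, case, and punctuation are normalized away, while `f1_score("in Paris France", "Paris")` gives partial credit for the one overlapping token. Leaderboards typically average these per-example scores over the dataset and report them as percentages.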


Papers

Showing 6751-6800 of 10817 papers

Title | Status | Hype
RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering | - | 0
Cross-Task Generalization via Natural Language Crowdsourcing Instructions | - | 0
Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | - | 0
Parameter-Efficient Abstractive Question Answering over Tables and over Text | - | 0
Reasoning over Hybrid Chain for Table-and-Text Open Domain Question Answering | - | 0
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions | - | 0
CQARE: Contrastive Question-Answering for Few-shot Relation Extraction with Prompt Tuning | - | 0
Co-VQA : Answering by Interactive Sub Question Sequence | - | 0
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment | - | 0
TABi: Type-Aware Bi-encoders for End-to-End Entity Retrieval | - | 0
Hyperlink-induced Pre-training for Passage Retrieval of Open-domain Question Answering | - | 0
Context-Paraphrase Enhanced Commonsense Question Answering | - | 0
Question-Led Semantic Structure Enhanced Attentions for VQA | - | 0
Get Your Model Puzzled: Introducing Crossword-Solving as a New NLP Benchmark | - | 0
GenRE: A Generative Model for Relation Extraction | - | 0
Ask Me Anything in Your Native Language | - | 0
Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection | - | 0
Calculating Question Similarity is Enough: A New Method for KBQA Tasks | - | 0
Question Answering for Complex Electronic Health Records Database using Unified Encoder-Decoder Architecture | - | 0
A Chinese Multi-type Complex Questions Answering Dataset over Wikidata | - | 0
Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture | - | 0
Prune Once for All: Sparse Pre-Trained Language Models | - | 0
Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding | - | 0
A Two-Stage Approach towards Generalization in Knowledge Base Question Answering | - | 0
Pre-trained Transformer-Based Approach for Arabic Question Answering : A Comparative Study | - | 0
Recent Advances in Automated Question Answering In Biomedical Domain | - | 0
MNet-Sim: A Multi-layered Semantic Similarity Network to Evaluate Sentence Similarity | - | 0
Ontology-based question answering over corporate structured data | - | 0
Visual Question Answering based on Formal Logic | - | 0
Grounded Graph Decoding Improves Compositional Generalization in Question Answering | Code | 0
Medicines Question Answering System, MeQA | - | 0
Reducing the impact of out of vocabulary words in the translation of natural language questions into SPARQL queries | - | 0
SERC: Syntactic and Semantic Sequence based Event Relation Classification | - | 0
UQuAD1.0: Development of an Urdu Question Answering Training Data for Machine Reading Comprehension | - | 0
Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization | - | 0
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains | Code | 0
Adapting Entities across Languages and Cultures | - | 0
Q^2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering | - | 0
Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures | - | 0
Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading Comprehension | Code | 0
Textual Time Travel: A Temporally Informed Approach to Theory of Mind | - | 0
Relation-aware Bidirectional Path Reasoning for Commonsense Question Answering | - | 0
Evaluation Paradigms in Question Answering | - | 0
Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap | - | 0
Can predicate-argument relationships be extracted from UD trees? | - | 0
A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base | - | 0
Using Question Answering Rewards to Improve Abstractive Summarization | Code | 0
Winnowing Knowledge for Multi-choice Question Answering | - | 0
Narrative Embedding: Re-Contextualization Through Attention | - | 0
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | - | Unverified
2 | FPNet (ensemble) | EM | 90.87 | - | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | - | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | - | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | - | Unverified
6 | FPNet (ensemble) | EM | 90.6 | - | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | - | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | - | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | - | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | - | Unverified