SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 48014825 of 10817 papers

TitleStatusHype
BinaryVQA: A Versatile Test Set to Evaluate the Out-of-Distribution Generalization of VQA ModelsCode0
ACL-Fig: A Dataset for Scientific Figure Classification0
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation0
ThoughtSource: A central hub for large language model reasoning dataCode3
A Comparative Study of Pretrained Language Models for Long Clinical TextCode1
Graph Attention with Hierarchies for Multi-hop Question Answering0
Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering0
ViDeBERTa: A powerful pre-trained language model for VietnameseCode1
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language ModelsCode0
Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute0
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction0
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and DevelopmentCode2
HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images0
Ensemble Transfer Learning for Multilingual Coreference Resolution0
Champion Solution for the WSDM2023 Toloka VQA ChallengeCode3
Weakly-Supervised Questions for Zero-Shot Relation ExtractionCode0
Rationalization for Explainable NLP: A Survey0
Reversing The Twenty Questions Game0
Temporal Perceiving Video-Language Pre-training0
Towards Models that Can See and Read0
Curriculum Script Distillation for Multilingual Visual Question Answering0
SlideVQA: A Dataset for Document Visual Question Answering on Multiple ImagesCode1
Explaining ELH Concept Descriptions through Counterfactual Reasoning0
Semantic Web Enabled Geographic Question Answering Framework: GeoTR0
Multimodal Inverse Cloze Task for Knowledge-based Visual Question AnsweringCode1
Show:102550
← PrevPage 193 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified