SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 70767100 of 10817 papers

TitleStatusHype
On the effectiveness of small, discriminatively pre-trained language representation models for biomedical text mining0
On the Effects of Video Grounding on Language Models0
Knowledge Base Completion using Web-Based Question Answering and Multimodal Fusion0
deepCybErNet at EmoInt-2017: Deep Emotion Intensities in Tweets0
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables0
Knowledge-Aware Language Model Pretraining0
Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions0
On-the-fly Text Retrieval for End-to-End ASR Adaptation0
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering0
Knowledge-Aware Iterative Retrieval for Multi-Agent Systems0
On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models0
Knowledge-Aware Diverse Reranking for Cross-Source Question Answering0
Deep Contextualized Biomedical Abbreviation Expansion0
AutoRev: Automatic Peer Review System for Academic Research Papers0
Knowledge-Augmented Language Models Interpreting Structured Chest X-Ray Findings0
Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering0
On the Importance of Diversity in Question Generation for QA0
On the importance of pre-training data volume for compact language models0
Knowledge as A Bridge: Improving Cross-domain Answer Selection with External Knowledge0
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept0
On the Limitations of Vision-Language Models in Understanding Image Transforms0
Deep Cognitive Reasoning Network for Multi-hop Question Answering over Knowledge Graphs0
Knowledge AI: Fine-tuning NLP Models for Facilitating Scientific Knowledge Extraction and Understanding0
Knowledge-Aided Open-Domain Question Answering0
Knowledge Acquisition for Visual Question Answering via Iterative Querying0
Show:102550
← PrevPage 284 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified