SOTAVerified

Question Answering

Question answering can be divided into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
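As a rough illustration of the two metrics named above, the sketch below computes exact match and token-level F1 for a single prediction against a single reference answer. The normalization steps (lowercasing, stripping punctuation and English articles, collapsing whitespace) follow the convention commonly used for SQuAD-style evaluation, but they are an assumption here; individual benchmarks may normalize differently or score against several references. The function names are hypothetical and are not taken from any benchmark's official script.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumed normalization, modeled on common SQuAD-style evaluation.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1: harmonic mean of precision and recall over shared tokens."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

For example, the prediction "Eiffel Tower in Paris" against the reference "the Eiffel Tower" scores 0 on EM but about 0.67 on F1 (precision 2/4, recall 2/2), which is why the two metrics are usually reported together.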

Papers

Showing 7501–7525 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| 1-800-SHARED-TASKS at RegNLP: Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering | | 0 |
| Predicting Helpful Posts in Open-Ended Discussion Forums: A Neural Architecture | | 0 |
| Joint Learning with Global Inference for Comment Classification in Community Question Answering | | 0 |
| Joint learning of object graph and relation graph for visual question answering | | 0 |
| Predicting Question Quality on StackOverflow with Neural Networks | | 0 |
| Predicting Relative Depth between Objects from Semantic Features | | 0 |
| Predicting Structures in NLP: Constrained Conditional Models and Integer Linear Programming in NLP | | 0 |
| DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure | | 0 |
| Joint Learning of Entity Linking Constraints Using a Markov-Logic Network | | 0 |
| Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam | | 0 |
| Predicting the impact of dataset composition on model performance | | 0 |
| Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs | | 0 |
| Joint Learning of a Dual SMT System for Paraphrase Generation | | 0 |
| Prediction of the Realisation of an Information Need: An EEG Study | | 0 |
| Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach | | 0 |
| Data-Efficient French Language Modeling with CamemBERTa | | 0 |
| Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata | | 0 |
| PREFER: Using a Graph-Based Approach to Generate Paraphrases for Language Learning | | 0 |
| Joint Inference for Heterogeneous Dependency Parsing | | 0 |
| Joint Inference for Fine-grained Opinion Extraction | | 0 |
| Data-Efficient Autoregressive Document Retrieval for Fact Verification | | 0 |
| PreSTU: Pre-Training for Scene-Text Understanding | | 0 |
| Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks | | 0 |
| Joint Image Captioning and Question Answering | | 0 |
| Joint Event Trigger Identification and Event Coreference Resolution with Structured Perceptron | | 0 |
Page 301 of 433

Benchmark Results

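| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |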