SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 96519700 of 10817 papers

TitleStatusHype
Learned in Translation: Contextualized Word VectorsCode0
Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-TuningCode0
BEEDS: Large-Scale Biomedical Event Extraction using Distant Supervision and Question AnsweringCode0
Learning a Cost-Effective Annotation Policy for Question AnsweringCode0
Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning TaskCode0
Analyzing Social Biases in Japanese Large Language ModelsCode0
PlainQAFact: Automatic Factuality Evaluation Metric for Biomedical Plain Language Summaries GenerationCode0
Learning a Natural Language Interface with Neural ProgrammerCode0
Comparative Study of Machine Learning Models and BERT on SQuADCode0
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented GenerationCode0
Multi-step Retriever-Reader Interaction for Scalable Open-domain Question AnsweringCode0
Compact Trilinear Interaction for Visual Question AnsweringCode0
Question Answering through Transfer Learning from Large Fine-grained Supervision DataCode0
FedQAS: Privacy-aware machine reading comprehension with federated learningCode0
CommVQA: Situating Visual Question Answering in Communicative ContextsCode0
Multi-Target Embodied Question AnsweringCode0
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question AnsweringCode0
Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language ModelsCode0
Multi-tasking Dialogue Comprehension with Discourse ParsingCode0
Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language ReasoningCode0
Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge BaseCode0
Multi-task Learning with Sample Re-weighting for Machine Reading ComprehensionCode0
p-Laplacian Adaptation for Generative Pre-trained Vision-Language ModelsCode0
Learning Conditioned Graph Structures for Interpretable Visual Question AnsweringCode0
BatchPrompt: Accomplish more with lessCode0
Learning content and context with language bias for Visual Question AnsweringCode0
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent VariableCode0
Multi-Task Learning with Multi-View Attention for Answer Selection and Knowledge Base Question AnsweringCode0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing TasksCode0
Plausible Extractive Rationalization through Semi-Supervised Entailment SignalCode0
Robust Data Watermarking in Language Models by Injecting Fictitious KnowledgeCode0
MULTITAT: Benchmarking Multilingual Table-and-Text Question AnsweringCode0
Playing Text-Adventure Games with Graph-Based Deep Reinforcement LearningCode0
FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated Knowledge InjectionCode0
Federated Document Visual Question Answering: A Pilot StudyCode0
Learning Distributed Representations of Texts and Entities from Knowledge BaseCode0
Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question AnsweringCode0
Analyzing Semantic Faithfulness of Language Models via Input Intervention on Question AnsweringCode0
Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific RewardsCode0
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERTCode0
Barlow constrained optimization for Visual Question AnsweringCode0
BAND: Biomedical Alert News DatasetCode0
Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural NetworksCode0
Fast Reading Comprehension with ConvNetsCode0
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero TrainingCode0
Fast Linear Model for Knowledge Graph EmbeddingsCode0
Multi-View Graph Representation Learning for Answering Hybrid Numerical Reasoning QuestionCode0
Learning from Lexical Perturbations for Consistent Visual Question AnsweringCode0
Multi-VQG: Generating Engaging Questions for Multiple ImagesCode0
Community Question Answering Entity Linking via Leveraging Auxiliary DataCode0
Show:102550
← PrevPage 194 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified