SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
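As a rough illustration of the two metrics named above, the following is a minimal sketch of SQuAD-style exact match and token-level F1 for a single prediction. The function names (`normalize`, `exact_match`, `f1_score`) and the normalization steps (lowercasing, stripping punctuation and English articles) are assumptions modeled on common practice for extractive QA; individual benchmarks may define the metrics differently.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under these assumptions, a dataset-level score would typically be the mean of the per-example values, multiplied by 100 when reported as a percentage. When a question has several reference answers, a common convention is to take the maximum score over the references.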


Papers

Showing 7251–7300 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| PaLI: A Jointly-Scaled Multilingual Language-Image Model | | 0 |
| GUITAR: Gradient Pruning toward Fast Neural Ranking | | 0 |
| PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning | | 0 |
| PaLM 2 Technical Report | | 0 |
| Guiding Visual Question Answering with Attention Priors | | 0 |
| A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection | | 0 |
| Phase Conductor on Multi-layered Attentions for Machine Comprehension | | 0 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | | 0 |
| Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting | | 0 |
| PALRACE: Reading Comprehension Dataset with Human Data and Labeled Rationales | | 0 |
| PAM: Understanding Product Images in Cross Product Category Attribute Extraction | | 0 |
| A Study of the Importance of External Knowledge in the Named Entity Recognition Task | | 0 |
| Pangloss: Fast Entity Linking in Noisy Text Environments | | 0 |
| Perspective Transition of Large Language Models for Solving Subjective Tasks | | 0 |
| Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning | | 0 |
| PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing | | 0 |
| Guess What: A Question Answering Game via On-demand Knowledge Validation | | 0 |
| Amrita_CEN at SemEval-2016 Task 1: Semantic Relation from Word Embeddings in Higher Dimension | | 0 |
| Perturbation-based Active Learning for Question Answering | | 0 |
| PaperQA: Retrieval-Augmented Generative Agent for Scientific Research | | 0 |
| Exploiting Rich Syntax for Better Knowledge Base Question Answering | | 0 |
| Philosophers are Mortal: Inferring the Truth of Unseen Facts | | 0 |
| PAQA: Toward ProActive Open-Retrieval Question Answering | | 0 |
| ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs | | 0 |
| PARADIGM: Paraphrase Diagnostics through Grammar Matching | | 0 |
| Guess Me if You Can: Acronym Disambiguation for Enterprises | | 0 |
| Conditional Generation with a Question-Answering Blueprint | | 0 |
| ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing | | 0 |
| Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost | | 0 |
| GTR-LSTM: A Triple Encoder for Sentence Generation from RDF Data | | 0 |
| Parallelizing Word2Vec in Shared and Distributed Memory | | 0 |
| Parallel Key-Value Cache Fusion for Position Invariant RAG | | 0 |
| PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation? | | 0 |
| Parameter-Efficient Abstractive Question Answering over Tables and over Text | | 0 |
| Exploiting User Search Sessions for the Semantic Categorization of Question-like Informational Search Queries | | 0 |
| CFO: A Framework for Building Production NLP Systems | | 0 |
| A Study of the Effect of Resolving Negation and Sentiment Analysis in Recognizing Text Entailment for Arabic | | 0 |
| GTR: Graph-Table-RAG for Cross-Table Question Answering | | 0 |
| Parameter-Efficient Neural Question Answering Models via Graph-Enriched Document Representations | | 0 |
| gTBLS: Generating Tables from Text by Conditional Question Answering | | 0 |
| Parameter-free Video Segmentation for Vision and Language Understanding | | 0 |
| Paraphrase-Driven Learning for Open Question Answering | | 0 |
| Paraphrase for Open Question Answering: New Dataset and Methods | | 0 |
| Paraphrase Generation from Latent-Variable PCFGs for Semantic Parsing | | 0 |
| GSQA: An End-to-End Model for Generative Spoken Question Answering | | 0 |
| Paraphrasing in Affirmative Terms Improves Negation Understanding | | 0 |
| G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge for Commonsense Reasoning | | 0 |
| Paraphrasing with Large Language Models | | 0 |
| Paraphrastic Variance between European and Brazilian Portuguese | | 0 |
| AMR Beyond the Sentence: the Multi-sentence AMR corpus | | 0 |
Page 146 of 217

Benchmark Results

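| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |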