SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 73017325 of 10817 papers

TitleStatusHype
ParaQuery: Making Sense of Paraphrase Collections0
Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison0
Exploring and Evaluating Personalized Models for Code Generation0
Chain Based RNN for Relation Classification0
Guess What: A Question Answering Game via On-demand Knowledge Validation0
Parsing with Traces: An O(n4) Algorithm and a Structural Representation0
Amrita\_CEN at SemEval-2016 Task 1: Semantic Relation from Word Embeddings in Higher Dimension0
Guess Me if You Can: Acronym Disambiguation for Enterprises0
Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-hop Inference0
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery0
Conditional Generation with a Question-Answering Blueprint0
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost0
Passage Segmentation of Documents for Extractive Question Answering0
Exploring Diverse Expressions for Paraphrase Generation0
Patch-level Sounding Object Tracking for Audio-Visual Question Answering0
Exploring Diverse Methods in Visual Question Answering0
GTR-LSTM: A Triple Encoder for Sentence Generation from RDF Data0
Pathological Visual Question Answering0
A Study of the Effect of Resolving Negation and Sentiment Analysis in Recognizing Text Entailment for Arabic0
PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering0
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback0
PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks0
GTR: Graph-Table-RAG for Cross-Table Question Answering0
gTBLS: Generating Tables from Text by Conditional Question Answering0
GSQA: An End-to-End Model for Generative Spoken Question Answering0
Show:102550
← PrevPage 293 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified