SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 54765500 of 10817 papers

TitleStatusHype
SamToNe: Improving Contrastive Loss for Dual Encoder Retrieval Models with Same Tower Negatives0
Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes0
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
A Mathematical Abstraction for Balancing the Trade-off Between Creativity and Reality in Large Language Models0
A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers0
Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection0
KL-Divergence Guided Temperature SamplingCode0
Data-Efficient French Language Modeling with CamemBERTa0
LiT-4-RSVQA: Lightweight Transformer-based Visual Question Answering in Remote Sensing0
Evaluating the Capabilities of Multi-modal Reasoning Models with Synthetic Task Data0
Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes0
Reimagining Retrieval Augmented Language Models for Answering Queries0
Overcoming Language Bias in Remote Sensing Visual Question Answering via Adversarial Training0
Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models0
Building Extractive Question Answering System to Support Human-AI Health Coaching Model for Sleep Domain0
Knowledge Base Question Answering for Space Debris QueriesCode0
Attention-Based Methods For Audio Question Answering0
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA0
KEYword based Sampling (KEYS) for Large Language Models0
Cross Encoding as Augmentation: Towards Effective Educational Text Classification0
A template-independent approach for information extraction in real estate documentsCode0
Concise Answers to Complex Questions: Summarization of Long-form AnswersCode0
Graph Reasoning for Question Answering with Triplet Retrieval0
GPT4GEO: How a Language Model Sees the World's Geography0
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge0
Show:102550
← PrevPage 220 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified