SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 64766500 of 10817 papers

TitleStatusHype
Mongolian Questions Classification Based on Mulit-Head Attention0
Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models0
Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases0
A Survey on Knowledge-Oriented Retrieval-Augmented Generation0
How Context Affects Language Models' Factual Predictions0
Monolingual Social Media Datasets for Detecting Contradiction and Entailment0
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation0
A Multi-Source Retrieval Question Answering Framework Based on RAG0
How Can Objects Help Video-Language Understanding?0
Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling0
A Survey on Knowledge Graph Embeddings with Literals: Which model links better Literal-ly?0
Accounting for Focus Ambiguity in Visual Questions0
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering0
Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing0
Morpho-syntactic Lexical Generalization for CCG Semantic Parsing0
Morpho-Syntactic Study of Errors from Speech Recognition System0
A Survey on Graph Neural Networks for Knowledge Graph Completion0
MoRS at SemEval-2017 Task 3: Easy to use SVM in Ranking Tasks0
MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles0
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision0
MOSMOS: Multi-organ segmentation facilitated by medical report supervision0
Motion-Appearance Co-Memory Networks for Video Question Answering0
HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions0
Contextual Code Switching for Machine Translation using Language Models0
A Multi-Resolution Word Embedding for Document Retrieval from Large Unstructured Knowledge Bases0
Show:102550
← PrevPage 260 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified