SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
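As a rough illustration of the two metrics named above, the sketch below computes exact match and token-level F1 for a single prediction against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, removing punctuation and English articles, collapsing whitespace); the page does not say which normalization the listed benchmarks use, and the function names `normalize`, `exact_match`, and `f1_score` are illustrative rather than taken from any particular evaluation script.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the commonly used SQuAD-style normalization.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("in Paris, France", "Paris"))
```

Leaderboard figures are usually these per-example scores averaged over a dataset and multiplied by 100. When a question has several reference answers, a common convention is to take the maximum score over the references.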


Papers

Showing 7501-7550 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| PlotQA: Reasoning over Scientific Plots | | 0 |
| Predicting Helpful Posts in Open-Ended Discussion Forums: A Neural Architecture | | 0 |
| Automatic Evaluation of Summary Using Textual Entailment | | 0 |
| An Emotional Comfort Framework for Improving User Satisfaction in E-Commerce Customer Service Chatbots | | 0 |
| Predicting Question Quality on StackOverflow with Neural Networks | | 0 |
| Predicting Relative Depth between Objects from Semantic Features | | 0 |
| Predicting Structures in NLP: Constrained Conditional Models and Integer Linear Programming in NLP | | 0 |
| Advancing Chinese biomedical text mining with community challenges | | 0 |
| A Combined Pattern-based and Distributional Approach for Automatic Hypernym Detection in Dutch. | | 0 |
| Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam | | 0 |
| Predicting the impact of dataset composition on model performance | | 0 |
| (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering | | 0 |
| MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph | | 0 |
| Prediction of the Realisation of an Information Need: An EEG Study | | 0 |
| Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network | | 0 |
| 1-800-SHARED-TASKS at RegNLP: Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering | | 0 |
| Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata | | 0 |
| PREFER: Using a Graph-Based Approach to Generate Paraphrases for Language Learning | | 0 |
| Joint Learning with Global Inference for Comment Classification in Community Question Answering | | 0 |
| Joint learning of object graph and relation graph for visual question answering | | 0 |
| DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure | | 0 |
| PreSTU: Pre-Training for Scene-Text Understanding | | 0 |
| Joint Learning of Entity Linking Constraints Using a Markov-Logic Network | | 0 |
| Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs | | 0 |
| Joint Learning of a Dual SMT System for Paraphrase Generation | | 0 |
| Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach | | 0 |
| Data-Efficient French Language Modeling with CamemBERTa | | 0 |
| Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain | | 0 |
| Joint Inference for Heterogeneous Dependency Parsing | | 0 |
| Joint Inference for Fine-grained Opinion Extraction | | 0 |
| Pre-training for Information Retrieval: Are Hyperlinks Fully Explored? | | 0 |
| Pre-training image-language transformers for open-vocabulary tasks | | 0 |
| Data-Efficient Autoregressive Document Retrieval for Fact Verification | | 0 |
| Pre-training Language Models with Deterministic Factual Knowledge | | 0 |
| Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks | | 0 |
| Joint Image Captioning and Question Answering | | 0 |
| Joint Event Trigger Identification and Event Coreference Resolution with Structured Perceptron | | 0 |
| Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | | 0 |
| Pre-training Universal Language Representation | | 0 |
| Joint Event Extraction along Shortest Dependency Paths using Graph Convolutional Networks | | 0 |
| Pretrain Knowledge-Aware Language Models | | 0 |
| PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering | | 0 |
| Joint Entity Recognition and Disambiguation | | 0 |
| Automatic Coupling of Answer Extraction and Information Retrieval | | 0 |
| Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components | | 0 |
| JoBimText Visualizer: A Graph-based Approach to Contextualizing Distributional Similarity | | 0 |
| Data augmentation techniques for the Video Question Answering task | | 0 |
| Automatic Compound Processing: Compound Splitting and Semantic Analysis for Afrikaans and Dutch | | 0 |
| Advances in Natural Language Question Answering: A Review | | 0 |
| Data Augmentation for Visual Question Answering | | 0 |

Benchmark Results

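| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |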