SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 55015550 of 10817 papers

TitleStatusHype
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing0
A Novel Word Sense Disambiguation Approach Using WordNet Knowledge Graph0
A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology0
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets0
Dynamic Integration of Background Knowledge in Neural NLU Systems0
Dynamic Graph Generation Network: Generating Relational Knowledge from Diagrams0
A novel knowledge graph development for industry design: A case study on indirect coal liquefaction process0
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering0
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering0
Dynamic Few-Shot Learning for Knowledge Graph Question Answering0
A novel interface for adversarial trivia question-writing0
A Context-aware Attention Network for Interactive Question Answering0
Dynamic Adaptive Network Intelligence0
A Novel Framework for Robustness Analysis of Visual QA Models0
BioAMA: Towards an End to End BioMedical Question Answering System0
DUT-NLP at MEDIQA 2019: An Adversarial Multi-Task Network to Jointly Model Recognizing Question Entailment and Question Answering0
DUTH at SemEval-2019 Task 8: Part-Of-Speech Features for Question Classification0
Binding Touch to Everything: Learning Unified Multimodal Tactile Representations0
A Novel Discriminative Framework for Sentence-Level Discourse Analysis0
A Full End-to-End Semantic Role Labeler, Syntactic-agnostic Over Syntactic-aware?0
DUT-BIM at MEDIQA 2019: Utilizing Transformer Network and Medical Domain-Specific Contextualized Representations for Question Answering0
Duplicate Question Retrieval and Confirmation Time Prediction in Software Communities0
DUBLIN -- Document Understanding By Language-Image Network0
A Novel DeBERTa-based Model for Financial Question Answering Task0
DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning0
Bilingually-constrained Synthetic Data for Implicit Discourse Relation Recognition0
A Novel Computational and Modeling Foundation for Automatic Coherence Assessment0
A Frustratingly Easy Improvement for Position Embeddings via Random Padding0
A Constituent Syntactic Parse Tree Based Discourse Parser0
Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering0
DualNet: Domain-Invariant Network for Visual Question Answering0
Bilingual Event Extraction: a Case Study on Trigger Type Determination0
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems0
Bilingual Active Learning for Relation Classification via Pseudo Parallel Corpora0
A Novel Attention-based Aggregation Function to Combine Vision and Language0
Dual Embeddings and Metrics for Relational Similarity0
Dual Constrained Question Embeddings with Relational Knowledge Bases for Simple Question Answering0
Dual Co-Matching Network for Multi-choice Reading Comprehension0
BigText-QA: Question Answering over a Large-Scale Hybrid Knowledge Graph0
A Novel Architecture For Question Classification Based Indexing Scheme For Efficient Question Answering0
Dual Capsule Attention Mask Network with Mutual Learning for Visual Question Answering0
Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants0
A Novel Approach Towards Incorporating Context Processing Capabilities in NLIDB System0
Bidirectional Long Short-Term Memory Networks for Relation Classification0
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines0
Bidirectional Language Models Are Also Few-shot Learners0
A Novel Approach for Automatic Bengali Question Answering System using Semantic Similarity Analysis0
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset0
A Constituent-Centric Neural Architecture for Reading Comprehension0
DSNDM: Deep Siamese Neural Discourse Model with Attention for Text Pairs Categorization and Ranking0
Show:102550
← PrevPage 111 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified