SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
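As a rough illustration of the two metrics named above, the sketch below computes exact match and token-level F1 for a single prediction against a single reference answer. It assumes the normalization commonly used in SQuAD-style evaluation (lowercasing, stripping punctuation and the articles "a", "an", "the", collapsing whitespace). The function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative and not taken from this page; individual benchmarks may define the metrics differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style scoring; other datasets may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("in Paris France", "Paris"))
```

When a question has several acceptable reference answers, a common convention is to score the prediction against each reference and keep the maximum, then average over all questions to obtain the dataset-level EM and F1.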


Papers

Showing 5801-5850 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Divide & Conquer for Entailment-aware Multi-hop Evidence Retrieval | - | 0 |
| Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual | - | 0 |
| Learn to Unlearn: Meta-Learning-Based Knowledge Graph Embedding Unlearning | - | 0 |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | - | 0 |
| LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs | - | 0 |
| Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents | - | 0 |
| LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain | - | 0 |
| Linking, Searching, and Visualizing Entities in Wikipedia | - | 0 |
| BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph Construction and Analysis | - | 0 |
| LIORI at SemEval-2021 Task 2: Span Prediction and Binary Classification approaches to Word-in-Context Disambiguation | - | 0 |
| A Factoid Question Answering System for Vietnamese | - | 0 |
| LIPN-CORE: Semantic Text Similarity using n-grams, WordNet, Syntactic Analysis, ESA and Information Retrieval based Features | - | 0 |
| A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions | - | 0 |
| 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | - | 0 |
| Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations | - | 0 |
| Listening to the Wise Few: Select-and-Copy Attention Heads for Multiple-Choice QA | - | 0 |
| Learning Word Embeddings for Low-Resource Languages by PU Learning | - | 0 |
| LIST-LUX: Disorder Identification from Clinical Texts | - | 0 |
| Learning with Limited Data for Multilingual Reading Comprehension | - | 0 |
| LiT-4-RSVQA: Lightweight Transformer-based Visual Question Answering in Remote Sensing | - | 0 |
| Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA | - | 0 |
| Diversity Enhances an LLM's Performance in RAG and Long-context Task | - | 0 |
| Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data | - | 0 |
| Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision | - | 0 |
| Diversity and Consistency: Exploring Visual Question-Answer Pair Generation | - | 0 |
| BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP | - | 0 |
| An Interactive Web-Interface for Visualizing the Inner Workings of the Question Answering LSTM | - | 0 |
| LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval | - | 0 |
| Learning Visual Knowledge Memory Networks for Visual Question Answering | - | 0 |
| LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering | - | 0 |
| Learning Verb Inference Rules from Linguistically-Motivated Evidence | - | 0 |
| LK2022 at Qur’an QA 2022: Simple Transformers Model for Finding Answers to Questions from Qur’an | - | 0 |
| Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering | - | 0 |
| Learning Transferable Representation for Bilingual Relation Extraction via Convolutional Neural Networks | - | 0 |
| Bengali Question Classification: Towards Developing QA System | - | 0 |
| Learning Trajectory-Word Alignments for Video-Language Tasks | - | 0 |
| Learning to Transpile AMR into SPARQL | - | 0 |
| Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer | - | 0 |
| Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation | - | 0 |
| Learning to Transpile AMR into SPARQL | - | 0 |
| Learning to Translate for Multilingual Question Answering | - | 0 |
| LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning | - | 0 |
| Diversifying Question Generation over Knowledge Base via External Natural Questions | - | 0 |
| An Interactive System for Exploring Community Question Answering Forums | - | 0 |
| A Fact Checking and Verification System for FEVEROUS Using a Zero-Shot Learning Approach | - | 0 |
| Learning to Temporally Order Medical Events in Clinical Text | - | 0 |
| Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions | - | 0 |
| Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs | - | 0 |
| Learning to Summarise Related Sentences | - | 0 |
| Learning to Specialize with Knowledge Distillation for Visual Question Answering | - | 0 |

Benchmark Results

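| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | - | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | - | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | - | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | - | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | - | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | - | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | - | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | - | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | - | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | - | Unverified |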