SOTAVerified

Open-Domain Question Answering

Open-domain question answering is the task of question answering on open-domain datasets such as Wikipedia.

Papers

Showing 151200 of 494 papers

TitleStatusHype
RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question AnsweringCode0
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question AnsweringCode0
Benchmarking LLM-based Relevance Judgment MethodsCode0
Re-Examining Calibration: The Case of Question AnsweringCode0
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-rankingCode0
Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge BaseCode0
Cross-Pair Text Representations for Answer Sentence SelectionCode0
Allies: Prompting Large Language Model with Beam SearchCode0
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented GenerationCode0
Scaling Reasoning can Improve Factuality in Large Language ModelsCode0
Semi-Parametric Retrieval via Binary Bag-of-Tokens IndexCode0
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase IndexCode0
Attacking Open-domain Question Answering by Injecting MisinformationCode0
Ranking Paragraphs for Improving Answer Recall in Open-Domain Question AnsweringCode0
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language ModelsCode0
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMsCode0
Analysing the Robustness of Dual Encoders for Dense Retrieval Against MisspellingsCode0
R^3: Reinforced Reader-Ranker for Open-Domain Question AnsweringCode0
Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic SupervisionCode0
Fine-tune the Entire RAG Architecture (including DPR retriever) for Question-AnsweringCode0
Combining Lexical and Dense Retrieval for Computationally Efficient Multi-hop Question AnsweringCode0
Passage-specific Prompt Tuning for Passage Reranking in Question Answering with Large Language ModelsCode0
Few-Shot Multilingual Open-Domain QA from 5 ExamplesCode0
RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long DocumentsCode0
Sequence to sequence pretraining for a less-resourced Slovenian languageCode0
Open-Domain Event Detection using Distant SupervisionCode0
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited QuestionsCode0
Open-Domain Question-Answering for COVID-19 and Other Emergent DomainsCode0
Exploiting Sentence-Level Representations for Passage RankingCode0
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of ReferencesCode0
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise TasksCode0
Evidence Sentence Extraction for Machine Reading ComprehensionCode0
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question AnsweringCode0
Neural Arabic Question AnsweringCode0
Closed-book Question Generation via Contrastive LearningCode0
Neural Natural Logic Inference for Interpretable Question AnsweringCode0
Multi-step Retriever-Reader Interaction for Scalable Open-domain Question AnsweringCode0
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question AnsweringCode0
Entropy-Based Decoding for Retrieval-Augmented Large Language ModelsCode0
MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQLCode0
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question AnsweringCode0
NIR-Prompt: A Multi-task Generalized Neural Information Retrieval Training FrameworkCode0
Open Domain Question Answering over Tables via Dense RetrievalCode0
Monolingual or Multilingual Instruction Tuning: Which Makes a Better AlpacaCode0
End-to-End Open-Domain Question Answering with BERTseriniCode0
Challenges in Generalization in Open Domain Question AnsweringCode0
Dual-Encoders for Extreme Multi-Label ClassificationCode0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionCode0
Multi-Hop Paragraph Retrieval for Open-Domain Question AnsweringCode0
Language Models as Knowledge Bases?Code0
Show:102550
← PrevPage 4 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1somebodyKILT-RL2.62Unverified
2WikipediaKILT-RL2.46Unverified
3arxiv.org/abs/2103.06332KILT-RL2.36Unverified
4BART + DPRKILT-RL1.9Unverified
5RAGKILT-RL1.69Unverified
6Training Set Retrieval (top 1)KILT-RL0Unverified
7T5-baseKILT-RL0Unverified
8Input CopyingKILT-RL0Unverified
9SphereKILT-RL0Unverified
10Random Training Set AnswerKILT-RL0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM43.56Unverified
2intersectKILT-EM38.78Unverified
3KGI_0KILT-EM36.36Unverified
4WikipediaKILT-EM35.32Unverified
5RAGKILT-EM32.69Unverified
6BERT + DPRKILT-EM31.99Unverified
7BART + DPRKILT-EM30.06Unverified
8Multitask DPR + BARTKILT-EM29.09Unverified
9Multi-task DPRKILT-EM0Unverified
10SphereKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM57.91Unverified
2intersectKILT-EM50.56Unverified
3WikipediaKILT-EM45.55Unverified
4KGI_0KILT-EM42.85Unverified
5Multitask DPR + BARTKILT-EM42.36Unverified
6RAGKILT-EM38.13Unverified
7BERT + DPRKILT-EM34.48Unverified
8BART + DPRKILT-EM31.4Unverified
9TABiKILT-EM0Unverified
10T5-baseKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1intersectKILT-EM18.06Unverified
2WikipediaKILT-EM11.71Unverified
3Multitask DPR + BARTKILT-EM9.53Unverified
4RAGKILT-EM3.21Unverified
5BART + DPRKILT-EM1.96Unverified
6BERT + DPRKILT-EM0.74Unverified
7SphereKILT-EM0Unverified
8Multi-task DPRKILT-EM0Unverified
9GENREKILT-EM0Unverified
10chriskueiKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1SpanBERTF184.8Unverified
2Cluster-Former (#C=512)EM68Unverified
3Locality-Sensitive HashingEM66Unverified
4Multi-passage BERTEM65.1Unverified
5Sparse AttentionEM64.7Unverified
6DECAPROPEM62.2Unverified
7Bi-Attention + DCU-LSTMN-gram F159.5Unverified
8Denoising QAEM58.8Unverified
9DecaPropEM56.8Unverified
10AMANDAN-gram F156.6Unverified
#ModelMetricClaimedVerifiedStatus
1Fourier TransformerRouge-L26.9Unverified
2QGRouge-L26.4Unverified
3BARTRouge-L24.3Unverified
4E-MCARouge-L24Unverified
5Transformer Multitask + LayerDropRouge-L23.4Unverified
6Multi-InrerleaveRouge-L14.63Unverified
#ModelMetricClaimedVerifiedStatus
1Evidence Aggregation via R^3 Re-RankingEM (Quasar-T)42.3Unverified
2Denoising QAEM (Quasar-T)42.2Unverified
3DecaPropEM (Quasar-T)38.6Unverified
4R^3EM (Quasar-T)35.3Unverified
5GAEM (Quasar-T)26.4Unverified
6BiDAFEM (Quasar-T)25.9Unverified
#ModelMetricClaimedVerifiedStatus
1FiEExact Match58.4Unverified
2R2-D2 HN-DPRExact Match55.9Unverified
3UniK-QAExact Match54.9Unverified
4UnitedQA (Hybrid)Exact Match54.7Unverified
5BPR (linear scan; l=1000)Exact Match41.6Unverified
#ModelMetricClaimedVerifiedStatus
1SPARTAEM59.3Unverified
2Blended RAGEM57.63Unverified
3BERTseriniEM50.2Unverified
4BERTseriniEM38.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match57.7Unverified
2FiE+PAQExact Match56.3Unverified
3FiEExact Match52.4Unverified
4EMDR2Exact Match48.7Unverified
#ModelMetricClaimedVerifiedStatus
1DrQAEM70Unverified
2DCNEM66.2Unverified
3MPCMEM65.5Unverified
#ModelMetricClaimedVerifiedStatus
1ERNIE 2.0 LargeEM64.2Unverified
2ERNIE 2.0 BaseEM61.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match65.5Unverified
2BPR (linear scan; l=1000)Exact Match56.8Unverified
#ModelMetricClaimedVerifiedStatus
1EMDR2Exact Match52.5Unverified
#ModelMetricClaimedVerifiedStatus
1UnitedQA (Hybrid)Exact Match70.5Unverified