SOTAVerified

Open-Domain Question Answering

Open-domain question answering is the task of question answering on open-domain datasets such as Wikipedia.

Papers

Showing 51100 of 494 papers

TitleStatusHype
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective AugmentationCode1
Making Retrieval-Augmented Language Models Robust to Irrelevant ContextCode1
Generator-Retriever-Generator Approach for Open-Domain Question AnsweringCode1
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval AugmentationCode1
Answering Ambiguous Questions via Iterative PromptingCode1
Conformal Language ModelingCode1
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question AnsweringCode1
Exploiting Abstract Meaning Representation for Open-Domain Question AnsweringCode1
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT OperatorCode1
On the Risk of Misinformation Pollution with Large Language ModelsCode1
Asking Clarification Questions to Handle Ambiguity in Open-Domain QACode1
Large Language Models are Built-in Autoregressive Search EnginesCode1
Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over TableCode1
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought PromptingCode1
Evaluating Open-Domain Question Answering in the Era of Large Language ModelsCode1
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought FrameworkCode1
Chain-of-Skills: A Configurable Model for Open-domain Question AnsweringCode1
Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual NoiseCode1
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?Code1
Compositional Exemplars for In-context LearningCode1
Self-Prompting Large Language Models for Zero-Shot Open-Domain QACode1
Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single TransformerCode1
CREPE: Open-Domain Question Answering with False PresuppositionsCode1
Cross-document Event Coreference Search: Task, Dataset and ModelingCode1
Open-domain Question Answering via Chain of Reasoning over Heterogeneous KnowledgeCode1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQACode1
Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question AnsweringCode1
Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question AnsweringCode1
Variational Open-Domain Question AnsweringCode1
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal RetrievalCode1
Re2G: Retrieve, Rerank, GenerateCode1
INSCIT: Information-Seeking Conversations with Mixed-Initiative InteractionsCode1
Optimizing Test-Time Query Representations for Dense RetrievalCode1
Few-shot Reranking for Multi-hop QA via Language Model PromptingCode1
Improving Passage Retrieval with Zero-Shot Question GenerationCode1
Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question AnsweringCode1
Simple Questions Generate Named Entity Recognition DatasetsCode1
Learning to Retrieve Passages without SupervisionCode1
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak DecoderCode1
Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence AnnotationCode1
Dense Hierarchical Retrieval for Open-Domain Question AnsweringCode1
Open Domain Question Answering with A Unified Knowledge InterfaceCode1
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-TrainingCode1
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?Code1
Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense RetrievalCode1
Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence AnnotationCode1
EntQA: Entity Linking as Question AnsweringCode1
Simple Entity-Centric Questions Challenge Dense RetrieversCode1
Phrase Retrieval Learns Passage Retrieval, TooCode1
Adaptive Information Seeking for Open-Domain Question AnsweringCode1
Show:102550
← PrevPage 2 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1somebodyKILT-RL2.62Unverified
2WikipediaKILT-RL2.46Unverified
3arxiv.org/abs/2103.06332KILT-RL2.36Unverified
4BART + DPRKILT-RL1.9Unverified
5RAGKILT-RL1.69Unverified
6Training Set Retrieval (top 1)KILT-RL0Unverified
7T5-baseKILT-RL0Unverified
8Input CopyingKILT-RL0Unverified
9SphereKILT-RL0Unverified
10Random Training Set AnswerKILT-RL0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM43.56Unverified
2intersectKILT-EM38.78Unverified
3KGI_0KILT-EM36.36Unverified
4WikipediaKILT-EM35.32Unverified
5RAGKILT-EM32.69Unverified
6BERT + DPRKILT-EM31.99Unverified
7BART + DPRKILT-EM30.06Unverified
8Multitask DPR + BARTKILT-EM29.09Unverified
9Multi-task DPRKILT-EM0Unverified
10SphereKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM57.91Unverified
2intersectKILT-EM50.56Unverified
3WikipediaKILT-EM45.55Unverified
4KGI_0KILT-EM42.85Unverified
5Multitask DPR + BARTKILT-EM42.36Unverified
6RAGKILT-EM38.13Unverified
7BERT + DPRKILT-EM34.48Unverified
8BART + DPRKILT-EM31.4Unverified
9TABiKILT-EM0Unverified
10T5-baseKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1intersectKILT-EM18.06Unverified
2WikipediaKILT-EM11.71Unverified
3Multitask DPR + BARTKILT-EM9.53Unverified
4RAGKILT-EM3.21Unverified
5BART + DPRKILT-EM1.96Unverified
6BERT + DPRKILT-EM0.74Unverified
7SphereKILT-EM0Unverified
8Multi-task DPRKILT-EM0Unverified
9GENREKILT-EM0Unverified
10chriskueiKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1SpanBERTF184.8Unverified
2Cluster-Former (#C=512)EM68Unverified
3Locality-Sensitive HashingEM66Unverified
4Multi-passage BERTEM65.1Unverified
5Sparse AttentionEM64.7Unverified
6DECAPROPEM62.2Unverified
7Bi-Attention + DCU-LSTMN-gram F159.5Unverified
8Denoising QAEM58.8Unverified
9DecaPropEM56.8Unverified
10AMANDAN-gram F156.6Unverified
#ModelMetricClaimedVerifiedStatus
1Fourier TransformerRouge-L26.9Unverified
2QGRouge-L26.4Unverified
3BARTRouge-L24.3Unverified
4E-MCARouge-L24Unverified
5Transformer Multitask + LayerDropRouge-L23.4Unverified
6Multi-InrerleaveRouge-L14.63Unverified
#ModelMetricClaimedVerifiedStatus
1Evidence Aggregation via R^3 Re-RankingEM (Quasar-T)42.3Unverified
2Denoising QAEM (Quasar-T)42.2Unverified
3DecaPropEM (Quasar-T)38.6Unverified
4R^3EM (Quasar-T)35.3Unverified
5GAEM (Quasar-T)26.4Unverified
6BiDAFEM (Quasar-T)25.9Unverified
#ModelMetricClaimedVerifiedStatus
1FiEExact Match58.4Unverified
2R2-D2 HN-DPRExact Match55.9Unverified
3UniK-QAExact Match54.9Unverified
4UnitedQA (Hybrid)Exact Match54.7Unverified
5BPR (linear scan; l=1000)Exact Match41.6Unverified
#ModelMetricClaimedVerifiedStatus
1SPARTAEM59.3Unverified
2Blended RAGEM57.63Unverified
3BERTseriniEM50.2Unverified
4BERTseriniEM38.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match57.7Unverified
2FiE+PAQExact Match56.3Unverified
3FiEExact Match52.4Unverified
4EMDR2Exact Match48.7Unverified
#ModelMetricClaimedVerifiedStatus
1DrQAEM70Unverified
2DCNEM66.2Unverified
3MPCMEM65.5Unverified
#ModelMetricClaimedVerifiedStatus
1ERNIE 2.0 LargeEM64.2Unverified
2ERNIE 2.0 BaseEM61.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match65.5Unverified
2BPR (linear scan; l=1000)Exact Match56.8Unverified
#ModelMetricClaimedVerifiedStatus
1EMDR2Exact Match52.5Unverified
#ModelMetricClaimedVerifiedStatus
1UnitedQA (Hybrid)Exact Match70.5Unverified