SOTAVerified

Open-Domain Question Answering

Open-domain question answering is the task of question answering on open-domain datasets such as Wikipedia.

Papers

Showing 351400 of 494 papers

TitleStatusHype
Efficient Retrieval Optimized Multi-task Learning0
Enabling Transitivity for Lexical Inference on Chinese Verbs Using Probabilistic Soft Logic0
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering0
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization0
Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning0
Dynamic Retrieval-Augmented Generation0
ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval0
Evaluation of baseline information retrieval for Polish open-domain Question Answering system0
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks0
Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs0
FastHybrid: A Hybrid Model for Efficient Answer Selection0
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation0
FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering0
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction0
Focused Hierarchical RNNs for Conditional Sequence Processing0
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data0
FriendsQA: Open-Domain Question Answering on TV Show Transcripts0
From Retrieval to Generation: Comparing Different Approaches0
Geographic Question Answering: Challenges, Uniqueness, Classification, and Future Directions0
Get Your Model Puzzled: Introducing Crossword-Solving as a New NLP Benchmark0
GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking0
HAS-QA: Hierarchical Answer Spans Model for Open-domain Question Answering0
Higher-order Lexical Semantic Models for Non-factoid Answer Reranking0
Hint-enhanced In-Context Learning wakes Large Language Models up for knowledge-intensive tasks0
HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions0
How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering0
HuRIC: a Human Robot Interaction Corpus0
Hyperlink-induced Pre-training for Passage Retrieval of Open-domain Question Answering0
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions0
Improve Dense Passage Retrieval with Entailment Tuning0
Improving Biomedical Information Retrieval with Neural Retrievers0
Improving Conditioning in Context-Aware Sequence to Sequence Models0
Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation0
Improving Long Text Understanding with Knowledge Distilled from Summarization Model0
PolQA: Polish Question Answering Dataset0
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering0
Inferring Binary Relation Schemas for Open Information Extraction0
Inner Attention based Recurrent Neural Networks for Answer Selection0
In Situ Answer Sentence Selection at Web-scale0
Internet-augmented language models through few-shot prompting for open-domain question answering0
Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation0
Investigating Information Inconsistency in Multilingual Open-Domain Question Answering0
Is Retriever Merely an Approximator of Reader?0
Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval0
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking0
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering0
KGI: An Integrated Framework for Knowledge Intensive Language Tasks0
Knowledge-Aided Open-Domain Question Answering0
Knowledge-Aware Iterative Retrieval for Multi-Agent Systems0
Knowledge Fusion and Semantic Knowledge Ranking for Open Domain Question Answering0
Show:102550
← PrevPage 8 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1somebodyKILT-RL2.62Unverified
2WikipediaKILT-RL2.46Unverified
3arxiv.org/abs/2103.06332KILT-RL2.36Unverified
4BART + DPRKILT-RL1.9Unverified
5RAGKILT-RL1.69Unverified
6T5-baseKILT-RL0Unverified
7GENREKILT-RL0Unverified
8Multi-task DPRKILT-RL0Unverified
9BARTKILT-RL0Unverified
10Training Set Retrieval (top 1)KILT-RL0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM43.56Unverified
2intersectKILT-EM38.78Unverified
3KGI_0KILT-EM36.36Unverified
4WikipediaKILT-EM35.32Unverified
5RAGKILT-EM32.69Unverified
6BERT + DPRKILT-EM31.99Unverified
7BART + DPRKILT-EM30.06Unverified
8Multitask DPR + BARTKILT-EM29.09Unverified
9SphereKILT-EM0Unverified
10T5-baseKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1Re2GKILT-EM57.91Unverified
2intersectKILT-EM50.56Unverified
3WikipediaKILT-EM45.55Unverified
4KGI_0KILT-EM42.85Unverified
5Multitask DPR + BARTKILT-EM42.36Unverified
6RAGKILT-EM38.13Unverified
7BERT + DPRKILT-EM34.48Unverified
8BART + DPRKILT-EM31.4Unverified
9Multi-task DPRKILT-EM0Unverified
10SphereKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1intersectKILT-EM18.06Unverified
2WikipediaKILT-EM11.71Unverified
3Multitask DPR + BARTKILT-EM9.53Unverified
4RAGKILT-EM3.21Unverified
5BART + DPRKILT-EM1.96Unverified
6BERT + DPRKILT-EM0.74Unverified
7SphereKILT-EM0Unverified
8Multi-task DPRKILT-EM0Unverified
9GENREKILT-EM0Unverified
10chriskueiKILT-EM0Unverified
#ModelMetricClaimedVerifiedStatus
1SpanBERTF184.8Unverified
2Cluster-Former (#C=512)EM68Unverified
3Locality-Sensitive HashingEM66Unverified
4Multi-passage BERTEM65.1Unverified
5Sparse AttentionEM64.7Unverified
6DECAPROPEM62.2Unverified
7Bi-Attention + DCU-LSTMN-gram F159.5Unverified
8Denoising QAEM58.8Unverified
9DecaPropEM56.8Unverified
10AMANDAN-gram F156.6Unverified
#ModelMetricClaimedVerifiedStatus
1Fourier TransformerRouge-L26.9Unverified
2QGRouge-L26.4Unverified
3BARTRouge-L24.3Unverified
4E-MCARouge-L24Unverified
5Transformer Multitask + LayerDropRouge-L23.4Unverified
6Multi-InrerleaveRouge-L14.63Unverified
#ModelMetricClaimedVerifiedStatus
1Evidence Aggregation via R^3 Re-RankingEM (Quasar-T)42.3Unverified
2Denoising QAEM (Quasar-T)42.2Unverified
3DecaPropEM (Quasar-T)38.6Unverified
4R^3EM (Quasar-T)35.3Unverified
5GAEM (Quasar-T)26.4Unverified
6BiDAFEM (Quasar-T)25.9Unverified
#ModelMetricClaimedVerifiedStatus
1FiEExact Match58.4Unverified
2R2-D2 HN-DPRExact Match55.9Unverified
3UniK-QAExact Match54.9Unverified
4UnitedQA (Hybrid)Exact Match54.7Unverified
5BPR (linear scan; l=1000)Exact Match41.6Unverified
#ModelMetricClaimedVerifiedStatus
1SPARTAEM59.3Unverified
2Blended RAGEM57.63Unverified
3BERTseriniEM50.2Unverified
4BERTseriniEM38.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match57.7Unverified
2FiE+PAQExact Match56.3Unverified
3FiEExact Match52.4Unverified
4EMDR2Exact Match48.7Unverified
#ModelMetricClaimedVerifiedStatus
1DrQAEM70Unverified
2DCNEM66.2Unverified
3MPCMEM65.5Unverified
#ModelMetricClaimedVerifiedStatus
1ERNIE 2.0 LargeEM64.2Unverified
2ERNIE 2.0 BaseEM61.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniK-QAExact Match65.5Unverified
2BPR (linear scan; l=1000)Exact Match56.8Unverified
#ModelMetricClaimedVerifiedStatus
1EMDR2Exact Match52.5Unverified
#ModelMetricClaimedVerifiedStatus
1UnitedQA (Hybrid)Exact Match70.5Unverified