| Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness | Jan 21, 2023 | Diagnostic, Natural Questions | Code Available | 1 |
| On the Effectiveness of Parameter-Efficient Fine-Tuning | Nov 28, 2022 | Natural Questions, parameter-efficient fine-tuning | Code Available | 1 |
| DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering | Nov 10, 2022 | counterfactual, Data Augmentation | Code Available | 1 |
| Recitation-Augmented Language Models | Oct 4, 2022 | Natural Questions, Question Answering | Code Available | 1 |
| MultiSpanQA: A Dataset for Multi-Span Question Answering | Jul 1, 2022 | Natural Questions, Question Answering | Code Available | 1 |
| Would You Ask it that Way? Measuring and Improving Question Naturalness for Knowledge Graph Question Answering | May 25, 2022 | Graph Question Answering, Natural Questions | Code Available | 1 |
| Table Retrieval May Not Necessitate Table-specific Model Design | May 19, 2022 | Hard Attention, Natural Questions | Code Available | 1 |
| How Do We Answer Complex Questions: Discourse Structure of Long-form Answers | Mar 21, 2022 | Form, Natural Questions | Code Available | 1 |
| Asyncval: A Toolkit for Asynchronously Validating Dense Retriever Checkpoints during Training | Feb 25, 2022 | GPU, Natural Questions | Code Available | 1 |
| Text and Code Embeddings by Contrastive Pre-Training | Jan 24, 2022 | Code Search, Linear-Probe Classification | Code Available | 1 |