| Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs | Jun 4, 2024 | Question Answering, TriviaQA | Code Available | 1 |
| LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | May 31, 2024 | TriviaQA, TruthfulQA | Code Available | 0 |
| Accurate and Nuanced Open-QA Evaluation Through Textual Entailment | May 26, 2024 | Natural Language Inference, Open-Domain Question Answering | Code Available | 0 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Apr 25, 2024 | GSM8K, HellaSwag | Code Available | 3 |
| KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering | Apr 24, 2024 | Hallucination, Question Answering | Unverified | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal Prediction, Generative Question Answering | Unverified | 0 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | Decoder, Multi-Task Learning | Code Available | 1 |
| FIT-RAG: Black-Box RAG with Factual Information and Token Reduction | Mar 21, 2024 | Open-Domain Question Answering, Question Answering | Unverified | 0 |
| Unfamiliar Finetuning Examples Control How Language Models Hallucinate | Mar 8, 2024 | MMLU, Multiple-choice | Code Available | 1 |
| Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering | Mar 8, 2024 | Answer Generation, Open-Domain Question Answering | Code Available | 1 |
| Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | Feb 27, 2024 | Known Unknowns, Question Answering | Unverified | 0 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | Feb 23, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate | Feb 9, 2024 | Question Answering, TriviaQA | Unverified | 0 |
| Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing | Jan 10, 2024 | Decoder, Reading Comprehension | Unverified | 0 |
| Efficient Transformer Knowledge Distillation: A Performance Review | Nov 22, 2023 | Knowledge Distillation, Model Compression | Unverified | 0 |
| Noisy Pair Corrector for Dense Retrieval | Nov 7, 2023 | Code Search, Retrieval | Unverified | 0 |
| A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models | Oct 9, 2023 | Image Generation, Question Answering | Code Available | 0 |
| Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference | Sep 16, 2023 | Instruction Following, Question Answering | Unverified | 0 |
| Generator-Retriever-Generator Approach for Open-Domain Question Answering | Jul 21, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| When to Read Documents or QA History: On Unified and Selective Open-domain QA | Jun 7, 2023 | Natural Questions, Open-Domain Question Answering | Unverified | 0 |
| Exploiting Abstract Meaning Representation for Open-Domain Question Answering | May 26, 2023 | Abstract Meaning Representation, Diversity | Code Available | 1 |
| RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering | May 26, 2023 | Decoder, Natural Questions | Code Available | 0 |
| Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback | May 24, 2023 | TriviaQA, TruthfulQA | Code Available | 0 |
| Allies: Prompting Large Language Model with Beam Search | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing | May 19, 2023 | Fact Checking, Natural Questions | Code Available | 0 |