| Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation | May 25, 2023 | Hallucination Pair-wise Detection (1-ref)Informativeness | CodeCode Available | 1 |
| DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 | May 24, 2023 | Informativeness | —Unverified | 0 |
| Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks | May 24, 2023 | InformativenessQuestion Answering | —Unverified | 0 |
| AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content | May 24, 2023 | Document Summarizationdocument understanding | —Unverified | 0 |
| Coverage-based Example Selection for In-Context Learning | May 24, 2023 | In-Context LearningInformativeness | CodeCode Available | 1 |
| Prompting Language-Informed Distribution for Compositional Zero-Shot Learning | May 23, 2023 | Compositional Zero-Shot LearningInformativeness | CodeCode Available | 1 |
| μPLAN: Summarizing using a Content Plan as Cross-Lingual Bridge | May 23, 2023 | Informativeness | —Unverified | 0 |
| APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | May 23, 2023 | InformativenessLanguage Modelling | CodeCode Available | 0 |
| Task-Oriented Communication with Out-of-Distribution Detection: An Information Bottleneck Framework | May 21, 2023 | InformativenessOut-of-Distribution Detection | CodeCode Available | 0 |
| Writing your own book: A method for going from closed to open book QA to improve robustness and performance of smaller LLMs | May 18, 2023 | InformativenessQuestion Answering | —Unverified | 0 |