| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | DiagnosticLanguage Modelling | CodeCode Available | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Jul 2, 2023 | Biomedical Information RetrievalContrastive Learning | CodeCode Available | 2 |
| SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning | Jun 21, 2023 | SentenceText Generation | CodeCode Available | 2 |
| Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Jun 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Improving CLIP Training with Language Rewrites | May 31, 2023 | In-Context LearningSentence | CodeCode Available | 2 |
| IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages | May 25, 2023 | AllMachine Translation | CodeCode Available | 2 |
| HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation | May 19, 2023 | HallucinationMachine Translation | CodeCode Available | 2 |
| Active Retrieval Augmented Generation | May 11, 2023 | RetrievalRetrieval-augmented Generation | CodeCode Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | HallucinationMachine Translation | CodeCode Available | 2 |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | May 4, 2023 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 2 |