| RAR-b: Reasoning as Retrieval Benchmark | Apr 9, 2024 | Information RetrievalRAG | CodeCode Available | 1 |
| ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval | Feb 24, 2024 | DecoderReranking | CodeCode Available | 1 |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Feb 23, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training | Jan 4, 2024 | DescriptiveImage Captioning | CodeCode Available | 1 |
| Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Dec 14, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 |
| Functional Overlap Reranking for Neural Code Generation | Oct 16, 2023 | Code GenerationReranking | CodeCode Available | 1 |
| Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models | Oct 11, 2023 | Passage RerankingReranking | CodeCode Available | 1 |
| HypR: A comprehensive study for ASR hypothesis revising with a reference corpus | Sep 18, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Generative Flow Network for Listwise Recommendation | Jun 4, 2023 | DiversityRecommendation Systems | CodeCode Available | 1 |