| MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery | Sep 9, 2024 | Memorization, Question Answering | Code Available | 7 | 5 |
| Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Apr 3, 2023 | Common Sense Reasoning, Coreference Resolution | Code Available | 6 | 5 |
| LIMO: Less is More for Reasoning | Feb 5, 2025 | Math, Mathematical Reasoning | Code Available | 5 | 5 |
| R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | May 22, 2025 | Memorization, RAG | Code Available | 4 | 5 |
| MUSE: Machine Unlearning Six-Way Evaluation for Language Models | Jul 8, 2024 | Articles, Machine Unlearning | Code Available | 4 | 5 |
| Parameter Efficient Instruction Tuning: An Empirical Study | Nov 25, 2024 | Instruction Following, Memorization | Code Available | 4 | 5 |
| Amortized Planning with Large-Scale Transformers: A Case Study on Chess | Feb 7, 2024 | Memorization | Code Available | 4 | 5 |
| Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models | Jun 9, 2022 | Common Sense Reasoning, Math | Code Available | 4 | 5 |
| Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets | Jan 6, 2022 | Memorization | Code Available | 4 | 5 |
| VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Dec 31, 2024 | Memorization | Code Available | 4 | 5 |