| COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval | Jun 2, 2024 | Information RetrievalRAG | CodeCode Available | 4 | 5 |
| ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding | Jan 14, 2025 | RAGRetrieval | CodeCode Available | 4 | 5 |
| s3: You Don't Need That Much Data to Train a Search Agent via RL | May 20, 2025 | RAGReinforcement Learning (RL) | CodeCode Available | 4 | 5 |
| A Survey of LLM DATA | May 24, 2025 | Large Language ModelManagement | CodeCode Available | 4 | 5 |
| Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization | Apr 2, 2024 | RAGRetrieval | CodeCode Available | 4 | 5 |
| R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | May 22, 2025 | MemorizationRAG | CodeCode Available | 4 | 5 |
| Generative Representational Instruction Tuning | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | May 12, 2025 | GPUPrivacy Preserving | CodeCode Available | 4 | 5 |
| Data-Prep-Kit: getting your data ready for LLM application development | Sep 26, 2024 | CPULanguage Modeling | CodeCode Available | 4 | 5 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Aug 8, 2024 | ChunkingFact Checking | CodeCode Available | 4 | 5 |