| TrustRAG: An Information Assistant with Retrieval Augmented Generation | Feb 19, 2025 | Answer GenerationChunking | CodeCode Available | 5 |
| Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG | Jan 15, 2025 | Natural Language UnderstandingRAG | CodeCode Available | 5 |
| MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation | Jan 12, 2025 | RAGRetrieval | CodeCode Available | 5 |
| Search-o1: Agentic Search-Enhanced Large Reasoning Models | Jan 9, 2025 | Code Generation | CodeCode Available | 5 |
| Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks | Dec 20, 2024 | AllRAG | CodeCode Available | 5 |
| OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Dec 10, 2024 | AttributeBenchmarking | CodeCode Available | 5 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8kGPU | CodeCode Available | 5 |
| RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation | Aug 15, 2024 | DiagnosticRAG | CodeCode Available | 5 |
| Retrieval-Augmented Generation for AI-Generated Content: A Survey | Feb 29, 2024 | Information RetrievalLarge Language Model | CodeCode Available | 5 |
| A Survey of LLM DATA | May 24, 2025 | Large Language ModelManagement | CodeCode Available | 4 |
| SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis | May 22, 2025 | DiversityInformation Retrieval | CodeCode Available | 4 |
| R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | May 22, 2025 | MemorizationRAG | CodeCode Available | 4 |
| s3: You Don't Need That Much Data to Train a Search Agent via RL | May 20, 2025 | RAGReinforcement Learning (RL) | CodeCode Available | 4 |
| OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | May 12, 2025 | GPUPrivacy Preserving | CodeCode Available | 4 |
| DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments | Apr 4, 2025 | NavigatePrompt Engineering | CodeCode Available | 4 |
| Retrieval-Augmented Generation with Hierarchical Knowledge | Mar 13, 2025 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 4 |
| ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents | Feb 25, 2025 | Question AnsweringRAG | CodeCode Available | 4 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| Training Sparse Mixture Of Experts Text Embedding Models | Feb 11, 2025 | Mixture-of-ExpertsRAG | CodeCode Available | 4 |
| Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation | Feb 4, 2025 | BenchmarkingInformation Retrieval | CodeCode Available | 4 |
| ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding | Jan 14, 2025 | RAGRetrieval | CodeCode Available | 4 |
| EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations | Oct 14, 2024 | Answer GenerationQuestion Answering | CodeCode Available | 4 |
| VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents | Oct 14, 2024 | RAGRetrieval | CodeCode Available | 4 |
| When Does Perceptual Alignment Benefit Vision Representations? | Oct 14, 2024 | Depth EstimationImage Generation | CodeCode Available | 4 |
| Data-Prep-Kit: getting your data ready for LLM application development | Sep 26, 2024 | CPULanguage Modeling | CodeCode Available | 4 |