Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Jan 15, 2025 Natural Language Understanding RAG
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Dec 18, 2024 Decoder Retrieval
TrustRAG: An Information Assistant with Retrieval Augmented Generation Feb 19, 2025 Answer Generation Chunking
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation Jan 12, 2025 RAG Retrieval
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Nov 21, 2024 Retrieval
Make Your LLM Fully Utilize the Context Apr 25, 2024 4k Information Retrieval
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism Jun 30, 2025 Question Answering RAG
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge Mar 24, 2023 Information Retrieval Language Modeling
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention Apr 10, 2024 Book summarization Language Modeling
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Oct 17, 2023 Fact Verification Question Answering
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Jun 11, 2024 4k Language Modeling
s3: You Don't Need That Much Data to Train a Search Agent via RL May 20, 2025 RAG Reinforcement Learning (RL)
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation Nov 7, 2024 Contrastive Learning Image Captioning
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering Feb 26, 2024 Evidence Selection Open-Ended Question Answering
RETSim: Resilient and Efficient Text Similarity Nov 28, 2023 Adversarial Text Clustering
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis May 22, 2025 Diversity Information Retrieval
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks May 22, 2020 Fact Verification Question Answering
Retrieval-Augmented Generation for Large Language Models: A Survey Dec 18, 2023 Hallucination RAG
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions Aug 1, 2024 Medical Question Answering MedQA
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA Feb 9, 2024 Event Detection Hate Speech Detection
Retrieval-Augmented Generation with Hierarchical Knowledge Mar 13, 2025 Multi-hop Question Answering Question Answering
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard Jun 13, 2023 Information Retrieval Representation Learning
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function May 26, 2023 Fact Verification Information Retrieval
Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Retrieval-Generation Synergy Augmented Large Language Models Oct 8, 2023 Question Answering Retrieval
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes Feb 13, 2023 Information Retrieval Retrieval
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering Feb 12, 2024 Common Sense Reasoning Graph Classification
Halu-J: Critique-Based Hallucination Judge Jul 17, 2024 Evidence Selection Hallucination
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation Feb 4, 2025 Benchmarking Information Retrieval
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning May 22, 2025 Memorization RAG
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization Apr 2, 2024 RAG Retrieval
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos Jul 17, 2024 Retrieval Video Understanding
Gated Delta Networks: Improving Mamba2 with Delta Rule Dec 9, 2024 Common Sense Reasoning Language Modeling
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents Jun 23, 2025 Information Retrieval Retrieval
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers Jul 14, 2022 Retrieval Text Retrieval
PLAID: An Efficient Engine for Late Interaction Retrieval May 19, 2022 CPU GPU
Generative Representational Instruction Tuning Feb 15, 2024 Language Modeling Language Modelling
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Feb 29, 2024 Retrieval Text Retrieval
Benchmarking Retrieval-Augmented Generation for Medicine Feb 20, 2024 Benchmarking Information Retrieval
Prompt2Model: Generating Deployable Models from Natural Language Instructions Aug 23, 2023 Data-free Knowledge Distillation Dataset Generation
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding Jan 14, 2025 RAG Retrieval
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval Sep 14, 2024 Contrastive Learning Image Retrieval
A Survey of LLM × DATA May 24, 2025 Large Language Model Management
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language Models Mar 11, 2025 Form Information Retrieval
One Embedder, Any Task: Instruction-Finetuned Text Embeddings Dec 19, 2022 Information Retrieval Learning Word Embeddings
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations Oct 14, 2024 Answer Generation Question Answering
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads Oct 14, 2024 GPU Quantization
FG-CLIP: Fine-Grained Visual and Textual Alignment May 8, 2025 Image-text Retrieval object-detection
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit May 12, 2025 GPU Privacy Preserving
2D Matryoshka Sentence Embeddings Feb 22, 2024 RAG Representation Learning