Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey

2024-09-20

Sourav Verma

Code

github.com/srgrace/contextual-compression
Official, in paper, PyTorch, ★ 17

Abstract

Large Language Models (LLMs) showcase remarkable abilities, yet they struggle with limitations such as hallucinations, outdated knowledge, opacity, and inexplicable reasoning. To address these challenges, Retrieval-Augmented Generation (RAG) has proven to be a viable solution. It leverages external databases to improve the consistency and coherence of generated content, which is especially valuable for complex, knowledge-rich tasks, and it facilitates continuous improvement by incorporating domain-specific insights. By combining the intrinsic knowledge of LLMs with the vast, dynamic repositories of external databases, RAG achieves a synergistic effect. However, RAG is not without its limitations, including a limited context window, irrelevant information, and high processing overhead for extensive contextual data. In this comprehensive work, we explore the evolution of Contextual Compression paradigms, providing an in-depth examination of the field. Finally, we outline the current challenges and suggest potential research and development directions, paving the way for future advancements in this area.

Tasks

RAG, Retrieval, Retrieval-augmented Generation
