SOTAVerified

GINGER: Grounded Information Nugget-Based Generation of Responses

2025-03-23Code Available0· sign in to hype

Weronika Łajewska, Krisztian Balog

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Retrieval-augmented generation (RAG) faces challenges related to factual correctness, source attribution, and response completeness. To address them, we propose a modular pipeline for grounded response generation that operates on information nuggets-minimal, atomic units of relevant information extracted from retrieved documents. The multistage pipeline encompasses nugget detection, clustering, ranking, top cluster summarization, and fluency enhancement. It guarantees grounding in specific facts, facilitates source attribution, and ensures maximum information inclusion within length constraints. Extensive experiments on the TREC RAG'24 dataset evaluated with the AutoNuggetizer framework demonstrate that GINGER achieves state-of-the-art performance on this benchmark.

Tasks

Reproductions