SOTAVerified

LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge Graphs

2024-10-08Code Available2· sign in to hype

Vincent Emonet, Jerven Bolleman, Severine Duvaud, Tarcisio Mendes de Farias, Ana Claudia Sima

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We introduce a Retrieval-Augmented Generation (RAG) system for translating user questions into accurate federated SPARQL queries over bioinformatics knowledge graphs (KGs) leveraging Large Language Models (LLMs). To enhance accuracy and reduce hallucinations in query generation, our system utilises metadata from the KGs, including query examples and schema information, and incorporates a validation step to correct generated queries. The system is available online at chat.expasy.org.

Tasks

Reproductions