SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is an approach that combines the strengths of retrieval-based and generation-based models. A retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, conditions on the retrieved text to produce a response. Grounding the output in retrieved text can improve the accuracy and coherence of generated text, especially in tasks that require detailed knowledge or long-context handling.
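The retrieve-then-generate flow described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: it assumes a toy in-memory corpus and swaps in bag-of-words cosine similarity for a real retriever. The names `tokenize`, `cosine`, `retrieve`, and `build_prompt` are hypothetical helpers, and the generation step is left as a prompt that would be passed to whatever language model is in use.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    """Return the k passages most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus,
        key=lambda passage: cosine(q, Counter(tokenize(passage))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Join the retrieved passages and the question into a generator prompt."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


# Toy corpus, made up for illustration only.
corpus = [
    "The Eiffel Tower is located in Paris.",
    "Mount Everest is the highest mountain on Earth.",
    "Paris is the capital of France.",
]
query = "Where is the Eiffel Tower located?"
top = retrieve(query, corpus, k=2)
prompt = build_prompt(query, top)  # this string would be sent to the generator
```

In practice the retriever is often a sparse index or a dense embedding search, and the prompt is consumed by a neural language model; the sketch only shows how the two stages connect.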

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured with metrics such as precision, recall, F1 score, BLEU, and exact match. Popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
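As an illustration of two of these metrics, the sketch below computes exact match and token-level F1 between a predicted answer and a reference answer. The normalization (lowercasing, stripping punctuation and English articles) follows a convention commonly used in extractive question answering evaluation, but it is an assumption here; individual benchmarks define their own official scoring scripts, and the function names are hypothetical.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


em = exact_match("The Eiffel Tower.", "eiffel tower")
f1 = token_f1("in Paris France", "Paris")
```

Exact match rewards only fully correct answers, while token F1 gives partial credit when the prediction contains the reference plus extra words, which is why the two are often reported together.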

Title	Date	Tasks	Status	Hype
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System	Mar 18, 2025	Benchmarking, In-Context Learning	Code Available	1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding	Mar 16, 2025	Autonomous Driving, RAG	Code Available	1
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction	Mar 3, 2025	RAG, Retrieval	Code Available	1
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development	Mar 2, 2025	Code Generation, Program Synthesis	Code Available	1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models	Feb 28, 2025	Attribute, Autonomous Driving	Code Available	1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking	Feb 28, 2025	RAG, Retrieval	Code Available	1
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation	Feb 28, 2025	Articles, Benchmarking	Code Available	1
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Feb 27, 2025	Information Retrieval, Knowledge Graphs	Code Available	1
EgoNormia: Benchmarking Physical Social Norm Understanding	Feb 27, 2025	Answer Generation, Benchmarking	Code Available	1
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models	Feb 27, 2025	Question Answering, RAG	Code Available	1
