Oracle-Checker Scheme for Evaluating a Generative Large Language Model

2024-05-06

Yueling Jenny Zeng, Li-C. Wang, Thomas Ibbetson

Abstract

This work presents a novel approach, called the oracle-checker scheme, for evaluating the answer given by a generative large language model (LLM). Two types of checkers are presented: the first follows the idea of property testing, and the second follows the idea of program checking. Their applications are demonstrated in two separate contexts, entity extraction and paraphrase decision, respectively.
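To make the oracle-checker idea concrete, the following is a minimal Python sketch that treats the LLM as an untrusted oracle and applies a cheap check to its answer. It is not the paper's checkers: the abstract does not specify them, so the two checks here (verbatim occurrence of extracted entities, and symmetry/reflexivity of a paraphrase decision) are assumptions chosen for illustration. The names `check_entities`, `check_paraphrase`, `toy_oracle`, and `length_oracle` are hypothetical, and the oracles are stand-ins for a real model call.

```python
from typing import Callable, List


def check_entities(text: str, entities: List[str]) -> bool:
    """Illustrative check for entity extraction (an assumption, not the
    paper's checker): every claimed entity must be non-empty and must
    occur verbatim (case-insensitively) in the source text."""
    lowered = text.lower()
    return all(bool(e.strip()) and e.lower() in lowered for e in entities)


def check_paraphrase(oracle: Callable[[str, str], bool], a: str, b: str) -> bool:
    """Illustrative consistency check for a paraphrase decision (also an
    assumption): the oracle's verdict should not depend on argument order,
    and each sentence should count as a paraphrase of itself."""
    symmetric = oracle(a, b) == oracle(b, a)
    reflexive = oracle(a, a) and oracle(b, b)
    return bool(symmetric and reflexive)


def toy_oracle(a: str, b: str) -> bool:
    """Hypothetical stand-in for an LLM: compares the sets of words."""
    return set(a.lower().split()) == set(b.lower().split())


def length_oracle(a: str, b: str) -> bool:
    """Hypothetical faulty oracle whose verdict depends on argument order."""
    return len(a) <= len(b)


if __name__ == "__main__":
    text = "Alice visited Paris in May."
    print(check_entities(text, ["Alice", "Paris"]))   # entities found in text
    print(check_entities(text, ["Alice", "London"]))  # "London" is not in text
    print(check_paraphrase(toy_oracle, "the cat sat", "sat the cat"))
    print(check_paraphrase(length_oracle, "short", "a longer sentence"))
```

A passing check does not show that the oracle's answer is correct; it only means the answer did not violate the property being tested. A failing check, by contrast, flags an answer that should not be trusted.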
