HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims

2024-10-16Code Available1· sign in to hype

Yejun Yoon, JaeYoon Jung, Seunghyun Yoon, Kunwoo Park

Code Available — Be the first to reproduce this paper.

Code

github.com/ssu-humane/hero
OfficialIn paperpytorch★ 14

Abstract

To tackle the AVeriTeC shared task hosted by the FEVER-24, we introduce a system that only employs publicly available large language models (LLMs) for each step of automated fact-checking, dubbed the Herd of Open LLMs for verifying real-world claims (HerO). For evidence retrieval, a language model is used to enhance a query by generating hypothetical fact-checking documents. We prompt pretrained and fine-tuned LLMs for question generation and veracity prediction by crafting prompts with retrieved in-context samples. HerO achieved 2nd place on the leaderboard with the AVeriTeC score of 0.57, suggesting the potential of open LLMs for verifying real-world claims. For future research, we make our code publicly available at https://github.com/ssu-humane/HerO.

Tasks

Fact Checking Language Modeling Language Modelling Question Generation Question-Generation Retrieval

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
AVeriTeC	HerO	Question Only score	0.48	—	Unverified

HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims

Code

Abstract

Tasks

Benchmark Results

Reproductions