Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models

2024-10-12

Ivo Verhoeven, Pushkar Mishra, Ekaterina Shutova

Code

github.com/ioverho/misinfo-general
Official implementation (PyTorch), 4 stars

Abstract

This paper introduces misinfo-general, a benchmark dataset for evaluating misinformation models' ability to perform out-of-distribution generalisation. Misinformation changes rapidly, much more quickly than moderators can annotate at scale, resulting in a shift between the training and inference data distributions. As a result, misinformation models need to be able to perform out-of-distribution generalisation, an understudied problem in existing datasets. We identify 6 axes of generalisation (time, event, topic, publisher, political bias, and misinformation type) and design evaluation procedures for each. We also analyse some baseline models, highlighting how these fail to meet important desiderata.
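
To make the idea of evaluating along one generalisation axis concrete, here is a minimal sketch of a held-out split. It is an illustration only and not the benchmark's actual evaluation procedure: the record fields (`publisher`, `year`, `label`), the toy data, and the helper names `split_by_held_out_values` and `split_by_time` are all hypothetical.

```python
def split_by_held_out_values(records, axis, held_out_values):
    """Split records so that every value in held_out_values of the given
    axis appears only in the out-of-distribution (OOD) test set."""
    held_out = set(held_out_values)
    in_dist, out_dist = [], []
    for record in records:
        if record[axis] in held_out:
            out_dist.append(record)
        else:
            in_dist.append(record)
    return in_dist, out_dist


def split_by_time(records, cutoff_year):
    """Train on records up to and including cutoff_year, test on later ones."""
    train = [r for r in records if r["year"] <= cutoff_year]
    test = [r for r in records if r["year"] > cutoff_year]
    return train, test


# Hypothetical toy records; the real dataset schema may differ.
articles = [
    {"id": 1, "publisher": "pub_a", "year": 2019, "label": 0},
    {"id": 2, "publisher": "pub_b", "year": 2020, "label": 1},
    {"id": 3, "publisher": "pub_c", "year": 2021, "label": 1},
    {"id": 4, "publisher": "pub_c", "year": 2022, "label": 0},
    {"id": 5, "publisher": "pub_a", "year": 2022, "label": 0},
]

# Publisher axis: no article from pub_c is seen during training.
pub_train, pub_ood = split_by_held_out_values(articles, "publisher", {"pub_c"})

# Time axis: train on the past, evaluate on later years.
time_train, time_ood = split_by_time(articles, cutoff_year=2020)

print(len(pub_train), len(pub_ood))
print(len(time_train), len(time_ood))
```

Comparing a model's score on the in-distribution portion with its score on the held-out portion gives a rough measure of the generalisation gap along that axis.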

Tasks

Benchmarking, Misinformation
