SOTAVerified

A Generalised and Adaptable Reinforcement Learning Stopping Method

2025-05-03Code Available0· sign in to hype

Reem Bin-Hezam, Mark Stevenson

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This paper presents a Technology Assisted Review (TAR) stopping approach based on Reinforcement Learning (RL). Previous such approaches offered limited control over stopping behaviour, such as fixing the target recall and tradeoff between preferring to maximise recall or cost. These limitations are overcome by introducing a novel RL environment, GRLStop, that allows a single model to be applied to multiple target recalls, balances the recall/cost tradeoff and integrates a classifier. Experiments were carried out on six benchmark datasets (CLEF e-Health datasets 2017-9, TREC Total Recall, TREC Legal and Reuters RCV1) at multiple target recall levels. Results showed that the proposed approach to be effective compared to multiple baselines in addition to offering greater flexibility.

Tasks

Reproductions