SOTAVerified

Dialogue

Dialogue is notoriously hard to evaluate. Past approaches have used human evaluation.

Papers

Showing 1 of 1 papers

Title | Status | Hype
TextBox 2.0: A Text Generation Library with Pre-trained Language Models | Code | 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-1 | 49.58 | - | Unverified