Dialogue
Dialogue is notoriously hard to evaluate. Past approaches have used human evaluation.
Papers
No papers found.
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BART (TextBox 2.0) | BLEU-1 | 49.58 | — | Unverified |
Dialogue is notoriously hard to evaluate. Past approaches have used human evaluation.
No papers found.
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BART (TextBox 2.0) | BLEU-1 | 49.58 | — | Unverified |