answerability prediction
Papers
No papers found.
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Mistral-IT-v02-7B-32k | Macro F1 | 0.47 | — | Unverified |
| 2 | Command-R-v01-34B-128k | Macro F1 | 0.42 | — | Unverified |
| 3 | GPT-3.5-Turbo-0613-16k | Macro F1 | 0.33 | — | Unverified |
| 4 | Llama-3-IT-8B-8k | Macro F1 | 0.31 | — | Unverified |
| 5 | GPT-4o-2024-08-06 | Macro F1 | 0.31 | — | Unverified |
| 6 | Llama-3-IT-8B-32k | Macro F1 | 0.29 | — | Unverified |