Auto Debugging
Papers
No papers found.
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaLM 62B (few-shot, k=5) | Exact string match | 38.2 | — | Unverified |
| 2 | PaLM 540B (few-shot, k=5) | Exact string match | 38.2 | — | Unverified |
| 3 | PaLM 8B (few-shot, k=5) | Exact string match | 14.7 | — | Unverified |
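
The table reports "Exact string match" but does not say how it is computed. A minimal sketch of one common reading follows: the share of model outputs that equal the reference answer character for character, on a 0 to 100 scale. The function name `exact_match_score`, the whitespace stripping, and the percentage scale are assumptions for illustration, not details confirmed by this page.

```python
from typing import Sequence


def exact_match_score(predictions: Sequence[str], targets: Sequence[str]) -> float:
    """Return the percentage of predictions identical to their target string."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("cannot score an empty evaluation set")
    # Assumption: leading/trailing whitespace is ignored; everything else
    # (case, punctuation, inner spacing) must match exactly.
    hits = sum(p.strip() == t.strip() for p, t in zip(predictions, targets))
    return 100.0 * hits / len(targets)


if __name__ == "__main__":
    # Hypothetical model outputs and reference answers, not benchmark data.
    preds = ["4", "[1, 2]", "NameError", "None"]
    refs = ["4", "[1, 2, 3]", "NameError", "0"]
    print(exact_match_score(preds, refs))  # 2 of 4 match
```

Under this reading a near miss such as `[1, 2]` against `[1, 2, 3]` earns no partial credit, so scores can sit well below what a more lenient metric would report.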