Audio Generation
Audio generation (synthesis) is the task of generating raw audio, such as speech.
(Image credit: MelNet)
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VAB-Encodec (Ours) | Bits per byte | 40 | — | Unverified |
| 2 | Sparse Transformer 152M (strided) | Bits per byte | 1.97 | — | Unverified |