Multi-Codebook Speech Generation with Frame‑Stacked Local Transformers
The study, submitted on 23 Sep 2025, shows that frame‑stacked parallel decoding with a MaskGIT local transformer speeds up speech generation while keeping quality acceptable. Read more: getnews.me/multi-codebook-speech-ge... #multicodebooks #maskgit