Krony-PT: New Kronecker‑Product Compression Cuts GPT‑2 Size
Krony-PT compresses GPT‑2 feed‑forward layers via Kronecker products, reducing parameters from 124 M to 80‑96 M. The 81 M model outperforms DistilGPT2, while the 96 M version matches the original. getnews.me/krony-pt-new-kronecker-p... #gpt2 #kroneckerproduct
0
0
0
0