Boomerang Distillation Enables Zero‑Shot Interpolation of Model Sizes
Boomerang distillation creates a size range from one teacher‑student pair, needing no extra training after distillation. The team released code on GitHub. Read more: getnews.me/boomerang-distillation-e... #boomerangdistillation #modelscaling
0
0
0
0