SpikingMamba: Towards Energy-Efficient Large Language Models via Knowledge Distillation from Mamba
Yulong Huang, Jianxiong Tang, Chao Wang et al.
Action editor: John Timothy Halloran
https://openreview.net/forum?id=uxb2jcCLxt
#spiking #spikingmamba #spike
1
0
0
0