The Super Weight in Large Language Models
Setting as few as a single weight to zero will make various LLMs go from generating coherent text to outputting gibberish.
arxiv.org/abs/2411.07191
1 year ago
12
3
2
0