๐จ First ever post here~ New paper out! Instruct models are not always the best. Scaling down ๐ instruction tuning strength via partial adaptation leads to material gains ๐ on few-shot in-context learning NLP tasks across model families and sizes.
arxiv.org/abs/2504.11626
1 year ago
11
3
0
0