ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
Bairu Hou, Yang Zhang, Jiabao Ji, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang
Action editor: Jinwoo Shin
https://openreview.net/forum?id=V51gPu1uQD
#thinking #thinkprune #pruning
2
0
0
0