
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning

Bairu Hou, Yang Zhang, Jiabao Ji, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang

Action editor: Jinwoo Shin

https://openreview.net/forum?id=V51gPu1uQD

#thinking #thinkprune #pruning
