Can a language model improve itself without an external verifier? We pose self-improvement as a computational challenge, and show how self-training might surmount it. Joint work with @djfoster.bsky.social and MSR.
Self-Improvement in Language Models: The Sharpening Mechanism
arxiv.org/abs/2412.01951
1 year ago