Advertisement · 728 × 90

Posts by Muru Zhang

Great to be part of this project led by the amazing @hamishivi.bsky.social. The most fun (in retrospect) thing is to observe how the results start to shift as we scale up the candidate pool, evaluation suite, and selection size :) And eventually we find a simple method does the best!

1 year ago 2 1 0 0
Post image

How well do data-selection methods work for instruction-tuning at scale?

Turns out, when you look at large, varied data pools, lots of recent methods lag behind simple baselines, and a simple embedding-based method (RDS) does best!

More below ⬇️ (1/8)

1 year ago 13 4 1 2

This is a great effort for the migration, thanks for putting it together! Can I be added to the list?

1 year ago 1 0 1 0