Advertisement ยท 728 ร— 90

Posts by Jonathan Hayase

Tokenizers govern the allocation of computation. It's a waste to spend a whole token of compute predicting the "way" in "By the way". SuperBPE redirects that compute to predict more difficult tokens, leading to wins on downstream tasks!

1 year ago 4 1 0 0
poster for paper

poster for paper

excited to be at #NeurIPS2024! I'll be presenting our data mixture inference attack ๐Ÿ—“๏ธ Thu 4:30pm w/ @jon.jon.ke โ€” stop by to learn what trained tokenizers reveal about LLM development (โ€ผ๏ธ) and chat about all things tokenizers.

๐Ÿ”— arxiv.org/abs/2407.16607

1 year ago 13 4 0 0