Posts by Lintang Sutawika

Can you train a performant language model using only openly licensed text?

We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance of similar models like LLaMA 1 & 2.

10 months ago 146 60 2 2

Damn, where are these parties I’m missing 😂

1 year ago 2 0 0 0

Technically, we do, but a lot of that goes toward paying tuition. Not unlike the 20k for these agents going towards GPU compute 🤪

1 year ago 0 0 1 0

Maybe he thought it was “locker room talk” 🤪

1 year ago 0 0 0 0
Google “We Have No Moat, And Neither Does OpenAI”: Leaked Internal Google Document Claims Open Source AI Will Outcompete Google and OpenAI. The text below is a very recent leaked document, which was shared by an anonymous individual on a public Disc…

Feels like a great time to re-share this

semianalysis.com/2023/05/04/g...

1 year ago 9 0 0 0

They're future-proofing the design 😎

1 year ago 2 0 1 0

The `decision model n` is being directed by mission control and then forwards a signal to `big data`?? I guess no decision was ever made 😂😂😂

1 year ago 1 0 1 0

Maybe. But more likely, they're using QwQ or DeepSeek.

1 year ago 4 0 1 0

Transformers demonstrated how to attend to an entire sequence at once, which at the time was different from many approaches like LSTMs that processed tokens sequentially. That attention span across the whole sequence does parallel the aliens from Arrival.

1 year ago 5 0 0 0

🙋‍♂️

1 year ago 1 0 0 0

Attended 2 different lectures (1 class and 1 invited guest lecture) with a similar topic: inference-time scaling. Maybe the matrix is trying to tell me something.

1 year ago 0 0 0 0

Lectures in #nlp I see that use Taylor Swift to illustrate concepts.

1 year ago 1 0 0 0

@eleutherai.bsky.social is our official account. Will be posting here and on Twitter from now on.

1 year ago 20 3 2 0

LTI PhDs seeking refuge in Bluesky
go.bsky.app/NhTwCVb

1 year ago 4 0 0 0

Hi, I would also like to be included in this list!

1 year ago 1 0 0 0