Advertisement · 728 × 90

Posts by stochasm

Bless everyone who records their talks

1 year ago 3 0 0 0
Post image

“They said it could not be done”. We’re releasing Pleias 1.0, the first suite of models trained on open data (either permissibly licensed or uncopyrighted): Pleias-3b, Pleias-1b and Pleias-350m, all based on the two trillion tokens set from Common Corpus.

1 year ago 248 85 11 19

What are some solid pieces of writing in the vein of situational awareness and machines of loving grace?

1 year ago 0 0 0 0

Yeah, and thinking about it more, I guess a larger general model would probably beat out smaller specialized models too

1 year ago 1 0 0 0

What about with each agent having a different model?

1 year ago 0 0 1 0

You love to see it

1 year ago 0 0 0 0

Good one haha

1 year ago 2 0 0 0

Oh I see, that’s pretty cool

1 year ago 0 0 0 0

Why would you do this? Lol

1 year ago 0 0 1 0
Advertisement

Classic

1 year ago 1 0 0 0

Many are saying

1 year ago 1 0 0 0

It’s time to swap my terminal from simple dark/grayscale to colorful again. The question is, which color scheme?

1 year ago 0 0 1 0

Good morning everyone what’s the plan for today

1 year ago 0 0 0 0

I like the idea of the first name being a subdomain

1 year ago 0 0 0 0

Thanks!

1 year ago 0 0 0 0

I thought I was, but will check when I try again today

1 year ago 0 0 0 0

Can blocklists contain blocklists? Can you have a cycle in the graph?

1 year ago 3 0 0 2

White noise is the meta I’ve heard

1 year ago 2 0 0 0

Yeah I needed to see this today lol

1 year ago 1 0 0 0
Advertisement

Yeah makes sense honestly

1 year ago 2 0 0 0

Thanks!

1 year ago 0 0 0 0

Lol you just use F.sdpa?

1 year ago 1 0 1 0

Why is it so slow

1 year ago 3 0 4 0

Building wheel for flash-attn (setup.py) … /

1 year ago 6 0 1 0

Holy

1 year ago 1 0 0 0

scraping is hitting a wall

1 year ago 28 3 1 0
Advertisement

I think they do mean that it’s a preview/experimental model since they’ve identified some issues with it in their blog post

1 year ago 0 0 0 0
Post image

With some sort of scaling graph too! Wish they were more clear about what the x-axis represents

1 year ago 3 0 1 0

Finished training in the morning and evals already done is impressive speed

1 year ago 0 0 1 0

Wow

1 year ago 0 0 0 0