
Posts by Thomas Capelle

This is the way, glad you liked it!

1 year ago 2 0 0 0

it's a really nice place, I agree!

1 year ago 0 0 0 0

They basically jailbreak gpt-4o

1 year ago 0 0 1 0

Same vibes git commit -m "pbar"

1 year ago 1 0 0 0

happy to take a look on a call =)

1 year ago 0 0 0 0

Can you share a workspace?

1 year ago 0 0 1 0

This was a team effort from @morgymcg.bsky.social, Soumik, @parambharat.bsky.social, Agata Mlynarczyk, @ayshthkr.bsky.social and many others!

1 year ago 0 0 0 0
Preview
Local Weave Scorers | W&B Weave Weave's local scorers are a suite of small language models that run locally on your machine with minimal latency. These models evaluate the safety and quality of your AI system’s inputs, context, and ...

I'm excited to see how the community uses these tools, and I'm looking forward to more innovations in safe and reproducible AI!

Check the scorers and Weave here:

👉 wandb.me/weave_scorers

📚 A colab: wandb.me/scorers_colab

1 year ago 0 0 1 0

A personal highlight was working on the Fluency Scorer powered by AnswerDotAI ModernBERT-base; we hope to move all DeBERTa-powered scorers to ModernBERT in the next release so we can benefit from the longer context length and faster training!

1 year ago 0 0 1 0

As part of this initiative, we also created comprehensive evaluation datasets, drawing on invaluable contributions from the open-source community. Being a reproducibility-first company, we’ve made the full recipe public, including the scorers, model weights, and the training and evaluation datasets.

1 year ago 0 0 1 0

We designed these non-LLM-powered scorers to leverage state-of-the-art open-source models – from the PleIAs/celadon toxicity detector to the Vectara hallucination scorer – ensuring that our AI systems are evaluated across multiple dimensions.

1 year ago 0 0 1 0

Over the past few months, my team at Weights & Biases has been hard at work launching Weave Scorers and guardrails.

wandb.me/weave_scorers

👇

1 year ago 0 0 1 0
[GIF: Disappointed Cat]
1 year ago 1 0 1 0
Post image

We are cooking here...

1 year ago 2 0 0 0

Same vibes, PR submitted, PR merged.

1 year ago 1 0 0 0
Post image

- new MacBook pro 😍
- french keyboard layout 😭

1 year ago 0 0 0 0
Post image

Many people have asked me about the France Action Summit.

I think a summit is typically most valuable as a catalyst, not as a solution in itself.

But, will share some observations.

1 year ago 42 10 2 2

It could have been called Gulf of North America

1 year ago 0 0 0 0

This is my favorite kind of Yoga

1 year ago 1 0 0 0

I just built a CI to run an Eval of some custom LLM scorers on top of @modal-labs.bsky.social
- Great to test against different GPUs
- No custom runner needed on GitHub
- Fast and nice console outputs =)

1 year ago 2 0 0 0
Post image

Butternut Soup and kimchi side

1 year ago 3 0 0 0
Post image

Shallots, sun-dried tomatoes, and mussels.

1 year ago 0 0 0 0

On Saturday you make moules-frites; on Sunday you finish the mussels in a mussel risotto.

1 year ago 0 0 1 0

Why is the USA the only country in the world with bird flu H5N1 ripping through cattle herds?

Because in the United States, it’s legal to feed chicken shit to cattle.

That’s why. That’s literally the reason

www.telegraph.co.uk/global-healt...

1 year ago 14913 4244 262 425
Post image Post image

Pancakes morning with the arrival of the @vendeeglobe.bsky.social

1 year ago 0 0 0 0
Post image

This is really good!

1 year ago 1 0 0 0

Don't miss Stacey in Paris!

1 year ago 0 0 0 0

We raised this internally! thanks for the info.

1 year ago 2 0 1 0

This budget forcing is really smart. We could do that with prefill on API models, no?

1 year ago 0 0 1 0
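
A rough sketch of what the prefill idea in the post above might look like. Budget forcing extends a model's reasoning by appending a nudge such as "Wait," whenever it tries to stop; with an API that accepts a prefilled assistant turn, the same effect could be approximated by sending the model's own output back as the prefill. Everything named here is an assumption for illustration: `complete` is a hypothetical callable standing in for a real API client, `fake_complete` is an offline stub, and the nudge string and round count are arbitrary.

```python
from typing import Callable

# Hypothetical signature: (user prompt, assistant text so far) -> new continuation only.
CompleteFn = Callable[[str, str], str]


def budget_force(complete: CompleteFn, prompt: str,
                 extra_rounds: int = 2, nudge: str = "Wait,") -> str:
    """Lengthen the reasoning by re-prefilling the model's output plus a nudge."""
    reasoning = complete(prompt, "")  # first pass, no prefill
    for _ in range(extra_rounds):
        # Drop trailing whitespace, then append the nudge as the new prefill tail.
        prefill = reasoning.rstrip() + "\n" + nudge
        # The model continues from the nudge instead of stopping.
        reasoning = prefill + complete(prompt, prefill)
    return reasoning


def fake_complete(prompt: str, prefill: str) -> str:
    """Offline stand-in for a real API call (assumption, not a real client)."""
    return "First pass." if not prefill else " on reflection, check again."


result = budget_force(fake_complete, "What is 2 + 2?", extra_rounds=2)
print(result)
```

Swapping `fake_complete` for a thin wrapper around a real client would be the only change needed, provided that client supports continuing from a partial assistant message.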
Post image

This is getting out of hand...

1 year ago 6 0 0 0