Am I missing something here, or did they train a model to spout gibberish after a specific rare token and then consider it noteworthy when it works? www.anthropic.com/research/sma...
Posts by Pete Werner
Arguably RL has learnt something more general, i.e. what to do when encountering the plus operator, which can be applied or extrapolated to instances outside its training data.
Not familiar with the source that sparked this, but take the context of SFT vs RL trying to learn the plus operator. SFT can conceivably rote-learn every a + b = c, while RL could learn: if a and b are numeric, put the sum after the = symbol.
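The SFT-vs-RL contrast above can be pictured with a toy sketch (entirely hypothetical code, just illustrating the intuition, not how either method actually works):

```python
# "SFT" as rote memorization: a lookup table of every a + b = c seen in training.
sft_memory = {(a, b): a + b for a in range(10) for b in range(10)}

def sft_answer(a, b):
    # Returns None outside the training data: no extrapolation.
    return sft_memory.get((a, b))

# "RL" as having learnt the general rule: if a and b are numeric, emit their sum.
def rl_answer(a, b):
    if isinstance(a, (int, float)) and isinstance(b, (int, float)):
        return a + b
    return None

print(sft_answer(3, 4))      # 7 — a pair seen in training
print(sft_answer(123, 456))  # None — outside the table
print(rl_answer(123, 456))   # 579 — the rule extrapolates
```

The point of the sketch: the table only ever covers its training pairs, while the rule applies to any numeric inputs.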
The impressive thing about Gen AI is how often it actually works
Is it not about being able to validate a candidate response independent of any initial training data?
I will be visiting Atlassian in a few weeks for a panel discussion on Reinforcement Learning; come along if you’re in Sydney www.aicamp.ai/event/eventd...
Another banger from Eugene Yan
Feel like I don’t hear AGI as much as I did 3-6 months ago. I guess the checks have cleared.
No, I hope you talk to someone if you think it might help, and are feeling better soon either way
It actually sounds like you may be depressed
Fleshing out a proposal with ChatGPT: 5 minutes
Validating the details: 4 hours
I block a lot of words, like prominent names etc. It’s just not a conversation I can meaningfully contribute to or engage with
I don’t mind Gemini but they never listen to their customers.
If you feel old, ChatGPT just told me “you’re among the ancient ones of the web.”
Dang that looks good
Meme guy dropping truths
Tom Clancy
Startup idea: Secure MCP. It’s just mcp but the logo is a padlock.
Nice!
Hot take: Apple is second only to NVIDIA when it comes to AI. They have been doing it a long time, with their own hardware and, importantly, mature and robust software on top of it. #wwdc
I aspire to the level of brazenness that whoever makes the marketing charts for NVIDIA has attained
Remember in 2016, when people were going to hail a self-driving Uber instead of owning a car and driving themselves
RIP Civit
An ablation study is not mathematical rigor. It’s an empirical experiment.
It’s gotta happen imo. Book to chapters, chapters to paragraphs, paragraphs to sentences, sentences to words, words to letters. Low frequency to high frequency.
If you can’t think of any good use cases for LLMs maybe you’re just boring and uncreative
If you are in Sydney this April 30, I will be giving a talk at AI Camp on scaling up AI services: how we built and scaled the core AI services that drove our product to over 10 million users. Be sure to come along if it sounds of interest. www.aicamp.ai/event/eventd...
Open source is fine, but it’s not possible to compete against someone like Google, who provides production services at a loss. Unless you have funding and can do the same — which still puts control in the hands of the few who can run at a loss for extended periods of time.