Martin Wattenberg (@wattenberg) Bsky

Everyone's talking about AI sycophancy and meanwhile ChatGPT just called my writing "very salvageable"

3 weeks ago 79 4 3 0

As AI capabilities increase, we need a broad, deep, society-wide discussion of what limits make sense, and how we can hold the government meaningfully accountable to citizens. For that reason, I stand with Anthropic and anyone else who is avoiding a rush toward mass AI surveillance.

1 month ago 26 5 2 0

That offers governments a vast, unprecedented level of power over their citizens. In evil hands, that’s obviously a disaster. But, like the framers of the US Constitution, I believe it’s also wrong to give absolute powers to democratically elected leaders, or people you think of as the “good guys.”

1 month ago 16 0 1 0

Before now, there was always a natural barrier on the power of surveillance. Even in the limit, if everyone’s actions were recorded all the time, there wouldn’t be enough people and time to watch and analyze the entire footage. But AI threatens to make that natural barrier completely obsolete.

1 month ago 19 3 1 0

I want to talk about why AI-based mass surveillance is so dangerous, and why I would oppose it no matter which party or president is in office.

1 month ago 49 10 3 0

What a cool idea! And I love the overall aesthetic!

1 month ago 2 0 1 0

This is @garrykasparov.bsky.social versus Deep Blue (game 2). Explore and interact with other games here:

moebio.com/chess/

(including fast Hikaru versus Magnus, longest, shortest and oldest games ever!)

1 month ago 17 4 1 1

The 2025 Name of the Year is Elara She doesn’t exist. Yet she’s everywhere—and she’s all of us.

Fascinating test! See also namerology.com/2025/12/15/2... for a deep dive on one AI name, "Elara". cc @babynames.bsky.social

3 months ago 3 0 0 0

I watched an animated movie in Palo Alto in the 90s, and when the credit appeared for a specific piece of graphics software, there was applause from the audience!

4 months ago 6 0 1 0

In 2025, AI left its imprint on everything—even names! If you've ever asked an AI to tell you a story, you've probably seen the Elara-Elena-Clara nexus...

4 months ago 7 0 0 0

I want to play with this clever book myself!

4 months ago 21 1 0 0

It will hide other names, if you ask!

4 months ago 0 0 0 0

Trying this a few more times, it turns out to work only sporadically. But still, that is one observant neural network!

4 months ago 0 0 1 0

I asked for a caricature of myself in the style of Al Hirschfeld, and Gemini knew to hide a NINA in my hair 😲

4 months ago 17 0 2 0

Charts and graphs help people analyze data, but can they also help AI?

In a new paper, we provide initial evidence that it does! GPT 4.1 and Claude 3.5 describe three synthetic datasets more precisely and accurately when raw data is accompanied by a scatter plot. Read more in🧵!

8 months ago 8 2 1 0

What AI Thinks It Knows About You What happens when people can see what assumptions a large language model is making about them?

AI is often thought of as a black box -- no way to know what's going on inside. That's changing in eye-opening ways. Researchers are finding "beliefs" models are forming as they converse, and how those beliefs correlate to what the models say and how they say it.

www.theatlantic.com/technology/a...

11 months ago 32 13 4 2

Historical popularity chart showing the popularity of Oliver rising to meet the previously much greater popularity of Olivia

The interactive NameGrapher is updated with 2024 baby name popularity stats! Come explore--and marvel that Oliver and Olivia have converged namerology.com/baby-name-gr...

11 months ago 9 2 1 0

A wonderful visualization for those of us obsessed by sunlight and geography!

11 months ago 28 1 1 0

An incredibly rich, detailed view of neural net internals! There are so many insights in these papers. And the visualizations of "addition circuit" features are just plain cool!

1 year ago 16 2 0 0

Great news, congrats! And glad you’ll still be in the neighborhood!

1 year ago 1 0 0 0

I'd be curious about advice on teaching non-coders how to test programs they've written with AI. I'm not thinking unit tests so much as things like making sure you can drill down for verifiable details in a visualization—basic practices that are good on their own, but also help catch errors.

1 year ago 10 0 2 0

Now that we have vibe coding, we need vibe testing!

1 year ago 22 4 6 0

Oh, that looks super relevant and fascinating, reading through it now...

1 year ago 1 0 0 0

Ha! I think (!) that for me, the word "calculate" connotes narrow precision and correctness, whereas "think" is more expansive but also implies more fuzziness and the possibility of being wrong. That said, your observation does give me pause!

1 year ago 0 0 0 0

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT)...

We're following the terminology of the DeepSeek-R1 paper that introduced this model: arxiv.org/abs/2501.12948 Whether it's really the best metaphor is certainly worth asking! I can see pros and cons for both "thinking" and "calculating"

1 year ago 1 0 1 0

Reasoning or Performing: locating "breakthrough" in the model's reasoning · ARBORproject arborproject.github.io · Discussion #11 Research Question When asked the DeepSeek models a challenging abstract algebra question, they often generated hundreds of tokens of reasoning before providing the final answer. Yet, on some questi...

These are great questions! I believe there's at least one graph of p(correct answer) on the main Arbor discussion page, and generally there are a lot more details: github.com/ARBORproject...

1 year ago 1 0 1 1

Interesting question! I haven't calculated this, but @yidachen.bsky.social might know

1 year ago 1 0 0 0

Colorful depictions of reasoning progress: most of the time the system settles on the correct answer but sometimes it vacillates in interesting ways.

This is a common pattern, but we're also seeing some others! Here are similar views for multiple-choice abstract algebra questions (green is the correct answer; other colors are incorrect answers) You can see many more at yc015.github.io/reasoning-pr... cc @yidachen.bsky.social

1 year ago 5 0 3 0

GitHub - ARBORproject/arborproject.github.io Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.

Very cool! You're definitely not alone in finding this fascinating. If you're looking for other people interested in this kind of thing, drop by the Arbor Project page, if you haven't already. github.com/ArborProject...

1 year ago 3 0 1 0

The wind map at hint.fm/wind/ has been running since 2012, relying on weather data from NOAA. We added a notice like this today. Thanks to @cambecc.bsky.social for the inspiration.

1 year ago 78 20 1 1

Posts by Martin Wattenberg