Fernanda Viégas (@viegas) Bsky

As AI capabilities increase, we need a broad, deep, society-wide discussion of what limits make sense, and how we can hold the government meaningfully accountable to citizens. For that reason, I stand with Anthropic and anyone else who is avoiding a rush toward mass AI surveillance.

1 month ago 26 5 2 0

How do AI chatbots see us? BKC Spring Speaker Series EventWhen you talk with a chatbot, what does it “think” about you? Recent work in AI interpretability, based on high-dimensional geometry, is beginning to provide some intrig...

Join us tomorrow as @wattenberg.bsky.social and I talk about how instrumenting AI chatbots with real-time dashboards can help reveal social cognition capabilities -- something that can be both useful and problematic.

This talk is open to the public.

cyber.harvard.edu/events/how-d...

1 year ago 6 0 0 0

ARBORproject arborproject.github.io · Discussions Explore the GitHub Discussions forum for ARBORproject arborproject.github.io. Discuss code, ask questions & collaborate with the developer community.

Take a look at some initial research projects, and see if there's one you'd like to work on:
github.com/ARBORproject...
Or propose your own idea! There are many ways to contribute, and we welcome all of them.

1 year ago 8 2 1 0

GitHub - ARBORproject/arborproject.github.io Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.

Excited to announce ARBOR: a radically-open project on AI interpretability for reasoning models
github.com/ARBORproject...

Join us in collectively analyzing and interpreting how reasoning works!

1 year ago 12 1 0 0

Posts by Fernanda Viégas