In a new post, Tom Davidson drafts a model spec to guide how AI gives advice in key scenarios, and compares some ideal examples of AI advice to what today's leading models actually say.
Read it here: www.forethought.org/research/ai...
Posts by Forethought
As AI gets smarter, people will increasingly turn to it for advice on important decisions, so the quality of AI advice really matters.
How is moral diversity valuable for achieving a near-best future? A new post introduces several models for thinking about the value of moral diversity as the number of powerholders scales.
Read it here: newsletter.forethought.org/p/the-value...
AI could dramatically transform how we collectively determine what's true—for better or worse. In a new post, the authors map out the possible impacts of AI on society's epistemics: the good, the bad and the ugly.
In a new post, the authors present design sketches exploring how AI-enabled coordination tech could be built to favor defense over offense.
Read it here: www.forethought.org/research/de...
Near-term AI could make it dramatically easier for groups to find deals, resolve disputes, and hold each other accountable. But the same tools could enable collusion and worse.
New post: AIs should (sometimes) be proactively prosocial.
Read it here: www.forethought.org/research/ai...
x.com/willmacaski...
What if... we could use AI to help build the kind of AI that would empower us to work out what's true?
Introducing: AI for AI for epistemics.
www.forethought.org/research/ai...
New post: concrete projects to prepare for superintelligence.
Read it here: www.forethought.org/research/co...
x.com/willmacaski...
New post: William MacAskill and Tom Davidson argue that AI character is a big deal.
Read it here: www.forethought.org/research/the...
New post: should we lock in post-AGI agreements under uncertainty?
Read it here: www.forethought.org/research/sh...
We argue that making sure future people can coordinate to fund moral public goods could be a big deal for how well the long-term future goes: www.forethought.org/research/mo...
"Moral public goods" are things many people value for ethical reasons, but where no individual's contribution is worth it unless others contribute too, creating large potential gains from coordination.
New podcast episode: chatting with economist Sam Hammond about what happens to public institutions when AI collapses transaction costs.
www.youtube.com/watch?v=grG...
Today we’re publishing another set of design sketches, illustrating what some of these tools might look like more concretely.
You can read the full article here: www.forethought.org/research/de...
Tools for strategic awareness could deepen people’s understanding of what’s actually going on around them, making it easier for them to make good decisions in their own interests. This would have big implications both for individuals and for collective decision-making.
We’ve recently published two pieces of background research that informed our thinking on an international AGI project:
• An overview of some international organisations, with their voting structures
• The UN Charter: a case study in international governance
The first set of design sketches focus on collective epistemics: tools that make it easy to know what’s trustworthy and reward honesty.
Technologies powered by near-term AI systems could transform our ability to reason and coordinate, significantly improving our chances of safely navigating the transition to advanced AI.
Last week, we launched series of design sketches for specific technologies that we think could help.
We think tools like this will be possible soon, and could meaningfully help humanity to navigate the transition to advanced AI. Today we’re publishing a set of design sketches describing some of these tools in more detail.
Imagine having a technological analogue to an ‘angels-on-the-shoulder’: a customised tool or tools that help you make better decisions in real time, decisions that you more deeply endorse after the fact.
Should we focus on worlds where AGI comes in the next few years? People often argue yes, because short timelines have higher leverage. We're not so sure. New post arguing that for many people, 2035+ timelines might be highest leverage: www.forethought.org/research/sh...