For those interested, I previously wrote about how Anthropic and others co-opt safety and defense terminology to take over the arbitration of risk determinations, thus subverting democratic processes:
arxiv.org/abs/2504.15088
Palantir's manifesto is nothing new to those of us who have been critical of them for years. But it's a reminder that Anthropic has a partnership with them, so it's crucial that people understand the politics of Anthropic's unsubstantiated AI claims and their relation to fulfilling the manifesto.
I wrote this skeet as a hypothetical and then 24 hours later 🫠
Why is pro-AI still posturing as anti-AI, and why is the general public still willing to believe the charade?
On @channel4news.bsky.social @heidykhlaaf.bsky.social explains the stakes of Anthropic’s Mythos and its impact on security. We lack the evidence and independent expertise needed to verify the claims around the security tool - especially problematic given the risks posed by LLMs.
shorturl.at/kNtU9
I joined Hari Sreenivasan on CNN International and PBS to discuss the use of AI in warfare and the impacts we're already seeing of this fallible technology being used in Iran, and how it ultimately obscures accountability. Full interview can be found at youtube.com/watch?v=w16f...
Not enough people are concerned with how AI companies are gaining access to nuclear secrets, now including uranium enrichment. This raises serious concerns that such access may lead to nuclear proliferation and further entrench power asymmetries.
www.centrusenergy.com/news/centrus...
As @amostoh.bsky.social and I explain in our new report, the military has been ramping up its adoption of AI, while oversight and safeguards have failed to keep up.
But the Pentagon’s dispute with Anthropic has brought a grave threat into focus: using AI to pry into Americans’ private lives 🧵 1/
The report distinguishes Decision Support Systems from Autonomous Weapons Systems; Claude is currently deployed as the former in Maven.
Note how the AI "recommendations" are completely obscured, with little to no ability to verify or trace their outputs. This is what we mean when we say the distinctions between DSS and AWS are superficial in practice, especially when operators are given seconds to approve.
It was great to join @aljazeera.com's podcast "The Take" to discuss the details of the DoW's use of Claude in Iran, as well as the stand-off between DoW and Anthropic that was largely safety theatre.
www.youtube.com/watch?v=skyI...
In this Tech Policy piece, I criticize how framings of Anthropic’s & OpenAI’s negotiations with the US’s DoW overindex on myopic interpretations of human oversight, papering over what should be the real target of our scrutiny: that generative AI algorithms are a flawed and inaccurate technology.
Exactly.
It’s egregious for the WaPo to describe speed as the advantage against Iran w/ Claude. When these systems are incredibly inaccurate, they may as well be enabling indiscriminate targeting (e.g. schools), which isn’t the strategic win they’re framing it as.
www.washingtonpost.com/technology/2...
Was happy to speak to Vox on OpenAI's alleged AWS guardrails. Besides current guardrails being trivial to bypass, they can't enforce human oversight over the outputs of an AI algorithm. It's an operational matter, not a technical one, and thus not something any guardrail can enforce.
www.vox.com/future-perfe...
Using foundation models in national security contexts may introduce unique concerns threatening human rights. For example, a government’s ability to train models on citizens’ data obtained through commercial data brokers that would otherwise need a warrant, court order, or subpoena to obtain may allow governments to further exercise coercive powers that are automated through AI decision-making [6]. Such use may subvert due process, exacerbated when inaccurate outputs inflict unjust harms on civilians. Appropriate interventions may include the extension of data minimization principles to include purpose limitations on the collection, processing, and transfer of personal data to third parties for intelligence purposes.
The Atlantic notes how the Pentagon wants to "analyze bulk data collected from Americans." From our "Mind the Gap" paper (2024), a snippet I have come back to what seems like dozens of times at this point.
www.theatlantic.com/technology/2...
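For illustration only, here is a minimal, hypothetical sketch (my own; none of these names come from the paper) of what a purpose-limitation check on broker-sourced personal data, as described in the snippet above, could look like in code:

```python
# Hypothetical sketch of a purpose-limitation check, in the spirit of
# the "Mind the Gap" snippet above. All names are illustrative.

from dataclasses import dataclass

@dataclass
class AccessRequest:
    purpose: str                      # e.g. "intelligence", "billing"
    data_source: str                  # e.g. "commercial_broker", "first_party"
    legal_process: str | None = None  # warrant / court order / subpoena ID, if any

def may_release_personal_data(req: AccessRequest) -> bool:
    # Purpose limitation: broker-sourced personal data may not be
    # repurposed for intelligence use without documented legal process.
    if req.purpose == "intelligence" and req.data_source == "commercial_broker":
        return req.legal_process is not None
    return True

# Usage: a request lacking documented legal process is refused.
assert not may_release_personal_data(
    AccessRequest(purpose="intelligence", data_source="commercial_broker")
)
```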
The Anthropic-U.S. DoD public dispute continues. We have unpacked the latest updates & the safety, legal & ethical concerns in our @opiniojuris.bsky.social article
@jessicadorsey.bsky.social @elkeschwarz.bsky.social @profbode.bsky.social @ncrenic.bsky.social
opiniojuris.org/2026/03/02/i...
This account includes an eyewitness.
Also: “At least 85 people, almost all of them young girls, have been killed in an air strike on a primary school in southern Iran, the Iranian judiciary said.”
In case you’re just waking up, the U.S. has teamed up with Israel overnight to start an illegal war of regime change, apparently on a presidential whim with no involvement of Congress, and they are already committing horrific atrocities.
I consider this a loss rather than a win: just a few years ago the red line was any military use; now it's LAWS, the most extreme use case. AI companies have successfully moved safety thresholds without effective internal pushback.
www.nytimes.com/2026/02/26/t...
I have to give Anthropic credit for recognizing that deploying unreliable AI in AWS is not strategic for the future of AI. But there's a very fine line between DSS and AWS in practice due to automation bias: if they don't believe it's reliable for the latter, it's not reliable for the former either.
Some real cognitive dissonance happening with takes saying "but Anthropic HAD to drop their safety measures, they're the good guys you see!" Anyway, from our paper last year:
If flawed and inaccurate LLMs are instrumented in AWS to replace humans in decision-making, then "wars" may as well be indiscriminate lethal campaigns. Anthropic's position also isn't a moral high ground given their AI-DSS uses w/ Palantir, where automation bias may lead to similar outcomes.
There's a constant AI-washing of terms so these companies can claim they're using AI to solve a problem that doesn't exist. Static analysis and formal methods tools also put forward suggestions; have they even used these tools?
Claude Code may also generate up to 90% insecure code (arxiv.org/pdf/2512.03262).
As a formal methods PhD, I find it embarrassing that Anthropic incorrectly describes static analysis in their Claude Code Security announcement. Security and formal methods engineers already have tools that "reason" over data; that isn't the bottleneck. False positives, which LLMs absolutely produce, are.
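To make the false-positive point concrete, a minimal, hypothetical sketch (mine, not from Anthropic's announcement or any real tool) of the classic pattern that eats security engineers' triage time:

```python
# Hypothetical example of a classic static-analysis false positive.
# All names (get_user, db) are illustrative, not from a real codebase.

def get_user(db, user_input: str):
    # int() both validates and neutralizes the input: anything that
    # isn't a plain integer raises ValueError before any query runs.
    user_id = int(user_input)
    # A taint analyzer that doesn't model int() as a sanitizer will
    # still flag this line as SQL injection. Triaging reports like
    # this, not a lack of "reasoning" tools, is the bottleneck; an
    # LLM that also hallucinates findings only adds to the pile.
    return db.execute(f"SELECT * FROM users WHERE id = {user_id}")
```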