Advertisement · 728 × 90

Posts by Integrity Institute

Preview
Examining Constitutional AI — Integrity Institute By Theodora Skeadas, Iain Levine, Stephanie Nakano, and Leah Ferentinos This blog is the first in a series from the Integrity Institute that examines the risks and potential harms that Claude’s con...

I recently collaborated with a group of @integrity-inst.bsky.social - Iain Levine, Stephanie Nakano, and Leah Ferentinos - to reflect on Claude’s AI Constitution.

www.integrityinstitute.org/blog/examini...

16 hours ago 1 1 1 0
Preview
Mental Health as a Hidden Cost of AI Systems A blogpost about why mental health must be treated as a core consideration in AI design and evaluation

Where does mental health fit in how we design and evaluate AI systems? As AI becomes more embedded in daily life and work, its psychological impact—on both users and the people behind these systems—is becoming harder to ignore.

Read our recent blog post: humane-intelligence.org/post/mental-...

6 days ago 0 1 0 0

Written by @theodoraskeadas.bsky.social, @iainlevine.bsky.social, #StephanieNakano, and @leahf.bsky.social

#AIGovernance #TechPolicy #ConstitutionalAI

1 week ago 0 0 1 0
Preview
Examining Constitutional AI — Integrity Institute By Theodora Skeadas, Iain Levine, Stephanie Nakano, and Leah Ferentinos This blog is the first in a series from the Integrity Institute that examines the risks and potential harms that Claude’s con...

🚨 New blog series from the Integrity Institute:

Our first post examines the risks of @anthropic.com's Claude constitution for a broad user base - from what "constitutional AI" really means to why Anthropic should be citing human rights law.

Read here 🔗 bit.ly/4clWJVM

Stay tuned for more!

1 week ago 1 1 2 0
Post image

Jenn Louie, Ryan Roberts, and I have published a piece in @techpolicypress.bsky.social that examines moral injury in the trust and safety profession. I first became aware of this phenomenon when Jenn hosted a series of powerful discussions on moral injury through the @integrity-inst.bsky.social.

2 weeks ago 0 1 1 0
Post image

The Integrity Institute has launched a new pro bono initiative connecting US state lawmakers with Trust & Safety practitioners.

We provide nonpartisan, real-world expertise to inform tech policy - grounded in the operational realities of those addressing online harms.

Sign up: bit.ly/4vfxehR

2 weeks ago 0 0 0 0
Post image

I am very excited to share that the @integrity-inst.bsky.social has recently launched a new, pro bono "Advising US State Lawmakers" initiative. We now have a formal process to connect lawmakers with practitioners who can offer nonpartisan, practical input on technology policy.

3 weeks ago 1 1 1 0
Advertisement

@leahf.bsky.social #AdityaGautam #ChrisMiles #OmriTubiana #ArushiSaxena #JJMartinezLayuno #DavidJay

#AI #LLMs #Misinformation #TrustAndSafety #Ethics #ResponsibleAI #TechPolicy #ContentModeration #Governance #DigitalTrust #PlatformAccountability

3 weeks ago 0 0 0 0
Preview
Managing Misinformation in Large Language Models — Integrity Institute By Leah Ferentinos, Aditya Gautam, Chris Miles, Omri Tubiana, Arushi Saxena, J. J. Martinez Layuno and David Jay A growing set of tools and frameworks now exists to help ensure AI systems are built ...

Earlier this month, II members published a piece on managing misinfo in LLMs.

The article explores challenges like hallucinations and scaling safeguards. Practitioners identify the tools & frameworks that can help ensure AI systems are built on verified, high-quality info.

Read here bit.ly/4bKow1Z

3 weeks ago 0 0 1 0

@theodoraskeadas.bsky.social @leahf.bsky.social @vaishnavi.bsky.social #MaggieEngler #RohitPoduval

#AISafety #AIGovernance #PublicInterestTech #DigitalSafety #AIAgents #AIAccountability #SafetyByDesign #GovTech #NIST

4 weeks ago 0 0 0 1
Preview
Information About Securing AI Agent Systems — Integrity Institute By Theodora Skeadas, Leah Ferentinos, Rohit Poduval, and Maggie Engler Integrity Institute members respond to the Center for AI Standards and Innovation (CAISI), National Institute of Standards and ...

II members recently submitted input to NIST’s CAISI request for information on securing AI agent systems.

Members highlight:
▶️ New risks from autonomous agents
▶️ Gaps in current cybersecurity
▶️ The need for stronger evaluation & safeguards

Read the submission in full here: 🔗 bit.ly/4lUnMfy

4 weeks ago 2 1 2 0
Preview
Campaign Organizer – Digital Rights & Online Privacy - Internship We are seeking a part-time early-career digital rights advocate, policy student, or campaign organizer to help build a public interest campaign addressing a new wave of state-level online age-verifica

If you’ve led grassroots campaigns against surveillance or censorship, COSL wants you. Join us as Campaign Organizer (Digital Rights & Online Privacy) and coordinate global partners fighting for safer, freer digital spaces. Apply here: https://ideali.st/AsH7SS #FreeExpression #Activism #intern

1 month ago 0 2 0 0

@theodoraskeadas.bsky.social @sarah3amos.bsky.social @leahf.bsky.social @gabefreeman.bsky.social @mattmotyl.bsky.social #RupiSureshkumar #AniketAjagaonkar #AprilLat #SofiaBonilla

1 month ago 2 0 0 0
Preview
Response to Meta's Non-Consensual AI Sexualized Impersonation Under Oversight Board Review — Integrity Institute By Theodora Skeadas, Sarah Amos, Leah Ferentinos, Gabe Freeman, Rupi Sureshkumar, Sofia Bonilla, Matt Motyl, Aniket Ajagaonkar, and April Lat Integrity Institute members weigh in on the Oversight Bo...

Members submitted input to the
@oversightboard.bsky.social on Meta’s non-consensual AI sexualized impersonation.

AI-generated imagery is among the most harmful forms of digital abuse. Meta needs stronger enforcement pathways & better remediation for victims.

▶️ Read our analysis: bit.ly/4uBjjSH

1 month ago 3 2 1 0
Preview
Teens allege Musk’s Grok chatbot made sexual images of them as minors Three teenaged plaintiffs in a lawsuit filed Monday accuse xAI of distribution, possession and production with intent to distribute child pornography.

NEW: Three teens allege xAI and Grok were used to generate nude underage images. Photos from homecoming, yearbook, beach outings were turned into CSAM and distributed on Discord and Telegram + some were bartered for other child abuse imagery, a lawsuit alleges. www.washingtonpost.com/technology/2...

1 month ago 107 72 5 11
Preview
Online Child Safety at the MIT Policy Hackathon — Integrity Institute By April Lat Last November, the Integrity Institute partnered with MIT's 8th Annual Policy Hackathon as a Challenge Sponsor, presenting a pressing question at the heart of modern internet governance...

👾 Online Child Safety at the MIT Policy Hackathon

We partnered with @mit.edu for the Annual Policy Hackathon to ask: How should the U.S. address its patchwork of state-level approaches to child online safety?

April Lat breaks down the winning proposal & key lessons from the weekend: bit.ly/4ljje1O

1 month ago 0 0 0 0
Advertisement
Preview
How to Manage Misinformation in Large Language Models A group of trust and safety professionals say models are ultimately only useful when they can be trusted.

The value proposition for AI systems ultimately hinges on trust, write Leah Ferentinos, Omri Tubiana, Arushi Saxena, J.J. Martinez-Layuno and Chris Miles. The necessary tools and frameworks are emerging to deliver it, they say.

1 month ago 3 1 0 0

@theodoraskeadas.bsky.social @leahf.bsky.social @geewiz.bsky.social #OsirisParikh #SuziRagheb #accountdeactivation #disablingaccounts #platformgovernance #digitalrights #policy #OversightBoard #Meta

1 month ago 0 0 0 0
Preview
Response to Meta's Account Disabling Policies Under Oversight Board Review — Integrity Institute By: Theodora Skeadas, Leah Ferentinos, Osiris Parikh, Glenn Ellingson, Suzi Ragheb Integrity Institute members weigh in on the Oversight Board's public comment process: Board to Review for First Ti...

Integrity Institute members submitted input to the @oversightboard.bsky.social on Meta’s account disabling policies calling for transparency, procedural fairness & meaningful human review.

This is crucial as account bans shape who gets to participate online.

▶️ Read our analysis: bit.ly/4ck6PZ7

1 month ago 0 0 1 0
Preview
Osprey: Open Sourcing our Rule Engine Discord uses Osprey to quickly detect and remove new types of harm from putting our customers at risk. Now we’re open-sourcing this tool so others can do the same.

Open sourcing Osprey was an adventure! @discord.com is now powered by the open source community version of Osprey. Read about how Osprey works and what makes a rules engine for fighting abuse powerful:

discord.com/blog/osprey-...

1 month ago 88 16 3 1
Post image

📽️ We are live! Tune into Teens & Screens livestream where we will hear from Pew Research Center, youth and parents on how youth are navigating the digital world.

#TeensOnline

https://ow.ly/ey6v50YjtyP

1 month ago 1 1 0 0

@theodoraskeadas.bsky.social @shannonrsingh.bsky.social @rachelfagen.bsky.social #SuziRagheb #SabhanazRashidDiya #DevikaMalik

2 months ago 1 0 1 0
Preview
Escalations of Digital Warfare in Gaza — Integrity Institute By Shannon Raj Singh, Theodora Skeadas, Suzi Ragheb, Rachel Fagen, Sabhanaz Rashid Diya, and Devika Malik Image credit: Israeli Prime Minister Benjamin Netanyahu holds a joint press conference with ...

⚠️ Escalated Digital Warfare in Gaza

Originally published in @techpolicypress.bsky.social, II members unpack how Netanyahu's UNGA address—broadcast into Gaza via loudspeakers & cellphones—reveals the risks surrounding telecom control, free expression & civilian comms in crisis.

🔗 bit.ly/405ullr

2 months ago 3 2 1 0
Post image

Feb 26: Join researchers @livgar.bsky.social & @briana-v.bsky.social, in conversation with Luca Belli, @mbogen.bsky.social & @marlynnweimd.bsky.social, as they explore ongoing research on mental health and chatbots, and how to navigate a profound shift in care. datasociety.net/events/menta...

2 months ago 7 4 0 1
Advertisement

Thanks to @integrity-inst.bsky.social for spotlighting my piece with @theodoraskeadas.bsky.social, @nickgarcia.bsky.social, and @elisephillips.bsky.social!

In it, we argue that the cloud is critical infrastructure that shouldn't be left to the free market. It's up to states to provide oversight.

2 months ago 3 2 2 0
Preview
Why The Cloud Should Be a Public Utility — Integrity Institute By Michelle Nie, Theodora Skeadas, Nick Garcia, Elise Phillips We rely on the cloud every day to access government, healthcare and educational services. We access our government benefits, file taxes,...

☁️ Why The Cloud Should Be a Public Utility

Originally in @techpolicypress.bsky.social, @theodoraskeadas.bsky.social & co-authors reveal that cloud computing underpins everything from healthcare to AI - yet it’s treated like a regular commercial market, with minimal accountability.

bit.ly/402C8jA

2 months ago 6 3 0 1
Post image

Today is Safer Internet Day 🌐

A safer online world takes collaboration, transparency, and investment in the people keeping platforms safe.

We celebrate Trust & Safety practitioners, researchers, and civil society working behind the scenes to reduce harm and build trust.

2 months ago 1 0 0 0
Preview
Control for Whom? Keeping an Eye on the Dark Side of US's New Wearables Campaign. Nada Salem and Theodora Skeadas discuss the government push for health data tracking and the privacy and surveillance concerns for the public.

🔎 Control for Whom?

Nada Salem & II member @theodoraskeadas.bsky.social examine the dark side of the U.S. push for wearable health tech, where wellness data often sits outside HIPAA and raises real privacy & equity risks. Originally on @techpolicypress.bsky.social.

Read more here: bit.ly/3NSXXj0

2 months ago 1 1 0 0

#GenAI #ChildSafety #TechPolicyPress #RedTeaming #EduTech

2 months ago 0 0 0 0
Post image

🚩 Red Teaming Generative AI in Classrooms & Beyond

Members #JenWeedon, @theodoraskeadas.bsky.social, & @sarah3amos.bsky.social show how red teaming can uncover risks in “technically safe” AI systems before they reach students. Originally published in @techpolicypress.bsky.social.

🔗 bit.ly/3MbgLJJ

2 months ago 3 2 1 0