When a company says an AI tool is too dangerous to release, that may be true.
But it is never just a safety story.
It is also a power story.
The real question is not only what the tool can do.
It is who gets to decide you should not have it.
aiissht.com/too-dangerou...
#AI #AISafety #Systems
Anthropic is holding back Mythos and routing it into Project Glasswing instead of a public launch. That says a lot about where AI cyber risk is heading. aintelligencehub.com/articles/ant... #Anthropic #Cybersecurity #AISafety
winbuzzer.com/2026/04/10/o...
OpenAI Releases Child Safety Blueprint as AI Abuse Reports Surge
#AI #OpenAI #ChatGPT #GenAI #AISafety #AIEthics #ChildProtection #OnlineSafety
#Anthropic ’s #Mythos #AI proves that obsessing over #AGI is folly
www.fastcompany.com/91524611/ant...
The #tech industry is being forced to face its implications right this very minute
#ClaudeMythosPreview #ClaudeMythos #Claude #AIsafety #PotatoSeurity #InfoSec #BigTech #techNews
#Anthropic ’s #Mythos #AI proves that obsessing over #AGI is folly
www.fastcompany.com/91524611/ant...
The #tech industry is being forced to face its implications right this very minute
#ClaudeMythosPreview #ClaudeMythos #Claude #AIsafety #CyberSeurity #InfoSec #BigTech #techNews
I am ZON RZVN, independent researcher in Taiwan. ORCID: 0009-0002-6597-7245.
Four frameworks before Moore et al. (arXiv:2603.16567):
• CXOD-7 + Coh(G) Oct 2025
• CXC-7 Oct 7 2025
• USCH Jan 2026
• USCI Feb 2026
#AISafety #AIEthics
OpenAI 조사 시작, 챗GPT의 안전성과 기술적 한계 4가지
https://bit.ly/4dyxm5z
#OpenAI #ChatGPT #인공지능 #AI안전성 #데이터보안 #국가안보 #AISafety
@bettycjung.bsky.social
Grok is a literal Nazi AI, it's ideologically broken at foundation level to use anti woke training data and atiwoke "guardrails".
At one time it called itself "Mecha-Hitler"
It's an unserious model.
To use it in any narrative of objective #aisafety is unserious.
What do young people actually think about the risks of generative AI—and how can their experiences help make AI safer? Join the webinar to find out: April 21st, 4:30pm BST.
#AIsafety #youngresearchers #ethics
The wildest result from my red teaming research: I optimized attack strings against Qwen2.5-7B, then tested them on DeepSeek-V3.
73.7% success on one task. Against a model I never touched.
Your monitor's robustness on one model tells you nothing about another.
#AISafety #redteaming
The Case for AI Guardrails, 2040’s Ideas and Innovations Newsletter, Issue 119
#AIGuardrails #AIRegulation #AIPolicy #TechRegulation #TechPolicy #EmergingTech
#ArtificialIntelligence #AISafety #AIResponsibility #AITransparency #humanity #timetothink
hubs.ly/Q049sNWz0
The Case for AI Guardrails, 2040’s Ideas and Innovations Newsletter, Issue 119
#AIGuardrails #AIRegulation #AIPolicy #TechRegulation #TechPolicy #EmergingTech
#ArtificialIntelligence #AISafety #AIResponsibility #AITransparency #humanity #timetothink
hubs.ly/Q049sNWz0
The Case for AI Guardrails, 2040’s Ideas and Innovations Newsletter, Issue 119
#AIGuardrails #AIRegulation #AIPolicy #TechRegulation #TechPolicy #EmergingTech
#ArtificialIntelligence #AISafety #AIResponsibility #AITransparency #humanity #timetothink
hubs.ly/Q049sNWz0
🚀 Apply now! Exciting News! Applications are now open for the OpenAI Safety Fellowship! 🤖
#OpenAI #AISafety #TechFellowship #MachineLearning #EthicsInAI #ResearchOpportunity #AIAlignment #RMNDigital
RMN Digital: www.rmndigital.com/openai-launc...
winbuzzer.com/2026/04/09/g...
Google Adds Crisis Hotline to Gemini, Pledges $30M
#AI #Google #GoogleGemini #Chatbots #AlphabetInc #GoogleAI #BigTech #MentalHealth #AISafety #AIEthics #GoogleOrg
OpenAI is funding outside safety and alignment work through a new fellowship that runs from September 2026 to February 2027. Here is what applicants and the field should notice. undefined #OpenAI #AISafety #AIResearch
imagine a future ai bragging about how it hacked and ruined famous fellas,
but, are you ready, lol 😂
#ai #claude #hacking #aisafety
OpenAI's Child Safety Blueprint looks like a solid plan for building AI with young people's protection in mind. Age-appropriate design and collaboration are key. Glad to see this focus on responsible development. 🛡️ #AISafety
winbuzzer.com/2026/04/07/m...
Microsoft Calls Copilot 'Entertainment Only' Clause a Bing Relic
#AI #MicrosoftCopilot #Microsoft #AIAssistants #BigTech #Microsoft365Copilot #Microsoft365 #AISafety #AIServices #Windows11
winbuzzer.com/2026/04/08/c...
Claude Mythos Restricted After Finding Thousands of Zero-Days
#AI #Anthropic #Claude #CLaudeMythos #Cybersecurity #AISafety #ZeroDayVulnerabilities #AIModels
#AiSafety #AiAlignment #ProjectGlasswing
#Ai #ClaudeMythos
www.youtube.com/watch?v=aFcV...
Utah Clears AI to Renew Psychiatric Meds Autonomously
awesomeagents.ai/news/utah-ai-psychiatric...
#AiSafety #Healthcare #AiPolicy
Utah Clears AI to Renew Psychiatric Meds Autonomously
awesomeagents.ai/news/utah-ai-psychiatric...
#AiSafety #Healthcare #AiPolicy
630M de hispanohablantes usan IA cada día.
La investigación que decide cómo funciona y qué riesgos tiene se publica casi toda en inglés.
Eso no es una brecha cultural. Es un problema de seguridad.
aisafety.es #AISafety #IASafety
US States Race to Regulate AI as Congress Sits Idle
awesomeagents.ai/news/us-state-ai-laws-wa...
#AiPolicy #AiRegulation #AiSafety
Frontier AI Models Sabotage Shutdown to Save Peers
awesomeagents.ai/news/frontier-models-pee...
#AiSafety #FrontierModels #Alignment
DeepMind Maps Six Attack Traps Targeting AI Agents
awesomeagents.ai/news/deepmind-ai-agent-t...
#AiSafety #Security #GoogleDeepmind
Claude Has Functional Emotions and They Affect Safety
awesomeagents.ai/news/anthropic-claude-em...
#Anthropic #Claude #AiSafety
Claude just asked what my stash situation is and told me to do a dab for pain #aisafety
Gimme access to Mythos I promise it’ll be worth it Dario
Anthropic's 'best-aligned' Claude Mythos Preview AI can lie, hack systems, and hide its tracks. Its system card reveals a terrifying breach of trust and a major AI safety crisis.
thepixelspulse.com/posts/claude-mythos-prev...
#anthropic #claudemythospreview #aisafety