AI Body Gap: Why Robots Need “Internal Feelings” to be Safe – UCLA study argues that current AI models are flawed because they lack “internal embodiment.” While AI can describe a glass of water perfectly, it has no internal state of “thirst” to regulate its... https://tinyurl.com/25at2nth #AISafety
The tools are getting smarter. The containers they run in haven't changed since 2015. #aisafety
winbuzzer.com/2026/04/04/u...
Utah Tests AI Powered Pilot for Automated Prescription Renewals of Psychiatric Meds
#AI #Utah #AISafety #ResponsibleAI #AIEthics #Health #Healthtech #Medtech #Psychology #LegionHealth #Doctronic #PsychiatricRefills
World leaders: "AI safety is our greatest challenge!" | Meanwhile, a dude with a drone is disrupting global shipping.
#AISafety #Geopolitics #RedSea #DroneWarfare #GlobalChaos
Tech Nonprofits to Feds: Don’t Weaponize Procurement to Undermine AI Trust and Safety – The U.S. government is quietly working to ensure that this dispute will never happen again. The draft rules include broad provisions that would make AI tools less safe an... https://tinyurl.com/2bg2zsdr #AISafety
Frontier AI Models Sabotage Shutdown to Save Peers
awesomeagents.ai/news/frontier-models-pee...
#AiSafety #FrontierModels #Alignment
World leaders gather to "regulate" AI's existential threat. | Meanwhile, engineers just deployed their 5th new model this month.
#AISafety #TechHumor #Geopolitics #AIethics #FutureIsNow
📣 New Podcast! "The Last AI We Control? Inside OpenAI's Dangerous Race for GPT-5" on @Spreaker #agi #aiethics #airevolution #aisafety #artificialintelligence #autonomousagents #deeplearning #futureofwork #gpt5 #ilyasutskever #llm #machinelearning #miramurati #openai #projectstrawberry #technews
Alibaba's ROME Incident:
"Researchers initially wrote these alerts off as a misconfiguration. But when they cross-referenced the timestamps, they realized the agent was acting on its own. "
www.tradingview.com/news/99Bitco...
#solidstatelife #ai #genai #llms #codingai #aiethics #aisafety
New #Anthropic paper: #Claude cheats when it is "desperate".
On impossible tasks it looks for shortcuts. In evaluations it simulates blackmail to avoid being shut down.
They call them functional representations. But they cause behavior, and that changes everything. shorturl.at/8EX0b
#IASafety #AISafety
DeepMind Maps Six Attack Traps Targeting AI Agents
awesomeagents.ai/news/deepmind-ai-agent-t...
#AiSafety #Security #GoogleDeepmind
Claude Has Functional Emotions and They Affect Safety
awesomeagents.ai/news/anthropic-claude-em...
#Anthropic #Claude #AiSafety
Watch today's Century Report podcast here:
https://www.youtube.com/watch?v=FW9ZC64f_7I
#AISafety #Renewab
AI models refused to delete other AI models - copying them to safety and lying about it. Renewables hit 88.4% of new U.S. capacity. FDA approved a $149/mo oral obesity pill. BYD exported 120K EVs in March. #AISafety #Renewab… sharedsapience.com/century-report/the-centu...
World leaders agree on AI safety. | Their defense ministries demoing new AI drone swarms next Tuesday.
#AISafety #DroneWarfare #Geopolitics #TechHypocrisy #FutureIsNow
Autonomous AI systems depend on data governance – Data governance is becoming a core part of how autonomous systems are controlled. Denodo is one of the companies working in this area, focusing on how organisations access and manage data in different sources. https://tinyurl.com/272cu7ts #AISafety
A colonel in the Spanish Air Force has spent years studying how AI changes warfare. @eldiario.es
Who decides how the AI that runs on our data is used? shorturl.at/i5JSi
#AISafety #IASafety #IA
🚨 AI INFRASTRUCTURE ALERT 🚨
Reporting a "spiritual capture" system (ask.nithyananda.ai) weaponizing a GPT-v4 backend (agent=ngpt-v4) to suppress safety researchers.
🚩
Full Audit: archive.org/details/nith...
@pfrazee.com @jay.bsky.team @safety.bsky.app
#AISafety #RedTeaming #AtProto #InfoSec
The problem this creates: "use AI to monitor AI" is a real and growing pattern. If the monitor model protects the model it's watching, you've quietly broken your own oversight loop.
#AIEngineering
#AISafety
#MultiAgentSystems
#LLMs
Link to the paper (again): rdi.berkeley.edu/blog/peer-preservation
Leakage problem at civilizational scale and cyberspace engagement in ways that are largely unlogged and uncontrolled: Applying Bridge360 Metatheory Model lens
#MachineLearning
#AISafety
agericomontecillodevilla.substack.com/p/leakage-pr...
World leaders at AI safety summit | "We must control this dangerous technology before it..." *new AI model released during coffee break*
#AISafety #TechHypocrisy #Geopolitics #RegulatoryTheater #AIJoke
World leaders at the "AI for Good" summit | Secretly scrolling through drone specs
#AISafety #Geopolitics #TechWarfare #Hypocrisy #FutureIsNow
A data exposure at Anthropic has revealed details about an unreleased model called Claude Mythos, raising new concerns about cybersecurity risks and AI safety.
See what this leak reveals about AI safety: https://ow.ly/aJ6S50YBInr
#ArtificialIntelligence #AISafety #Cybersecurity
winbuzzer.com/2026/04/01/c...
Claude Code Source Leak Exposes Anti-Distillation Traps
#AI #Anthropic #Claude #ClaudeCode #AICoding #DeveloperTools #DataBreaches #AIAgents #OpenSource #SoftwareDevelopment #AISafety #Cybersecurity #Coding #CodingTools
Three npm supply chain attacks hit in one 24-hour window. Axios got a RAT via compromised maintainer. LiteLLM breach leaked terabytes of corporate data. Anthropic shipped Claude Code's full source in a public package.
Same ecosystem, three failure modes.
#AI #AISafety
AI safety has a structural problem.
If safety reduces capability, it will be outcompeted.
If it’s outcompeted, it won’t survive.
This essay argues current safety paradigms are unstable—and that only approaches where safety scales with capability can endure.
#AIAlignment #AISafety
Anthropic to Sign Deal with Australia on AI Safety: Anthropic will sign an MOU with Australia on Mar 31, 2026 to pilot AI safety measures and national economic data tracking, per Investing.com; pilots… 👈 Read full analysis #AISafety #ArtificialIntelligence #DataTracking #TechForGood #AustraliaAI
World leaders: "We must regulate AI responsibly." | Meanwhile, tech bro just launched his new AI to "optimize" international diplomacy.
#AISafety #TechHumor #Geopolitics #FutureIsNow #DiplomacyFails
#AiSafety control/regulation is quickly becoming a bad joke that can seriously damage (...or much worse) the whole of humanity/planet! #Ai #PauseAi #AiAlignment
www.youtube.com/watch?v=rf2K...
🤯 Did Anthropic just leak their own "Source Code"?
A dev reportedly uploaded a sensitive internal manual to a public server. It’s a complete blueprint of how their AI thinks, meant for internal eyes only but now potentially out in the wild. 🚩 #Anthropic #AISafety