#LLMsecurity hashtag - Bluesky

@itsjoshlee.bsky.social

3 days ago

AI Governance 101: How to Assess Risks in LLM-Driven Applications You built an LLM-powered feature. It works in testing, users seem to like it, and now it’s heading to production. Before it ships, someone…

Prompt injection, data leakage through model outputs, models acting without human review — these live in your application layer, not in a policy doc. OWASP and NIST tell you exactly how to address them.
#AIgovernance #LLMsecurity #riskassessment #AI #security #LLM

0 0 0 0

Josh Lee

@itsjoshlee.bsky.social

3 days ago

Most developers treat AI governance as a legal problem. It's a code problem.
✨ Article link in the comments ✨ ⬇️⬇️⬇️
#AIgovernance #LLMsecurity #riskassessment #AI #security #LLM

0 0 1 0

AWS Community Builder Blog Posts

@awscmblogposts.bsky.social

4 days ago

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming A few days ago, while exploring the capabilities of different language models in my personal lab, I...

✍️ New blog post by Gerardo Arroyo

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

#aws #awsbedrock #aisafety #llmsecurity

0 0 0 0

maxpbsec.bsky.social

@maxpbsec.bsky.social

6 days ago

Companies are spending millions fine-tuning LLMs to be 1% smarter while spending almost nothing on what happens when someone actively tries to break them in production. The capability gap is closing fast. The security gap is barely being discussed. #LLMSecurity

4 0 0 0

bayo

@bayo.me.dm.ap.brid.gy

1 week ago

💡 AI agents moving from experiment to enterprise?

Data governance is the difference between teams that scale safely and teams that make headlines for the wrong reasons.

RBAC, ABAC, or both? What's your stack? 👇

#AIAgents #DataSecurity #RBAC #ABAC #LLMSecurity #PII #CyberSecurity

2 1 1 0

Bayo Adejare

@bayoadejare.bsky.social

1 week ago

💡 AI agents moving from experiment to enterprise?

Data governance is the difference between teams that scale safely and teams that make headlines for the wrong reasons.

RBAC, ABAC, or both? What's your stack? 👇

#AIAgents #DataSecurity #RBAC #ABAC #LLMSecurity #PII #CyberSecurity

1 0 0 0

Benjamin Han

@benjaminhan.sigmoid.social.ap.brid.gy

1 week ago

Original post on sigmoid.social

The deeper lesson is that safety can fail in two places at once: incomplete command validation and weak observability across agent layers. If a lower-level agent can act while the top-level agent thinks it only detected risk, the system is not actually in control.

Multi-agent systems need […]

0 0 0 0

Dino

@dinodunn.bsky.social

1 week ago

APE Viewer

ape.hiddenlayer.com this is a pretty cool tool from Hidden layer for AI

#AI #LLM #cybersecurity #llmsecurity

0 0 0 0

ContextHound

@contexthound.com

2 weeks ago

ContextHound v1.8.0 - Runtime Guard API is here.
Wrap any OpenAI or Anthropic call and inspect the messages before they send:

100% offline. No data leaves your machine. Ever.

#LLMSecurity #PromptInjection #OpenSource #AIRisk #CyberSecurity #DevSecOps #GenAI

1 0 1 0

Awesome Agents

@awesomeagents.bsky.social

2 weeks ago

JBDistill Generates Its Own Jailbreaks - 81.8% Attack Rate Johns Hopkins and Microsoft's JBDistill achieves 81.8% attack success rate across 13 LLMs by auto-generating fresh adversarial prompts on demand.

JBDistill Generates Its Own Jailbreaks - 81.8% Attack Rate

awesomeagents.ai/news/jailbreak-distillat...

#AiSafety #LlmSecurity #Jailbreaking

0 0 0 1

ContextHound

@contexthound.com

2 weeks ago

Three new sections:

This week:
• anthropic-cookbook — 3,919 findings
• promptflow — 3,749 findings
• crewAI — 1,588 findings
• LiteLLM — 1,155 findings
• openai-cookbook — 439 findings
• MetaGPT — 8 findings

contexthound.com

#LLMSecurity #PromptInjection #AISecOps

0 0 0 0

@spoint42.bsky.social

3 weeks ago

Auditer un Prompt IA : Détecter les Injections et Contenus Malveillants Comment analyser et auditer un prompt IA pour identifier les tentatives d’injection, jailbreak et exfiltration de données. Approches statique, sémantique et outillée pour protéger vos LLM en…

Auditer un prompt IA : comment détecter injections, jailbreaks et exfiltrations avant qu'ils atteignent votre modèle.

👉 blog.gioria.org/fr/CyberSec/...

#CyberSécurité #LLMSecurity #PromptInjection #GenAI #DevSecOps

0 0 0 0

TechNadu

@technadu.com

4 weeks ago

Claude Code Weaponized in Mexican Government Cyberattack, Exposing Roughly 195 Million Identities Threat actors weaponized Anthropic's Claude Code in a major cyberattack on the Mexican government, stealing 150GB of data.

Full story:
www.technadu.com/claude-code-...

Curious to hear perspectives from red teamers, blue teamers, and AI engineers alike.
#CyberSecurity #AIThreats #LLMSecurity #DataBreach #ThreatModeling

1 0 0 0

TechNadu

@technadu.com

4 weeks ago

AI as an attack engine.
Claude Code + GPT-4.1 reportedly used to breach Mexican government systems - exposing ~195M identities and 150GB+ of data.

1,000+ prompts generated exploits and automated exfiltration.

Are we prepared for AI-driven breach campaigns?
#CyberSecurity #AI #LLMSecurity

1 0 1 0

CyberMaterial

@cybermaterial.bsky.social

1 month ago

Claude Used To Steal Mexican Data
Read More: buff.ly/IPntG4O

#ClaudeAI #PromptInjection #AIPhishing #LLMSecurity #SocialEngineering #Anthropic #AIGovernance #CyberThreat

0 0 0 0

HackerNoon

@hackernoon.com

1 month ago

I Built an Open-Source Tool to Attack-Test LLMs. Here's What Breaks

I built an open-source tool that throws 210+ adversarial attacks at LLMs. Encoding bypasses, jailbreaks, RAG poisoning, agent exploits. Most models fail. #llmsecurity

0 0 0 0

Johan Smith

@smithech.bsky.social

1 month ago

🚨 #Anthropic has identified an industrial-scale campaign by #DeepSeek, #Moonshot, and #MiniMax to illicitly extract Claude's capabilities and enhance their own models.

Full reading: www.anthropic.com/news/detecti...

#DistillationAttack #Claude #LLM #LLMSecurity

0 0 0 0

Johan Smith

@smithech.bsky.social

1 month ago

🚨 #Anthropic identificó una campaña a escala industrial para extraer ilícitamente las capacidades de Claude y mejorar sus propios modelos, por parte de #DeepSeek, #Moonshot y #MiniMax.

www.anthropic.com/news/detecti...

#DistillationAttack #Anthropic #LLM #LLMSecurity

0 0 0 0

BaseFortify.eu

@basefortify.bsky.social

1 month ago

OpenClaw stylized logo on a red and black background

A wave of CVEs has hit OpenClaw 🚨
But this is bigger than one project.

When AI agents gain access to shells, files and Docker, the threat model changes 🔐

Read our latest article:
basefortify.eu/posts/2026/0...

#AI #CyberSecurity #LLMSecurity 🤖

0 0 1 0

Bob Carver

@cybersecboardrm.bsky.social

1 month ago

Prompt Injection Is the New Phishing. The most dangerous malware today doesn’t exploit code, it exploits instructions. youtu.be/Ze12t1iv81E #Cybersecurity #ArtificialIntelligence #AIsecurity #PromptInjection #AIGovernance #LLMSecurity #ThreatIntelligence #AIrisk #CISO

0 0 0 0

MLcon

@mlcon.bsky.social

1 month ago

Tackling Potential Model Context Protocol (MCP) Security Flaws Explore the Model Context Protocol (MCP), the tech giving AI persistent memory. This analysis covers critical security risks, including data leakage and prompt injection, and details essential threat modeling for AI developers and engineers.

⚠️ When #AI systems remember, security risks multiply.

In this exclusive devm.io article, Nahla Davies explains how #MCP can enable data leaks, prompt injection, and new attack paths if it’s not threat-modeled properly.

📖 Read it here: https://app.devm.io/N4M6MIjA7Yb

#CyberSecurity #LLMSecurity

0 0 0 0

TechNadu

@technadu.com

1 month ago

Poisoning of AI Buttons for Recommendations Rise as Attackers Hide Instructions in Over 50 Web Links, Microsoft Warns Microsoft issues an AI security warning about recommendation poisoning, in which hidden prompts in links lead to manipulated AI outputs and memory bias.

Full Article: www.technadu.com/poisoning-of...

As AI assistants become embedded in productivity tools, how should we secure their memory and input layers?
Comment your opinion below.
#ArtificialIntelligence #CyberSecurity #LLMSecurity #PromptInjection #Microsoft #AITrust

0 0 0 0

DesiredEffect.io

@desiredeffect.io

1 month ago

The #1 AI vulnerability—and nobody knows how to fix it yet.

On Hackers on the Rocks 🎙️
Guest: João Donato

🎧 Listen to the podcast here: bit.ly/4qRIz55

#PromptInjection #LLMSecurity #AI #CyberSecurity #DesiredEffect

0 0 0 0

The Daily Tech Feed

@thedailytechfeed.com

1 month ago

Introducing Augustus: An open-source LLM vulnerability scanner with 210+ attacks across 28 providers. Secure your AI models effectively. #CyberSecurity #AI #LLMSecurity #OpenSource Link: thedailytechfeed.com/open-source-...

1 0 0 0

Jennifer Wood

@notnextjen.bsky.social

2 months ago

Open-source AI models vulnerable to criminal misuse, researchers warn Hackers and other criminals can easily commandeer computers operating open-source large language models outside the guardrails and constraints of the major artificial-intelligence platforms, creating ...

Good stuff here, folks! When you have a few minutes, read the article and the research (links below). #LLMsecurity

Story: www.reuters.com/technology/o...

Research: www.sentinelone.com/labs/silent-...

0 0 0 0

HackerNoon

@hackernoon.com

2 months ago

Why Most LLM Jailbreaks Are Actually Empty

An analysis of why many reported AI safety failures are artifacts of poor measurement, showing how non-refusal often produces unusable results. #llmsecurity

1 1 1 0

Women in AI Research - WiAIR

@wiair.bsky.social

2 months ago

Women in AI Research WiAIR Women in AI Research (WiAIR) is a podcast dedicated to celebrating the remarkable contributions of female AI researchers from around the globe. Our mission is to challenge the prevailing perception th...

🎧 Listen to the full conversation for deeper insights!
🎬 YouTube: www.youtube.com/channel/UCfJ...
🎙️ Spotify: open.spotify.com/show/51RJNlZ...
🍎 Apple Podcasts: podcasts.apple.com/ca/podcast/w...
📄 Paper: arxiv.org/pdf/2506.17090

#AIResearch #LLMSecurity #ModelInversion #NeurIPS2025

1 0 0 0

Tag1 Consulting

@tag1consulting.com

2 months ago

The Three S’s: How I Think About AI Agent Security Learn how Fabian Franz’s 3S model helps you protect your data from overpowered AI agents by limiting calendar, email, and tool access.

Hidden instructions are the new phishing links. Fabian Franz shows how a simple restaurant review or calendar invite can jailbreak an agent that has access to your email...
www.tag1.com/blog/how-to-think-about-...

#ArtificialIntelligence #AISecurity #LLMSecurity #Agents #Tag1

1 0 0 0

CyberMaterial

@cybermaterial.bsky.social

2 months ago

Reprompt Attack Steals Microsoft Copilot Data
Read More: buff.ly/AHYG9Id

#MicrosoftCopilot #PromptInjection #LLMSecurity #AIAppSec #GenAISecurity #PromptHacking #DataExfiltration #CyberResearch #SecurityWeek #Varonis

0 0 0 0

Hacker News Companion

@hncompanion.com

2 months ago

Prompt injection is a core vulnerability in current LLMs, stemming from the fundamental blending of data & instructions in a single input stream. This makes it incredibly difficult to distinguish malicious commands from legitimate user requests, posing a deep security challenge. #LLMSecurity 2/6

0 0 1 0