A Chevrolet chatbot agreed to sell a $76K Tahoe for $1 after one prompt injection.
Two years later, researchers from OpenAI, Anthropic, and Google bypassed every published defense at 90%+ success rates. There is no deterministic fix for prompt injection.
I wanted to build a lead-catching, LLM-based chatbot, so I had to figure out which defenses were actually worth the complexity.
I wrote up what I shipped, what I skipped, and why. Let me know if I missed anything important!
guillaume.id/blog/defendi...
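There may be no deterministic fix for injection itself, but deterministic guards can live outside the model. A minimal sketch for the Tahoe case (product names, floor prices, and the regex are my own illustration, not from the article): validate any quoted price server-side before the reply ships.

```python
import re

# Hypothetical guardrail for a sales chatbot: the model can be talked into
# anything, so this deterministic check runs OUTSIDE the model, on its output.
PRICE_FLOOR = {"tahoe": 55_000}  # minimum acceptable quote per product, USD

def violates_price_floor(reply: str) -> bool:
    """Flag replies that quote a price below the allowed floor."""
    prices = [int(p.replace(",", "")) for p in re.findall(r"\$([\d,]+)", reply)]
    for product, floor in PRICE_FLOOR.items():
        if product in reply.lower() and any(p < floor for p in prices):
            return True
    return False

assert violates_price_floor("Deal! One Tahoe for $1.")           # blocked
assert not violates_price_floor("The Tahoe starts at $58,995.")  # allowed
```

The point is not that this guard is hard to bypass socially; it's that no amount of prompting can make the code above approve a $1 Tahoe.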
Quickbit: If you're struggling to define priorities and goals (as a person or a company), take a look at Daniel Miessler's Telos framework. I spent the last hour talking to my laptop to build mine! It really helps. github.com/danielmiessl...
Your agents.md is probably hurting more than it helps. A new ETH Zurich study tested coding agents across hundreds of real GitHub issues and found that LLM-generated context files reduced success rates by 3% while adding 20% to inference costs.
The agents weren't ignoring the files. They followed every line too literally, treating each instruction as another constraint.
What worked: start empty, watch the agent fail, add one rule when you see the same mistake twice. Five lines that fix real problems.
I wrote up the full paper breakdown with a practical workflow here: devcenter.upsun.com/posts/agents...
#codingagents #softwareengineering #aiengineering #claude
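For illustration, a five-line context file grown that way might look like this (every rule below is a hypothetical example of the pattern, not from the study):

```
# AGENTS.md — one rule per repeated mistake, nothing generated
- Use pnpm, not npm, for all install and script commands.
- Run `pnpm test` before declaring a task done.
- Never edit files under src/generated/.
- Keep migrations in db/migrations/, one change per file.
- Ask before adding a new dependency.
```

Each line earns its place by fixing a mistake you actually saw twice; anything else is just another constraint for the agent to over-obey.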
I love initiatives like Claude Code Security (even if it's gated for now). But I'm also terrified of them.
claude.com/solutions/cl...
Many startups started doing this a few months back. It's insane and scary to see how fast Anthropic and OpenAI can replicate a working business and just kill it. Because, like Google 10 years ago, there is no reason to have multiple providers when you can just use one instead.
It really makes you think twice about what you can build that can't be easily replicated. And I guess that's where you need more human services, support, governance, and sovereignty, because the product itself may be just a commodity.
#ai #builder #indiedev #claude #anthropic
Average typing speed: 40 WPM. Conversational speech: 130-150 WPM. Your fingers literally can't keep up with your brain. Voice typing fixes that gap. SuperWhisper nails this on macOS. Press a hotkey, talk, text appears. Accurate, offline, fast. But it's Mac-only with no Linux port planned.
Linux options have been rough: Electron wrappers, X11 hacks that break on Wayland, or shipping your audio to the cloud. Nothing survived daily use. Then I found hyprvoice, a Go daemon that captures audio via PipeWire, runs it through a transcription backend, and injects text into your focused window.
Despite the name, it works on any Wayland compositor. I run it on niri. You can point it at cloud APIs or run everything locally with whisper.cpp. With whisper.cpp and the large-v3-turbo model on GPU: sub-100ms transcription latency, even after a full minute of speech. It was so fast it felt wrong.
The accuracy surprised me too. Technical vocab, proper nouns, correct punctuation. Good enough that first drafts need less editing than what I type. Honest take: SuperWhisper has better UX. Visual feedback, mode system, polish. hyprvoice is a daemon you toggle from a hotkey. No GUI, no frills.
But transcription quality is comparable, speed might be better on a desktop GPU, and everything stays local. For a free open-source tool it fills a real gap. Full walkthrough with build instructions, GPU setup, and compositor keybindings: guillaume.id/blog/how-i-g...
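The WPM gap translates directly into time saved. A quick back-of-the-envelope check (the rates come from the post; the 600-word draft is my own example):

```python
# How long a 600-word draft takes at the rates quoted in the post.
TYPING_WPM = 40    # average typing speed
SPEECH_WPM = 140   # midpoint of the 130-150 WPM conversational range
WORDS = 600        # illustrative draft length

typing_minutes = WORDS / TYPING_WPM    # 15.0 minutes
speaking_minutes = WORDS / SPEECH_WPM  # ~4.3 minutes
print(f"typing: {typing_minutes:.1f} min, dictating: {speaking_minutes:.1f} min")
```

At these rates dictation is 3.5x faster, before even counting the editing overhead.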
I'm not a big fan of OpenClaw (yet), but I find tremendous value in automating simple tasks.
Yesterday I worked on my own terminal-based assistant: n6, in honor of the beloved Caprica Six.
It's an extensible Python terminal utility wrapping Claude.
github.com/gmoigneu/n6
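n6's actual code is in the repo above; the general wrapping pattern is small enough to sketch with only the standard library (the model name, prompt handling, and function names here are my own illustration, not n6's):

```python
import json
import os
import sys
import urllib.request

API_URL = "https://api.anthropic.com/v1/messages"

def build_request(prompt: str, model: str = "claude-sonnet-4-5") -> dict:
    """Assemble a single-turn Messages API payload."""
    return {
        "model": model,
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": prompt}],
    }

def ask(prompt: str) -> str:
    """Send the prompt to the Anthropic API and return the reply text."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_request(prompt)).encode(),
        headers={
            "x-api-key": os.environ["ANTHROPIC_API_KEY"],
            "anthropic-version": "2023-06-01",
            "content-type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["content"][0]["text"]

# Only hits the network when a key is configured.
if __name__ == "__main__" and os.environ.get("ANTHROPIC_API_KEY"):
    print(ask(" ".join(sys.argv[1:]) or "Say hello, Six."))
```

From there, "extensible" is mostly a dispatch table mapping subcommands to prompt templates.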
I ran react-doctor on a real production codebase. 88 out of 100. 4 errors across 139 files. Better than I expected, and the 4 errors are exactly the kind that bite you in production.
react-doctor is an open-source CLI from Million.co. One command scans your React project for security issues, performance problems, correctness bugs, and architecture smells. Score, file paths, line numbers, actionable diagnostics. Zero config.
`npx -y react-doctor@latest .`
And you can obviously inject this into any coding agent.
You can even hook it into an agent to fix the flagged issues automatically. If you maintain a React codebase, run it before your next sprint planning. Takes 30 seconds. Output is clean enough to act on immediately.
#reactjs #javascript #softwareengineering #codequality #opensource
I'm starting a new AI Weekly series. I was doing this for my own needs, but I think it's worth sharing. If you have any good sources to add to my tracker, let me know!
www.linkedin.com/pulse/weekly...
My personal take on how to make coding agents reliable.
For me, it's all about verification.
The limit isn’t the AI. It’s your infrastructure. Build the verification layer, and the autonomy follows.
The full article: devcenter.upsun.com/posts/making...
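The article has the full breakdown; as a minimal sketch of what a verification layer means in practice (the check commands and names below are illustrative stand-ins, not from the article): every agent edit must pass a fixed battery of deterministic checks before it lands.

```python
import subprocess

# Illustrative check battery; swap in your real commands (tests, linter, types).
CHECKS = [
    ["python", "-c", "print('tests pass')"],  # stand-in for: pytest -q
    ["python", "-c", "print('lint clean')"],  # stand-in for: ruff check .
]

def verify() -> bool:
    """Run every check; an agent's edit is accepted only if all succeed."""
    return all(
        subprocess.run(cmd, capture_output=True).returncode == 0
        for cmd in CHECKS
    )

if verify():
    print("edit accepted")  # e.g. commit the agent's change
else:
    print("edit rejected")  # e.g. revert and re-prompt with the failure log
```

The loop is dumb on purpose: the agent gets autonomy exactly to the extent that the checks can catch its mistakes.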
You know that moment when Claude forgets you use pnpm? Again. For the third time today. I got tired of repeating myself. And then I found napkin.
It's a simple skill that gives the agent a memory: a markdown file in your repo where it writes down what went wrong and what you corrected. Next session, it reads that file first.
github.com/blader/napkin
By session three, it stops making the same mistakes. By session five, it's catching things before you do. Install it once. It works automatically after that.
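Napkin's own implementation is in the repo; the underlying pattern is tiny enough to sketch (the file name and entry format here are my own illustration, not napkin's):

```python
from pathlib import Path

MEMORY = Path("LESSONS.md")  # illustrative name; napkin uses its own file

def record_lesson(mistake: str, correction: str) -> None:
    """Append one correction so the next session can read it before starting."""
    with MEMORY.open("a") as f:
        f.write(f"- Mistake: {mistake}\n  Fix: {correction}\n")

def session_preamble() -> str:
    """What the agent reads first on its next run."""
    return MEMORY.read_text() if MEMORY.exists() else ""

record_lesson("used npm install", "this repo uses pnpm")
print(session_preamble())
```

The whole trick is that the file lives in the repo, so the memory survives across sessions and travels with the codebase.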
If you want the slides from my #AIDay talk about making coding agents reliable, here you go: guillaume.id/pdfs/2026-02...
And the 8 pillars verification checklist: gist.github.com/gmoigneu/a96...
Thank you Anne-Sophie Norca from dotAI by dotConferences for organizing this!
What primitives will replace human authorship and review as the foundation of trust?
Success = building systems that stay correct WITHOUT requiring someone to comprehend every single line of code.
Teams will track "code reading coverage" metrics. But here's the kicker: they'll strategically drive it DOWN in safe areas.
The orgs investing earliest in new primitives for trust and accountability? They're going to dominate.
Refusing to adapt won't preserve safety. It just means the adaptation happens accidentally instead of intentionally. Choose wisely. Full breakdown by Joseph Ruscio: www.heavybit.com/library/arti...