Advertisement ยท 728 ร— 90

Posts by David vonThenen

Post image Post image

So we finally admit itโ€ฆ scaling alone isn't saving anyone ๐Ÿ’ธ

My keynote at #AIDevSummit dives into Less Compute, More Impact โšก
Will also participate in a panel on what actually scales in Agentic AI ๐Ÿค–

Alsoโ€ฆ free passes (yes, actually free) ๐ŸŽŸ๏ธ ๐Ÿ‘‰ bit.ly/4cwaYro

2 days ago 0 0 0 0
Post image

I'll be presenting at @devoxxgreece.bsky.social ๐ŸŽ‰

Session 1: The Sound of Your Secrets - will discuss acoustic side-channel attacks/defenses.

Session 2: How Model Quantization Fuels the Next Wave of Agentic AI - will discuss efficient AI systems and models

Info: bit.ly/4c9EjZG

5 days ago 0 0 0 0

As AI moves from demos to production, teams are prioritizing efficiency, smaller models, and agentic systems. Learn whatโ€™s driving the shift and how to build smarter AI workflows. #DataScience #AI #ArtificialIntelligence opendatascience.com/the-shift-to...

1 week ago 1 1 0 0
Video

AI is forced to be derivative, so can it really capture the creativity of humans? John Romero says his development team created Doom because of who they were as people, and AI could not re-create it.

See more from John Romero at www.wearedevelopers.com/en/magazine/...

1 week ago 1 1 0 0
Post image

3 weeks until @odsc.bsky.social East! Will present 2 sessions...

Session 1: Less Compute, More Impact: How Model Quant Fuels the Next Wave of Agentic AI

Session 2: Train Your Own Sm. Lang Model: A Hands-On Workshop in Design, Distillation, and Deployment

๐Ÿ‘‰ bit.ly/4tzWn54

1 week ago 0 0 0 0
Post image

2 weeks away from @devoxx.fr ! Will discuss "The Sound of Your Secrets: Teaching Your Model to Spy So You Can Learn to Defend" and how keystroke sounds can reveal what you type, and how to fight back.

๐Ÿ“… Wed, Apr 22 @ 14:35
๐Ÿ“ Room: TBA
๐Ÿ“– Info: bit.ly/4lzLJbC

1 week ago 1 0 0 0
Post image

๐Ÿ“ข Speaker Announcement! Welcome David vonThenen, Senior AI/ML Engineer at NetApp, to #devbcn26! ๐Ÿš€ Get ready for an eye-opening session on AI security: "The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend". Grab Regular Tickets! ๐ŸŽŸ๏ธ buff.ly/AFSjWmY

2 weeks ago 1 1 0 0
Post image

Three weeks until @devoxxgreece.bsky.social, I'll be presenting...

โš™๏ธ Less Compute, More Impact: How Model Quantization Fuels The Next Wave Of Agentic AI - making agentic AI faster and cheaper with quantization.

๐Ÿ“… Friday, April 24 @ 17:15
๐Ÿ“ Room: MC 2
๐Ÿ“– Info: bit.ly/4sSLYC8

2 weeks ago 0 0 0 0
Post image

Three weeks until @devoxxgreece.bsky.social, I'll be speaking on...

๐Ÿ›ก๏ธ The Sound of Your Secrets: Teaching A Model To Spy, So You Can Learn to Defend - How AI can infer what you type from keyboard sounds.

๐Ÿ“… Saturday, April 25 @ 11:25
๐Ÿ“Room: Skalkotas
info: bit.ly/49riFPi

2 weeks ago 0 0 0 0
Post image

Bigger โ‰  better in production. The new @odsc.bsky.social blog breaks down why efficient, smaller models are winning ๐Ÿš€โšก + covers quantization, RAG pitfalls & real-world tradeoffs.

Grab 20% off ODSC East ๐ŸŽ‰

Read more ๐Ÿ‘‰ bit.ly/4bKQdHR

3 weeks ago 0 0 0 0
Advertisement
Post image

Excited to share that my session was accepted for @DevNetwork_ AI DevSummit 2026!

I'll be presenting "Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI."

๐Ÿ“… Conf Date: May 27 - 28
๐Ÿ“ Loc: San Fran
๐Ÿ”— Info: aidevsummit.co

3 weeks ago 0 0 0 0
Post image

Who doesn't love a good Chuck Norris joke?

Absolute legend!

4 weeks ago 0 0 0 1
Post image

Big models arenโ€™t always the answer ๐Ÿ˜… Smaller, focused SLMs can win on cost, latency, and control ๐Ÿš€

Join me this Tues, Mar 24 at 10am PT to break down fine-tuning, quantization, and real-world deployment strategies.

Details: bit.ly/4bwlPRC

4 weeks ago 0 0 0 0
Post image

Excited to share that I'll be speaking at @wearedevelopers.bsky.social World Congress Europe!

My session: "The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend"

๐Ÿ“… Dates: July 8-10
๐Ÿ“ Berlin, Germany
๐Ÿ”— Register: bit.ly/4bRiLzR

1 month ago 0 0 0 0
Post image Post image

When token costs get expensive.... you gotta do, what you gotta do ๐Ÿคฃ

1 month ago 0 0 0 0
Video

LLMs don't run out of compute firstโ€ฆ they run out of memory. ๐Ÿคฏ๐Ÿง 
KV cache, memory tiering, and shared storage are reshaping the economics of AI inference. I break down what's happening inside systems like vLLM + LMCache.

Read more: bit.ly/4bl87kn

#AIInference

1 month ago 0 0 0 0
Preview
Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI Editor’s note: David vonThenen is speaking at ODSC AI East this April 28th-30th. Check out his talk, “Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI,&#...

Model Quantization is reshaping AI economics by compressing large language models into smaller, faster, and more efficient systems. #AI #ArtificialIntelligence #DataScience opendatascience.com/less-compute...

1 month ago 4 1 0 0
Post image

People love AI demosโ€ฆ until production hits. ๐Ÿ˜…

On the Automate or Die Trying podcast, we break down the real challenges of enterprise AI: RAG accuracy, agent security, governance, and why "similarity" โ‰  correctness in production.

Watch the episode: bit.ly/3PxRryI

1 month ago 1 0 0 0
Post image

Spent years chasing bigger modelsโ€ฆ turns out smarter engineering wins. ๐Ÿš€
I joined the @odsc.bsky.social AI X Podcast to talk about real-world production AI: RAG failures, quantization, SLMs, and building efficient systems that actually ship. ๐ŸŽง

Listen here: bit.ly/4ssrdMm

#AI

1 month ago 2 1 0 0
Advertisement
Video

. @socallinuxexpo.bsky.social starts tomorrow!
I'll be presenting:
โ€ข A Practical Guide to Training a Small Language Model: bit.ly/3LkKjo0
โ€ข The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend: bit.ly/4rlTiEP

Sneak peek below ๐Ÿ‘‡

1 month ago 0 0 0 0
Video

Bigger models used to win headlines. Now they win power bills โšก

On the @odsc.bsky.social blog, I break down how quantization and specialized SLMs are reshaping agentic AI around efficiency, not ego. It's about value per watt, not parameter count ๐Ÿš€

Read here: bit.ly/4s6iKye

1 month ago 1 0 0 0
Post image

Excited to join @WDI_conference 2026 ๐ŸŽ‰ My VoD session, โ€œRethinking RAG: How MCP and Agent2Agent Will Transform the Future of Intelligent Searchโ€, dives into governance, grounding & multi-agent design ๐Ÿš€

Register: bit.ly/474WJrs
Code: WID26SP20 ๐ŸŽŸ๏ธ

#GenAI

1 month ago 0 0 0 0
Video

1 week until @socallinuxexpo.bsky.social

Session: The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend
bit.ly/4rlTiEP

Session: A Practical Guide to Training a Small Language Model: Tokenizers, Training, and Real-World Pitfalls
bit.ly/3LkKjo0

1 month ago 0 0 0 0
Post image

Excited to share that my talk was accepted forย @devoxx.fr.ย I'll be presenting "The Sound of Your Secrets: Teaching Your Model to Spy So You Can Learn to Defend," all about acoustic side-channel attacks and defenses.

More details: www.devoxx.fr/

1 month ago 0 0 0 0
Preview
Chorouk Malmoum, thank you for sharing this! It's deeply validating to see this coming out of NVIDIA's own research. I've been saying this for almost a year now. The first time I put it on recordโ€ฆ | David vonThenen Chorouk Malmoum, thank you for sharing this! It's deeply validating to see this coming out of NVIDIA's own research. I've been saying this for almost a year now. The first time I put it on record was at Devoxx UK 2025. At the time, it was based on hands-on experience building and validating multi-agent systems. But in this field, experience alone doesn't move the needle. You need data. You need papers. You need proof. And now we have it. The idea that Small Language Models are better suited for agentic workflows makes sense from a software engineering perspective. Separation of concerns. Encapsulation. Instead of one giant "do everything" model, you create focused SLMs that act as subject matter experts. Each one handles a narrow domain. There's another benefit people don't talk about enough: control. A tightly scoped SLM is far more likely to say "I don't know" when it's outside its boundary. This is a good thing. A massive general model? It tends to guess. And it guesses confidently. In production systems, that's not intelligence. That's liability. The second paper, introducing NVIDIA's Orchestrator-8B, is the piece I've been waiting for. A router that decides when to escalate a problem/question versus when to call a cheap tool or a smaller model. I'm very interested in experimenting with Orchestrator-8B. If the benchmarks hold up, this could materially change how we design cost-efficient agent systems at scale. This news also just so happens to coincide with a workshop/tutorial I will be giving at Open Data Science Conference (ODSC) East (end of April) titled: ๐‹๐ž๐ฌ๐ฌ ๐‚๐จ๐ฆ๐ฉ๐ฎ๐ญ๐ž, ๐Œ๐จ๐ซ๐ž ๐ˆ๐ฆ๐ฉ๐š๐œ๐ญ: ๐‡๐จ๐ฐ ๐Œ๐จ๐๐ž๐ฅ ๐๐ฎ๐š๐ง๐ญ๐ข๐ณ๐š๐ญ๐ข๐จ๐ง ๐…๐ฎ๐ž๐ฅ๐ฌ ๐ญ๐ก๐ž ๐๐ž๐ฑ๐ญ ๐–๐š๐ฏ๐ž ๐จ๐Ÿ ๐€๐ ๐ž๐ง๐ญ๐ข๐œ ๐€๐ˆ . I will do my best to include the findings/learnings from Orchestrator-8B in that session. (Tentative) Session Date: Tuesday, April 28 Session Info: https://bit.ly/4rxXeTg Read Chorouk's full breakdown below for more information and links to the research. .

NVIDIA's research backs it: Small Language Models > giant LLMs for agentic workflows ๐Ÿ”ฅ Focused SLMs = better control, lower cost, fewer "confident guesses."

Orchestrator-8B as a smart router? Game changer. I'll cover this at @odsc.bsky.social East! ๐Ÿš€

More info: bit.ly/4aCsYPG

1 month ago 0 0 0 0

Really looking forward to SCaLE this year. Going to be a lot of fun (with learning some cool stuff)!

1 month ago 0 0 0 0
Preview
Automate or Die Trying | David vonThenen Recorded a great conversation last week with Wil Ramos (https://lnkd.in/gV9jQBUV) on the ๐€๐ฎ๐ญ๐จ๐ฆ๐š๐ญ๐ž ๐จ๐ซ ๐ƒ๐ข๐ž ๐“๐ซ๐ฒ๐ข๐ง๐  podcast. Wil, thanks for having me on. I appreciate the space to go deep on topics that don't always fit into a conference talk. The episode should be out in a week or two. I'll share the link here once it's live. Here's some of what we covered: ๐Ÿ‘‰ How to harden RAG and agent workflows so they act only on verifiable evidence - Grounded data - Clear audit trails - Preventing agents from drifting into hallucinated "actions" or "decisions" ๐Ÿ‘‰ What it takes to make agentic automation safe enough to run unattended - Guardrails - Checkpoints - Human-in-the-loop ๐Ÿ‘‰ And the broader state of AI right now. Where it's moving. Where it's messy. And where we need to be more disciplined. I've been listening to several episodes of ๐€๐ฎ๐ญ๐จ๐ฆ๐š๐ญ๐ž ๐จ๐ซ ๐ƒ๐ข๐ž ๐“๐ซ๐ฒ๐ข๐ง๐  , and they're worth your time. What I like most is the range of perspectives. Different guests, different takes, real-world lessons. It's people building and securing real systems. If you're into automation, security, or AI systems, subscribe to the podcast here: YouTube: https://lnkd.in/gRKCEAJw Spotify: https://lnkd.in/ggDUeAyR More soon, once the episode drops.

Recorded an episode of "Automate or Die Trying" Podcast with Wil Ramos! ๐Ÿš€ We went deep on hardening RAG + agent workflows... grounded data, audit trails, guardrails, human-in-the-loop.

Teaser post on LinkedIn: bit.ly/4qQiyCo

1 month ago 0 0 0 0
Preview
OpenClaw: The Viral AI Agent that Broke the Internet - Peter Steinberger | Lex Fridman Podcast #491 | David vonThenen I listened to the latest Lex Fridman Podcast episode: ๐Ž๐ฉ๐ž๐ง๐‚๐ฅ๐š๐ฐ: ๐“๐ก๐ž ๐•๐ข๐ซ๐š๐ฅ ๐€๐ˆ ๐€๐ ๐ž๐ง๐ญ ๐ญ๐ก๐š๐ญ ๐๐ซ๐จ๐ค๐ž ๐ญ๐ก๐ž ๐ˆ๐ง๐ญ๐ž๐ซ๐ง๐ž๐ญ ๐ฐ๐ข๐ญ๐ก ๐๐ž๐ญ๐ž๐ซ ๐’๐ญ๐ž๐ข๐ง๐›๐ž๐ซ๐ ๐ž๐ซ. If you're building agents, it's worth a listen. OpenClaw has exploded on GitHub. But what stood out to me was this (time: 23:26): ๐€๐ง๐ ๐ญ๐ก๐ž๐ง ๐ญ๐ก๐ž ๐š๐ ๐ž๐ง๐ญ ๐ฐ๐จ๐ฎ๐ฅ๐ ๐ฃ๐ฎ๐ฌ๐ญ ๐ฆ๐จ๐๐ข๐Ÿ๐ฒ ๐ข๐ญ๐ฌ ๐จ๐ฐ๐ง ๐ฌ๐จ๐Ÿ๐ญ๐ฐ๐š๐ซ๐žโ€ฆ ๐ˆ ๐ฃ๐ฎ๐ฌ๐ญ ๐›๐ฎ๐ข๐ฅ๐ญ ๐ข๐ญโ€ฆ ๐ข๐ญ ๐ฃ๐ฎ๐ฌ๐ญ ๐ก๐š๐ฉ๐ฉ๐ž๐ง๐ž๐. We're not talking about scripted automation anymore. We're talking about systems that change themselves. That's impressive. It's also a different level of power. Midway through, Lex raises the obvious issue (time: 52:50): ๐๐ซ๐จ๐ฆ๐ฉ๐ญ ๐ข๐ง๐ฃ๐ž๐œ๐ญ๐ข๐จ๐ง ๐ข๐ฌ ๐ฌ๐ญ๐ข๐ฅ๐ฅ ๐š๐ง ๐จ๐ฉ๐ž๐ง ๐ฉ๐ซ๐จ๐›๐ฅ๐ž๐ฆโ€ฆ ๐ญ๐ก๐ž๐ซ๐ž'๐ฌ ๐ฌ๐จ ๐ฆ๐š๐ง๐ฒ ๐ฉ๐จ๐ฌ๐ฌ๐ข๐›๐ข๐ฅ๐ข๐ญ๐ข๐ž๐ฌโ€ฆ ๐ง๐ฎ๐š๐ง๐œ๐ž๐ ๐š๐ญ๐ญ๐š๐œ๐ค ๐ฏ๐ž๐œ๐ญ๐จ๐ซ๐ฌ. Peter talks about progress, like scanning skills with VirusTotal. That's good. But the bigger point remains. OpenClaw is a privileged automation runtime. Your risk is dominated by: 1๏ธโƒฃ Credential exposure 2๏ธโƒฃ Network exposure 3๏ธโƒฃ Tool/skill supply chain 4๏ธโƒฃ Prompt injection and social engineering You're basically giving a script sudo on your machine. Except now it improvises. Please see: https://bit.ly/4aFyXDn And hopefully, you read the docs and you aren't running this on your actual machine, but some isolated cloud instance, VM, etc. To run OpenClaw safely, Peter's advice is clear (time: 1:00:45): ๐ˆ๐Ÿ ๐ฒ๐จ๐ฎ ๐ฆ๐š๐ค๐ž ๐ฌ๐ฎ๐ซ๐ž ๐ญ๐ก๐š๐ญ ๐ฒ๐จ๐ฎ ๐š๐ซ๐ž ๐ญ๐ก๐ž ๐จ๐ง๐ฅ๐ฒ ๐ฉ๐ž๐ซ๐ฌ๐จ๐ง ๐ฐ๐ก๐จ ๐ญ๐š๐ฅ๐ค๐ฌ ๐ญ๐จ ๐ข๐ญโ€ฆ ๐ข๐ง ๐š ๐ฉ๐ซ๐ข๐ฏ๐š๐ญ๐ž ๐ง๐ž๐ญ๐ฐ๐จ๐ซ๐คโ€ฆ ๐ญ๐ก๐ž ๐ซ๐ข๐ฌ๐ค ๐ฉ๐ซ๐จ๐Ÿ๐ข๐ฅ๐ž ๐Ÿ๐š๐ฅ๐ฅ๐ฌ ๐š๐ฐ๐š๐ฒ. Isolation is the safe move. And here's my 2 cents... If you isolate OpenClaw or any AI assistant completely... no personal data, no real integrations, no privileged API keys... how useful is it really? An AI assistant only becomes valuable when it knows who you are and can act on your behalf. That requires access. And access creates risk. Without that, you have a cool demo. Not a production system. Not a ๐ฌ๐š๐Ÿ๐ž AI assistant. I think it could be useful to perform long running tasks based on the knowledge contained within the LLM... those in the AI space with some know how, probably already have some equivalent of that. BUT, I have a feeling that with some of that OpenAI resource, a safe and production version might become a reality sooner rather than later. The episode is fascinating and very honest about both the power and the risks... and also some really really amazing piece of tech that is taking the internet by storm. Give it a listen (it's 3+ hours, but worth it): https://bit.ly/4c0iFra

Just listened to Lex Fridman w/OpenClaw's creator ๐Ÿคฏ๐Ÿค– Self-modifying AI agents are hereโ€ฆ and they're powerful.

But let's be real: security is a real concern ๐Ÿ”โš ๏ธ Privileged automation + access + personal info = serious risk.

I break it down here: bit.ly/4tTsmhJ ๐Ÿš€

1 month ago 0 0 0 0
Post image

Excited to share that my talk was accepted for NDC Copenhagen! I'll be presenting The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend.


๐Ÿ—“๏ธ Session Time/Date: Thurs, Jun 4 at 15:00
๐Ÿ“ Location: Room 4
๐Ÿ“– Session Link: bit.ly/4qvPWxT

2 months ago 0 0 0 0
Advertisement
Post image

Three weeks out and I can't wait to bring this one to @socallinuxexpo.bsky.social 23x in Pasadena, CA ๐ŸŽง๐Ÿ”

Check out my session: The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend

๐Ÿ—“๏ธ Sat, March 7 @ 14:30
๐Ÿ“ Room 101
๐Ÿ”— bit.ly/4rlTiEP

#SCaLE23x

2 months ago 0 0 0 0