Posts by Mitch Allen
youtube.com/watch?v=grdo...
Tokenmaxxing is some of the dumbest, most inefficient, most wasteful tech bro BS that I’ve ever heard of. It’s right up there with KLOCs (ask ChatGPT).
As the story points out, the system can be easily gamed.
We should reward doing more with less. Not less with more.
My new book describes 4 critical pillars for designing a nano SaaS. They are:
- The one-sentence description of your product
- The one user persona that it is for
- The one outcome it delivers
- The one reason someone would pay for it instead of using a workaround
My latest book is now available on Amazon!
Nano SaaS: The Solo Developer's Guide to Building, Launching, and Monetizing in Days
a.co/d/04LQjRVP
I’m not off to a good start selling my book through Lemon Squeezy. I had to remove the product link on LinkedIn because the price and buttons came up in what I think is Russian!
Linking direct from my home page is fine.
Though “fine” is a relative term after reading the comments about LS on Reddit.
AI isn’t replacing software developers.
AI is assisting software developers.
One thing holding Bluesky back is its inability to import simple videos from my phone.
Having to chase down a converter, upload, convert, download, and move it to my phone so I can post it adds too much friction.
One thing that will kill any app is usability friction.
Anthropic wants you to think that AI is going to take over all software development and support. Yet their API keys and associated billing are an absolute mess. To the point where API keys are unusable and I want a refund.
Now I’m in a battle with their chatbot trying to get through to a human.
youtube.com/watch?v=YTLn...
Even Rick Beato thinks the future of AI is local.
venturebeat.com/orchestratio...
This morning, Microsoft announced "Copilot Cowork," a new cloud-based AI agentic automation tool within Microsoft's existing AI tool 365 Copilot, except now it can complete work on users' behalf across many Microsoft apps, instead of being contained within each one.
www.fastcompany.com/91498615/sto...
#fastcompany
This is what happens when CEOs buy the hype.
youtube.com/shorts/tBWen...
Claude Code, with their stingy token allocation, put me on a timeout again. That frees me up to consider OpenAI Codex as an alternative.
#codex #claude #anthropic #openai
github.com/openai/codex
Nearly 21K new AI agents go live on Ethereum, BNB Chain, and Solana
www.cryptopolitan.com/nearly-21k-n...
The guy just wants to nerd out: Peter Steinberger is the creator of OpenClaw, an open-source AI agent framework that's the fastest-growing project in GitHub history.
youtube.com/watch?v=NMBo...
Here is a breakdown of how OpenClaw works.
youtube.com/watch?v=CAbr...
A large comparison table showing benchmark performance across five model families, with columns labeled at the top: “Opus 4.6,” “Opus 4.5,” “Sonnet 4.5,” “Gemini 3 Pro,” and “GPT-5.2 (all models).” The Opus 4.6 column is visually highlighted with a light shaded background and rounded border. Rows list tasks and benchmarks on the left, with percentages or scores across models:
“Agentic terminal coding (Terminal-Bench 2.0)”: Opus 4.6: 65.4%, Opus 4.5: 59.8%, Sonnet 4.5: 51.0%, Gemini 3 Pro: 56.2% (54.2% self-reported), GPT-5.2: 64.7% (64% self-reported, Codex CLI)
“Agentic coding (SWE-bench Verified)”: Opus 4.6: 80.8%, Opus 4.5: 80.9%, Sonnet 4.5: 77.2%, Gemini 3 Pro: 76.2%, GPT-5.2: 80.0%
“Agentic computer use (OSWorld)”: Opus 4.6: 72.7%, Opus 4.5: 66.3%, Sonnet 4.5: 61.4%, Gemini 3 Pro: —, GPT-5.2: —
“Agentic tool use (t2-bench)”, Retail: Opus 4.6: 91.9%, Opus 4.5: 88.9%, Sonnet 4.5: 86.2%, Gemini 3 Pro: 85.3%, GPT-5.2: 82.0%
“Agentic tool use (t2-bench)”, Telecom: Opus 4.6: 99.3%, Opus 4.5: 98.2%, Sonnet 4.5: 98.0%, Gemini 3 Pro: 98.0%, GPT-5.2: 98.7%
“Scaled tool use (MCP Atlas)”: Opus 4.6: 59.5%, Opus 4.5: 62.3%, Sonnet 4.5: 43.8%, Gemini 3 Pro: 54.1%, GPT-5.2: 60.6%
“Agentic search (BrowseComp)”: Opus 4.6: 84.0%, Opus 4.5: 67.8%, Sonnet 4.5: 43.9%, Gemini 3 Pro: 59.2% (Deep Research), GPT-5.2: 77.9% (Pro)
“Multidisciplinary reasoning (Humanity’s Last Exam)”, without tools: Opus 4.6: 40.0%, Opus 4.5: 30.8%, Sonnet 4.5: 17.7%, Gemini 3 Pro: 37.5%, GPT-5.2: 36.6%
“Multidisciplinary reasoning (Humanity’s Last Exam)”, with tools: Opus 4.6: 53.1%, Opus 4.5: 43.4%, Sonnet 4.5: 33.6%, Gemini 3 Pro: 45.8%, GPT-5.2: 50.0%
“Agentic financial analysis (Finance Agent)”: Opus 4.6: 60.7%, Opus 4.5: 55.9%, Sonnet 4.5: 54.2%, Gemini 3 Pro: 44.1%, GPT-5.2: 56.6% (5.1)
“Office tasks (GDPVal-AA Elo)”: Opus 4.6: 1606, Opus 4.5: 1416, Sonnet 4.5: 1277, Gemini 3 Pro: 1195, GPT-5.2: 1462
“Novel problem-solving (ARC AGI 2)”: Opus 4.6: 68.8%, Opus 4.5: 37.6%, Sonnet 4.5: 13.6%, Gemini 3 Pro: 45.1% (Deep Thinking), GPT-5.2: 54.2% (Pro)
“Graduate-level reasoning (GPQA Diamond)”: Opus 4.6: 91.3%, Opus 4.5: 87.0%, S…
Opus 4.6 is here!
biggest wins on agentic search, HLE & ARC AGI 2
claude.com/blog/opus-4-...
finance.yahoo.com/news/traders...
#SaaSpocalypse #SaaS #Anthropic #Claude
Oh here we go again. Apple torpedoed one of my apps again and never bothered to tell me why. I was thinking of closing my account since I have no time for updates anyway.
#apple #indiedev
Before installing ClawdBot, make sure you’ve mitigated the risks. One way to shore things up is to use Slack or Signal for messaging. I wish all the automation gurus would stop recommending Telegram.
#clawdbot #agentic #telegram #ollama #security #automation
jpcaparas.medium.com/hundreds-of-...
Skool has a $9/month hobby plan for building online communities. Since I’m a “web guy,” I get asked how to do something like that all the time. If you want to build a site where you can host courses, discussions, etc., you can check out my affiliate link below. #skool
www.skool.com/signup?ref=5...
I’m trying to use Gemini CLI at work under their enterprise license. Every prompt is met with “Trying to reach gemini-2.5-pro (Attempt X/10)”
Then sometimes it reaches 10 and I have to tell it to try the flash version.
At least I get paid by the hour to stare at my screen.
#gemini
Anthropic has been alienating a lot of developers lately, myself included, with their token limits and their daily and even weekly timeouts.
#anthropic #claudecode #claude #opencode #aidev
www.theregister.com/2026/01/05/c...
With 3 agents running in the background I always hit my Claude Code token limit in 15 minutes.
This is unsustainable.
I’ll be building up my home lab this year to move Ollama to a beefy server and shift my work there.
Then see what I can do until open source catches up.
#ollama #claudecode
Meta bails on open source AI.
#meta #llama #avocado #opensource #closedsource
www.cnbc.com/amp/2025/12/...
Of course there is the danger that the review itself is a hallucination. If I were really worried I'd have another agent do a second review.
To avoid posting quotes that were hallucinated by one AI agent, I've built another to review each quote and assign a status, along with an explanation. In this case, the agent found that the original attribution was misidentified.
#aiagents #workflow #verification #citation #n8n #anthropic #gemini
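A rough sketch of the second-review idea, in Python. Nothing here is the actual n8n workflow: the `Status` values, the `Review` record, and the `cross_check` helper are all made-up names, and the two reviewers are plain functions standing in for whatever agents would really be called. The only point is the shape: each reviewer returns a status plus an explanation, and a disagreement gets flagged instead of trusted.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class Status(Enum):
    # Hypothetical status labels; the real workflow may use different ones.
    VERIFIED = "verified"
    MISATTRIBUTED = "misattributed"
    NEEDS_HUMAN = "needs_human"


@dataclass
class Review:
    status: Status
    explanation: str


# A reviewer takes (quote, attributed_to) and returns a Review.
# In practice this would wrap an agent call; here it is just a callable.
Reviewer = Callable[[str, str], Review]


def cross_check(quote: str, attributed_to: str,
                first: Reviewer, second: Reviewer) -> Review:
    """Run two independent reviews and only trust them when they agree."""
    a = first(quote, attributed_to)
    b = second(quote, attributed_to)
    if a.status == b.status:
        return a
    # The reviewers disagree, so one of them may be hallucinating.
    return Review(
        Status.NEEDS_HUMAN,
        f"Reviewers disagree: {a.status.value} vs {b.status.value}",
    )
```

If both reviewers land on the same status, the first review is passed through unchanged. Otherwise the quote is parked for a human to look at.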
Photo of a magic 8 ball with a 'goatse' version of the OpenAI logo and the words ChatGPT on it, next to a white box that says "ChatGPT offline version". Inset are photos of the 8-ball responses such as "That's a great question", "You're 1000% right" and "Too many requests try again later" and then another photo from the back of the box of a holographic authenticity sticker that says FAKE.
After much research and development I have made an offline version of ChatGPT.
Now you can save water and electricity while navel-gazing, and carry one of the world's most powerfully annoying AI chatbots in your pocket.