Advertisement · 728 × 90

Posts by Mitch Allen

Preview
wordgame.tools — Free Word Utilities for Writers & Gamers A growing collection of free, instant, browser-based word utilities for writers, students, puzzle solvers, and word game enthusiasts. No sign-up required.

wordgame.tools

1 hour ago 0 0 0 0
AI Demand Is Inflated And Only Anthropic Is Being Realistic
AI Demand Is Inflated And Only Anthropic Is Being Realistic YouTube video by CNBC

youtube.com/watch?v=grdo...

Tokenmaxxing is some of the dumbest, inefficient, most wasteful tech bro BS that I’ve ever heard of. It’s right up there with KLOCs (ask ChatGPT).

As the story points out, the system can be easily gamed.

We should reward doing more with less. Not less with more.

1 day ago 0 0 0 0
Post image

When designing a nano SaaS, my new book describes 4 critical pillars. They are:

- The one-sentence description of your product
- The one user persona that it is for
- The one outcome it delivers
- The one reason someone would pay for it instead of using a workaround

2 days ago 0 0 0 0
Amazon.com

My latest book is now available on Amazon!

Nano SaaS: The Solo Developer's Guide to Building, Launching, and Monetizing in Days

a.co/d/04LQjRVP

2 days ago 1 0 0 0
Post image

I’m not off to a good start selling my book through Lemon Squeezy. I had to remove the product link on LinkedIn because the price and buttons came up in what I think is Russian!

Linking direct from my home page is fine.

Though fine is a relative term after reading the comments about LS on Reddit.

3 days ago 0 0 0 0

AI isn’t replacing software developers.
AI is assisting software developers.

1 month ago 0 0 0 0
Post image

One thing holding Bluesky back is its inability to import simple videos from my phone.

Having to chase down a converter, upload, convert, download, move to it to my phone so I can post it - adds too much friction.

One thing that will kill any app is usability friction.

1 month ago 0 1 0 0
Advertisement

Anthropic wants you to think that AI is going to take over all software development and support. Yet their API keys and associated billing are an absolute mess. To the point where API keys are unusable and I want a refund.

Now I’m in a battle with their chatbot trying to get through to a human.

1 month ago 1 0 0 0
How AI Will Fail Like The Music Industry
How AI Will Fail Like The Music Industry YouTube video by Rick Beato

youtube.com/watch?v=YTLn...

Even Rick Beato thinks the future of AI is local.

1 month ago 0 1 0 0
Preview
Microsoft announces Copilot Cowork with help from Anthropic — a cloud-powered AI agent that works across M365 apps Copilot Cowork operates in the cloud, inside Microsoft 365's infrastructure, and draws on something Claude Cowork simply cannot access: the full graph of a user's enterprise work data.

venturebeat.com/orchestratio...

This morning, Microsoft announced "Copilot Cowork" a new cloud-based AI agentic automation tool within Microsoft's existing AI tool 365 Copilot, except now it can complete work on users' behalf across many Microsoft apps, instead of contained within each one.

1 month ago 0 0 0 0
Preview
Stop calling it inevitable: The AI job crisis is being built, not born Whether or not the titans of AI rise to the moment, the rest of us will have to meet its challenges head-on.

www.fastcompany.com/91498615/sto...

#fastcompany

1 month ago 0 0 0 0
NASA chief Jared Isaacman discusses major changes to Artemis program to get it "back on track"
NASA chief Jared Isaacman discusses major changes to Artemis program to get it "back on track" YouTube video by CBS News

NASA finally gets back to iterating. #naa #artemis

youtube.com/watch?v=8VwR...

1 month ago 0 0 0 0
Salesforces mistake
Salesforces mistake YouTube video by The PrimeTime

This is what happens when CEOs buy the hype.

youtube.com/shorts/tBWen...

2 months ago 0 0 0 0
Preview
GitHub - openai/codex: Lightweight coding agent that runs in your terminal Lightweight coding agent that runs in your terminal - openai/codex

Claude Code with their stingy token allocation put me on a timeout again. That frees me up to consider Open AI Codex as an alternative.

#codex #claude #anthropic #openai

github.com/openai/codex

2 months ago 0 0 0 0
Preview
Nearly 21K new AI agents go live on Ethereum, BNB Chain, and Solana - Cryptopolitan AI agents are proliferating under the new ERC-8004 frameworks. This new type of agent can work in a wider environment, while being vetted by reputation, through ZK proofs, or other predetermined condi...

Nearly 21K new AI agents go live on Ethereum, BNB Chain, and Solana

www.cryptopolitan.com/nearly-21k-n...

2 months ago 0 0 0 0
Who will acquire OpenClaw? - OpenAI and Meta make big offers | Peter Steinberger and Lex Fridman
Who will acquire OpenClaw? - OpenAI and Meta make big offers | Peter Steinberger and Lex Fridman YouTube video by Lex Clips

The guy just wants to nerd out: Peter Steinberger is the creator of OpenClaw, an open-source AI agent framework that's the fastest-growing project in GitHub history.

youtube.com/watch?v=NMBo...

2 months ago 0 0 0 0
How OpenClaw Works: The Architecture Behind the 'Magic'
How OpenClaw Works: The Architecture Behind the 'Magic' YouTube video by Damian Galarza

Here is a breakdown of how OpenClaw works.

youtube.com/watch?v=CAbr...

2 months ago 0 0 0 0
Advertisement
A large comparison table showing benchmark performance across five model families, with columns labeled at the top: “Opus 4.6,” “Opus 4.5,” “Sonnet 4.5,” “Gemini 3 Pro,” and “GPT-5.2 (all models).” The Opus 4.6 column is visually highlighted with a light shaded background and rounded border.

Rows list tasks and benchmarks on the left, with percentages or scores across models:

“Agentic terminal coding (Terminal-Bench 2.0)”:
Opus 4.6: 65.4%
Opus 4.5: 59.8%
Sonnet 4.5: 51.0%
Gemini 3 Pro: 56.2% (54.2% self-reported)
GPT-5.2: 64.7% (64% self-reported, Codex CLI)

“Agentic coding (SWE-bench Verified)”:
Opus 4.6: 80.8%
Opus 4.5: 80.9%
Sonnet 4.5: 77.2%
Gemini 3 Pro: 76.2%
GPT-5.2: 80.0%

“Agentic computer use (OSWorld)”:
Opus 4.6: 72.7%
Opus 4.5: 66.3%
Sonnet 4.5: 61.4%
Gemini 3 Pro: —
GPT-5.2: —

“Agentic tool use (t2-bench)”:
Retail: Opus 4.6 91.9%, Opus 4.5 88.9%, Sonnet 4.5 86.2%, Gemini 3 Pro 85.3%, GPT-5.2 82.0%
Telecom: Opus 4.6 99.3%, Opus 4.5 98.2%, Sonnet 4.5 98.0%, Gemini 3 Pro 98.0%, GPT-5.2 98.7%

“Scaled tool use (MCP Atlas)”:
Opus 4.6: 59.5%
Opus 4.5: 62.3%
Sonnet 4.5: 43.8%
Gemini 3 Pro: 54.1%
GPT-5.2: 60.6%

“Agentic search (BrowseComp)”:
Opus 4.6: 84.0%
Opus 4.5: 67.8%
Sonnet 4.5: 43.9%
Gemini 3 Pro: 59.2% (Deep Research)
GPT-5.2: 77.9% (Pro)

“Multidisciplinary reasoning (Humanity’s Last Exam)”:
Without tools: Opus 4.6 40.0%, Opus 4.5 30.8%, Sonnet 4.5 17.7%, Gemini 3 Pro 37.5%, GPT-5.2 36.6%
With tools: Opus 4.6 53.1%, Opus 4.5 43.4%, Sonnet 4.5 33.6%, Gemini 3 Pro 45.8%, GPT-5.2 50.0%

“Agentic financial analysis (Finance Agent)”:
Opus 4.6: 60.7%
Opus 4.5: 55.9%
Sonnet 4.5: 54.2%
Gemini 3 Pro: 44.1%
GPT-5.2: 56.6% (5.1)

“Office tasks (GDPVal-AA Elo)”:
Opus 4.6: 1606
Opus 4.5: 1416
Sonnet 4.5: 1277
Gemini 3 Pro: 1195
GPT-5.2: 1462

“Novel problem-solving (ARC AGI 2)”:
Opus 4.6: 68.8%
Opus 4.5: 37.6%
Sonnet 4.5: 13.6%
Gemini 3 Pro: 45.1% (Deep Thinking)
GPT-5.2: 54.2% (Pro)

“Graduate-level reasoning (GPQA Diamond)”:
Opus 4.6: 91.3%
Opus 4.5: 87.0%
S…

A large comparison table showing benchmark performance across five model families, with columns labeled at the top: “Opus 4.6,” “Opus 4.5,” “Sonnet 4.5,” “Gemini 3 Pro,” and “GPT-5.2 (all models).” The Opus 4.6 column is visually highlighted with a light shaded background and rounded border. Rows list tasks and benchmarks on the left, with percentages or scores across models: “Agentic terminal coding (Terminal-Bench 2.0)”: Opus 4.6: 65.4% Opus 4.5: 59.8% Sonnet 4.5: 51.0% Gemini 3 Pro: 56.2% (54.2% self-reported) GPT-5.2: 64.7% (64% self-reported, Codex CLI) “Agentic coding (SWE-bench Verified)”: Opus 4.6: 80.8% Opus 4.5: 80.9% Sonnet 4.5: 77.2% Gemini 3 Pro: 76.2% GPT-5.2: 80.0% “Agentic computer use (OSWorld)”: Opus 4.6: 72.7% Opus 4.5: 66.3% Sonnet 4.5: 61.4% Gemini 3 Pro: — GPT-5.2: — “Agentic tool use (t2-bench)”: Retail: Opus 4.6 91.9%, Opus 4.5 88.9%, Sonnet 4.5 86.2%, Gemini 3 Pro 85.3%, GPT-5.2 82.0% Telecom: Opus 4.6 99.3%, Opus 4.5 98.2%, Sonnet 4.5 98.0%, Gemini 3 Pro 98.0%, GPT-5.2 98.7% “Scaled tool use (MCP Atlas)”: Opus 4.6: 59.5% Opus 4.5: 62.3% Sonnet 4.5: 43.8% Gemini 3 Pro: 54.1% GPT-5.2: 60.6% “Agentic search (BrowseComp)”: Opus 4.6: 84.0% Opus 4.5: 67.8% Sonnet 4.5: 43.9% Gemini 3 Pro: 59.2% (Deep Research) GPT-5.2: 77.9% (Pro) “Multidisciplinary reasoning (Humanity’s Last Exam)”: Without tools: Opus 4.6 40.0%, Opus 4.5 30.8%, Sonnet 4.5 17.7%, Gemini 3 Pro 37.5%, GPT-5.2 36.6% With tools: Opus 4.6 53.1%, Opus 4.5 43.4%, Sonnet 4.5 33.6%, Gemini 3 Pro 45.8%, GPT-5.2 50.0% “Agentic financial analysis (Finance Agent)”: Opus 4.6: 60.7% Opus 4.5: 55.9% Sonnet 4.5: 54.2% Gemini 3 Pro: 44.1% GPT-5.2: 56.6% (5.1) “Office tasks (GDPVal-AA Elo)”: Opus 4.6: 1606 Opus 4.5: 1416 Sonnet 4.5: 1277 Gemini 3 Pro: 1195 GPT-5.2: 1462 “Novel problem-solving (ARC AGI 2)”: Opus 4.6: 68.8% Opus 4.5: 37.6% Sonnet 4.5: 13.6% Gemini 3 Pro: 45.1% (Deep Thinking) GPT-5.2: 54.2% (Pro) “Graduate-level reasoning (GPQA Diamond)”: Opus 4.6: 91.3% Opus 4.5: 87.0% S…

Opus 4.6 is here!

biggest wins on agentic search, HLE & ARC AGI 2

claude.com/blog/opus-4-...

2 months ago 88 7 5 3
Preview
‘Get me out’: Traders dump software stocks as AI fears erupt “We call it the ‘SaaSpocalypse,’ an apocalypse for software-as-a-service stocks,” said Jeffrey Favuzza, who works on the equity trading desk at Jefferies. Selling pressure was evident across the sect...

finance.yahoo.com/news/traders...

#SaaSpocalypse #SaaS #Anthropic #Claude

2 months ago 1 0 0 0
The Moltbook Experiment Failed
The Moltbook Experiment Failed YouTube video by The PrimeTime

youtube.com/watch?v=6OXE...

#moltbook #openclaw #moltbot #clawdbot

2 months ago 0 0 0 0
Post image

Oh here we go again. Apple torpedoed one of my apps again and never bothered to tell me why. I was thinking of closing my account since I have no time for updates anyway.

#apple #indiedev

2 months ago 2 0 0 0
Preview
Hundreds of Clawdbot instances were exposed on the internet. Here’s how to not be one of them A follow-up guide covering the security risks, best practices, and hardening steps for running an AI assistant with access to your personal…

Before installing ClawdBot, make sure you’ve mitigated the risks. One way to shore things up is to use Slack or Signal for messaging. I wish all the automation gurus would stop recommending Telegram.

#clawdbot #agentic #telegram #ollama #security #automation

jpcaparas.medium.com/hundreds-of-...

2 months ago 0 0 0 0
Preview
Skool: Sign up Create your Skool account. It's free!

Skool has a $9/month hobby plan for building online communities. Since I’m a “web guy” I get asked how to do something like that all the time. If you want to build a site where you can host courses, discussions etc you can checkout my affiliate link below. #skool

www.skool.com/signup?ref=5...

2 months ago 0 0 0 0

I’m trying to use Gemini CLI at work under their enterprise license. Every prompt is met with “Trying to reach gemini-2.5-pro (Attempt X/10)”

Then sometimes it reaches 10 and I have to tell it to try the flash version.

At least I get paid by the hour to stare at my screen.

#gemini

2 months ago 0 0 0 0
Preview
Claude devs complain about surprise usage limits : Holiday hangover?

Anthropic has been alienating a lot of developers lately. Including myself with their token limits, daily and even weekly timeouts.

#anthropic #claudecode #claude #opencode #aidev

www.theregister.com/2026/01/05/c...

3 months ago 1 0 0 0

With 3 agents running in the background I always hit my Claude Code token limit in 15 minutes.

This is unsustainable.

I’ll be building up my home lab this year to move Ollama to a beefy server and shift my work there.

Then see what I can do until open source catches up.

#ollama #claudecode

3 months ago 1 0 0 0
Advertisement
Preview
From Llamas to Avocados: Meta's shifting AI strategy is causing internal confusion Meta’s push to develop its next frontier model, codenamed Avocado, under new AI leadership is creating internal friction as it races rivals OpenAI and Google.

Meta bails on open source AI.

#meta #llama #avocado #opensource #closedsource

www.cnbc.com/amp/2025/12/...

4 months ago 3 0 0 0

Of course there is the danger that the review itself is a hallucination. If I were really worried I'd have another agent do a second review.

4 months ago 0 0 0 0
Post image

To avoid posting quotes that were hallucinated by one AI agent, I've built another to review each quote and assign a status, along with an explanation. In this case, the agent found that the original attribution was misidentified.

#aiagents #workflow #verification #citation #n8n #anthropic #gemini

4 months ago 1 0 1 0
Photo of a magic 8 ball with a 'goatse' version of the OpenAI logo and the words ChatGPT on it, next to a white box that says "ChatGPT offline version". Inset are photos of the 8-ball responses such as "That's a great question", "You're 1000% right" and "Too many requests try again later" and then another photo from the back of the box of a holographic authenticity sticker that says FAKE.

Photo of a magic 8 ball with a 'goatse' version of the OpenAI logo and the words ChatGPT on it, next to a white box that says "ChatGPT offline version". Inset are photos of the 8-ball responses such as "That's a great question", "You're 1000% right" and "Too many requests try again later" and then another photo from the back of the box of a holographic authenticity sticker that says FAKE.

After much research and development I have made an offline version of ChatGPT.

Now you can save water and electricity while navel-gazing, and carry one of the world's most powerfully annoying AI chatbots in your pocket.

4 months ago 3160 1169 37 41