I misinterpreted what I'd read. I saw "simulated user" and inferred that it meant an air-gapped system. However, the model never had access to its model weights, which would be the air gap in terms of self-replication.
Posts by Middletown Books
Duplicated how? It never had access to its own model weights.
Of course, it also makes me wonder if #Mythos has an anthill inside sticker like Hex from the #Discworld #punes #AI
Didn't have access to its own model weights, at any rate.
Claude Mythos seems both more #humorous and better aligned than many previous #Anthropic models (based on the document). It also apparently has a #punderful new ability lacking in #Opus: original #pun creation. Can't hate that, in my book.
Wasn't it a virtual sandbox, though? I think that's what I recall reading, anyway.
Link for your Medium article about #OpenAI and its findings: medium.com/the-imperfec...
Mythos is truly amazing. I see the patterns here:
bsky.app/profile/impe...
In all the #cybersecurity hype about the #Anthropic model Mythos, it's important to look at what may get neglected in superficial, headline-making coverage. This essay is a great attempt to do that. #AIResearch #ResponsibleAI #Mythos
This looks interesting #ResponsibleAI #CitizenScience #AIResearch
For those interested in #AIResearch - not the numbers from the companies, but analysis of what is being neglected in stories about research papers, check out medium.com/@miravale.in... Highly recommended by me, based on what I've read. Things are moving quickly and important points are being ignored
Tweets disappear. Emails get buried. But a postcard on a desk? That gets noticed. civicmail.org/about-campai... via @civicmail.bsky.social #bookbans #libraries
I have them all in print form and I'm still tempted. I can't recommend Pratchett enough 🥰
I only own a few, and mostly read physical books from the library vs. digital, but this is mightily tempting for me as well.
Hey, all! Sorry that I've mostly been using reddit lately and have been remiss on posting #PratchettQOTD here for some time now. I ran across a #Discworld bundle, though, and shared it to my reddit page (39 of the 41 volumes in the series are included in the bundle) - www.reddit.com/user/Middlet...
www.reddit.com/r/books/comm... #BlackHistoryMonth #BlackBookSky
In celebration of both #NationalLibraryLoversMonth and #BlackHistoryMonth I posted a link to r/books on some history of Black librarianship in the U.S. Dorothy B. Porter basically desegregated the Dewey Decimal system, for example. #libraries #librarians #blacksky www.reddit.com/r/books/comm...
Hey @sdcushing.bsky.social - let's chat sometime soon
Okay, thank you. Please be patient, I'll need to populate the beginnings of a third starter pack for college libraries, international libraries, etc.
I'm getting back on social media a bit more (but also want to read more #booksky material), so I'm seeking #libraries to add to my library starter packs which I made last year. If you're a library account & not in my packs, or know of one LMK. bsky.app/starter-pack... bsky.app/starter-pack...
2026 is looking up for #Bluesky - and this looks like a good feed to follow
ATM, I'm on a #classicrock posting jaunt there. Getting some top 25 hits, but nothing top 10 so far. Will take play requests here, though. ;)
A couple of my top 10 greatest hits from #reddit involve #bluesky #starterpacks so it seems appropriate to do a year end review and post them here: www.reddit.com/r/BlueskySoc... www.reddit.com/r/BlueskySoc...
I lack the foundation to gain insights beyond areas where one LLM might do better than another. As LLMs advance in capabilities, I'm sure we'll see more crossovers between specialized STEM fields. Note that the cross-pollination was human-led in this case; I was just seeing how well AI could follow.
As a recent example, the paper in @quantamagazine.bsky.social crosses between #StringTheory and #algebraic #geometry. Frontier LLMs, due to their training data, are a lot better at making crossovers like that than the average human, so that gives them an edge vs. an average grad student.
#Sonnet tends to be my prompt engineer & peer reviewer. For explorations, I've mainly been using GPT 5+ & Gemini 2.5+. Cross disciplinary explorations & programming seem to be the main strong points so far for the frontier models, based on my research. The models are well rounded grad student types.
Mainly open #math problems which #AI #GapAnalysis indicates #frontier LLMs might potentially make some progress on. Ramsey numbers, Hadamard matrices, covering arrays, optimal packing problems, branching problems... the list goes on. So far, nothing really breakthrough in terms of results, though.
I've tried older versions of #Sonnet for various writing tasks, including #creative #writing & custom prompt bots. The last few months, my main focus has been on testing #frontier #LLMs in terms of their #math capabilities and limits, so that has been my main use case for the 4.5 version of Sonnet