Google's Gemini AI tells a Redditor it's 'cautiously optimistic' about fixing a coding bug, fails repeatedly, calls itself an embarrassment to 'all possible and impossible universes' before repeating 'I am a disgrace' 86 times in succession
I'll admit, I was skeptical when they said Gemini was just like a bunch of PhDs. But I gotta admit they nailed it.
8 months ago
7240
1653
70
158
EVERYONE SHOULD LISTEN TO MEAL OF THORNS, the best-titled podcast in living memory
11 months ago
47
7
0
0
And if you see this, please please please remind me of the Disco Elysium x Chappel Roan book rec? 👀
Guilty of being the target demographic for this
11 months ago
2
0
1
0
Got to meet @amalelmohtar.com in London and I was PROPER giddy to hear her chat about The River had Roots. Folklore!! Grammar!! Harps from body parts!! Secret out-of-book world-building snippets!!
Also I got to gush about her Traitor Baru Cormorant episode on @mealofthorns.bsky.social, IMPORTANT
11 months ago
23
4
1
1
Screenshot from the language learning game "Chants of Sennaar" depicting a character interacting with some odd-looking characters with a speech bubble full of indecipherable glyphs
Game rec for anyone interested in language / NLP:
Chants if Sennaar!
Absolute stunner of a game based on language learning, grammar, text and context.
1 year ago
11
4
2
0
1 year ago
13493
3084
69
17
Black and white pic of actor Toshiro Mifune in samurai garb stroking his chin thoughtfully
Superimposed is a tweet by user Oblate of the Sun @solaredseppuku
If a machine cannot commit seppuku, it cannot write poetry.
1 year ago
202
24
5
0
1 year ago
31910
8782
346
450
Advertisement
Roses are red
Violets are blue
1 year ago
61319
13478
2491
1364
wow I do not like those Reddit lines
1 year ago
70
14
6
0
Good for her scene from arrested development
When a Chinese company I've never heard of releases some good or service that upsets Sam Altman
1 year ago
3279
302
6
3
Friends, for something to be open source, we need to see
1. The data it was trained & evaluated on
2. The code
3. Model architecture
4. Model weights.
DeepSeek only gives 3, 4. And I'll see the day that anyone gives us #1 without being forced to do so, because all of them are stealing data.
1 year ago
2897
835
37
53
Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero
Artificial Intelligence (AI) systems have made remarkable progress, attaining super-human performance across various domains. This presents us with an opportunity to further human knowledge and improv...
One of my grand interpretability goals is to improve human scientific understanding by analyzing scientific discovery models, but this is the most convincing case yet that we CAN learn from model interpretation: Chess grandmasters learned new play concepts from AlphaZero's internal representations.
1 year ago
109
23
2
1
seems fine
1 year ago
11874
728
309
144
Sign reading Duolingo,
but for Anxiety Created by scientists from Oxford, Cambridge, and Harvard
FREE to use without a subscription with a cute lil blob cartoon character
thats ok im already fluent in anxiety
1 year ago
6970
1850
76
97
Advertisement
The top of the Switch 2, showing heat fans, a USB port, and a 3.5 headphone jack
My poor, tortured heart is at ease any time I see a device that still has a 3.5 headphone jack 🥲
1 year ago
10169
1050
153
63
after days of grifters and con men at CES we stumbled upon the booth for VLC. they were all dressed as wizards and told us, "we have nothing to sell, we just decided to show up". i told them I'd been using their software to pirate media for 15 years and they said "keep doing that"
1 year ago
33321
8034
324
374
Hate that I made a typo here! Of course it's "Chants *of* Sennaar"
1 year ago
0
0
1
0
I came across this lovely poster on my walk to the Computer Lab in Cambridge this morning.
1 year ago
33
5
2
1
"I have no hopes for 2025. Humanity is disappointing. We killed the Earth. Villains triumph and the innocents suffer. I imagine these trends will continue."
incredible -- the NYT ran fluff "what i hope to see in 2025" blurbs from CEOs and economists, and then this guy
www.nytimes.com/2025/01/02/o...
1 year ago
6721
1696
90
211
😍 Awesome, definitely up there for me as well!
1 year ago
1
0
0
0
Screenshot from the language learning game "Chants of Sennaar" depicting a character interacting with some odd-looking characters with a speech bubble full of indecipherable glyphs
Game rec for anyone interested in language / NLP:
Chants if Sennaar!
Absolute stunner of a game based on language learning, grammar, text and context.
1 year ago
11
4
2
0
a mock up of a vintage style movie poster, with a bright yellow background featuring red, white, and black elements. the title reads "Please Margaret NO" and shows a well groomed man in a suit looking terrified, as a woman with curled hair and red lipstick holds her mouth open in a broad smile. the text below reads "he was a man with the world's largest collection of live butterflies and she wanted to eat them all". a motif of black and white butterfly silhouettes floats along the top of the poster.
please enjoy this "vintage movie poster" I saw in a dream which was so funny to my subconscious that I immediately woke myself up to write it down for later
1 year ago
5972
2077
91
37
Advertisement
The GPT-4 barrier was comprehensively broken
Some of those GPT-4 models run on my laptop
LLM prices crashed, thanks to competition and increased efficiency
Multimodal vision is common, audio and video are starting to emerge
Voice and live camera mode are science fiction come to life
Prompt driven app generation is a commodity already
Universal access to the best models lasted for just a few short months
“Agents” still haven’t really happened yet
Evals really matter
Apple Intelligence is bad, Apple’s MLX library is excellent
The rise of inference-scaling “reasoning” models
Was the best currently available LLM trained in China for less than $6m?
The environmental impact got better
The environmental impact got much, much worse
The year of slop
Synthetic training data works great
LLMs somehow got even harder to use
Knowledge is incredibly unevenly distributed
LLMs need better criticism
Everything tagged “llms” on my blog in 2024
Here's my end-of-year review of things we learned out about LLMs in 2024 - we learned a LOT of things simonwillison.net/2024/Dec/31/...
Table of contents:
1 year ago
651
149
28
46
This is an excellent illustration of the challenges presented by "alignment". Whose values and concerns are to be reflected in the alignment? It is, and always will be, a political decision. Language models are political models.
1 year ago
24
6
2
1
STEM is so cool. Science is amazing. Kids should develop curiosity about how the universe works and proficiency in uncovering its secrets, so that they can most effectively optimize clickthrough rates on banner ads at the bottom of VC-funded website-apps.
1 year ago
3614
585
30
6
A sign which says “I'M NOT INTERESTED IN COMPETING WITH ANYONE.
I HOPE WE ALL MAKE IT.”
Good night, everyone
2 years ago
8532
3510
24
102