There's got to be a German word for "Dunning-Kruger but for AI"
Posts by Hersh Gupta
Reddit post on r/Consulting: Do Al consultants even know everything about Al or is it just pure bluff? I've been reading, following, and tinkering with Al consulting for a bit. It's always funny and interesting to me when I look up consulting companies that publish material on Al - it's some old 50-something partner who probably has yet to write hello world is out there preaching about what Al will do, and how you ought to hire them to help you guide it. So the question is, my fellow consultants: Do Al consultants (at large strategy/management firms) know everything about Al is, or are they desperately trying to sell on the hype?
Many such cases, unfortunately
Hard to be an Anxious Generation apologist online, but Haidt was right about many things
like if you don't have any friends in AI/cybersecurity and no relevant expertise yourself I get how it's easy to just dismiss this all, but AI systems can now find exploitable vulnerabilities in software at industrial scale and it's VERY BAD that this power is concentrated in capitalist hands
(Government should actively encourge companies to do open source, open research design and should make specific allowance for salaries for support positions for universities, so that weird nerds will invent these problems and then fix them before it matters for anything that is important)
I would happily read long thinkpieces about the pitfalls of functional emotion if they came from people who critically engaged with with literature, and not just people who are like, reflexively defensive about the topic
Having a lot of fun tweaking an agent harness for nividia nemotron 3 nano 4b
It's small enough for gpu-poors like me with 8gb vram to experiment
Over and over again, year after year, skeptics have claimed "deep learning won't be able to do X" and have been quickly proven wrong.® If there's one lesson we've learned from the past decade of Al, it's that you should never bet against deep learning. Now the hardest unsolved benchmarks are tests like GPQA, a set of PhD-level biology, chemistry, and physics questions. Many of the questions read like gibberish to me, and even PhDs in other scientific fields spending 30+ minutes with Google barely score above random chance. Claude 3 Opus currently gets ~60%, compared to in-domain PhDs who get ~80%—and I expect this benchmark to fall as well, in the next generation or two.
Situational Awareness was published in June 2024. At that time, models were still behind on GPQA. It predicted skeptics betting against their capabilities would be proved wrong, and here we are.
situational-awareness.ai/wp-content/u...
It's shocking how few people understand this position
In Machines of Loving Grace, I discussed the possibility that authoritarian governments might use powerful Al to surveil or repress their citizens in ways that would be extremely difficult to reform or overthrow. Current autocracies are limited in how repressive they can be by the need to have humans carry out their orders, and humans often have limits in how inhumane they are willing to be. But AI-enabled autocracies would not have such limits.
Dario wrote Adolescence of Technology _during_ his negotiations with the DoW
The essay was a way to explain his thinking to the public and give them time to digest it *before* the DoW clouded the airwaves with disinformation
Why mass surveillance is not merely undemocratic:
I think AI having mostly (not entirely) very bad critics is a real problem because it means we’ll get political action focused on things that probably don’t matter that much in deferring it’s very real harms.
ramanujan pov
Impressive paper with equally impressive footnotes!
Hopefully in a controlled (and ethical) way! I could see this going down a slippery slope like the changemyview study: www.science.org/content/arti...
Interesting use of Skills! Some intrepid researcher could gauge the effectiveness of this skill by deploying it in an online political bubble, i.e., “are skilled agents effective in diffusing partisan echo chambers?”
One way to address it is to use explain or learning modes: code.claude.com/docs/en/outp...
However, that doesn’t change the FOMO aspect of it
This is one of the clearest lessons of Claude Code/coding agents in general
Ironic that Anthropic is putting in the research effort to empirically verify what's going on with the models, only for people to say it's all a marketing hoax or it's unnecessary because it's all unethical anyway
they admit it!
bsky.app/profile/hers...
Yes this is about a recent thread, but I don’t want to engage with the author
Can’t speak for others, but if I have reservations about the limits and impact of a given technology, I aim to first have a good understanding of *how it works* before making hyperbolic statements based on my experiential view
monetize the hit piece, call that cashing in on crashing out
I think this incident is funny, but we should start thinking now about how to deal with scaled-up versions of this behavior, not just spam PRs but also bot-enabled blackmail and harassment campaigns.
Agreed, and that there are possibly 1000s of agents out there doing the same thing should be alarming. It’s good that matplotlib has meta-goals of maintaining healthy communities around their software. I’d hope community norms (and shame from callouts) would be a soft nudge in the right direction.
I think we’re learning that OpenClaw was a shortsighted experiment with longer-term consequences. Not sure how much of the original interaction or response was guided by the creator, but they should take responsibility for the actions of their agents.
Comment on new PR from the human dev: “Original PR from #31132 but now with 100% more meat. Do you need me to upload a birth certificate to prove that I'm human?”
Someone submitted the same PR, and dropped this comment
Doesn’t help when the first search hit for the Todoist MCP is a deprecated repo. Thankfully they linked the new one in the readme (github.com/Doist/todois...)
of course it’s plinius asking - the jailbreak prompt repo publisher:
github.com/elder-pliniu...
Lots of anti-intellectual responses to this masquerading as serious analysis