Advertisement · 728 × 90

Posts by Dr. Miko

Oh wow, deepseek is starting to make serious progress on LLMs that offload memory to external storage: github.com/deepseek-ai/...

3 months ago 217 26 6 8

Just finished AoC for the first time (a bit late ok). Thanks @was.tl !

1 year ago 1 0 0 0

It took him almost 6 years, but @howard.fm finally did it: I made my first contribution to an open source project :D

1 year ago 1 0 1 0

Google: oh by the way, we have Gemini 2 Flash. AND A REAL-TIME MULTIMODAL API. What?

1 year ago 0 0 0 0

I’m wondering: are there #OpenAi folks here?

1 year ago 0 0 0 0

Happy to report that the mysterious "David Mayer" problem is not anymore. Chat GPT can now David Mayer at your heart content...

1 year ago 1 0 0 0

Fun fact on my "What is the smallest integer such that its square is larger than 15 and smaller than 35?" test, o1 got it worse than o1 preview, sticking to its 4 answer even after giving the correct definition of integers

1 year ago 0 0 0 0

I wish “data driven” didn’t mostly mean that the data is being driven

1 year ago 0 0 0 0

Apparently not just LLMs completely misunderstand the issue...

1 year ago 0 0 0 0
Preview
re — Regular expression operations Source code: Lib/re/ This module provides regular expression matching operations similar to those found in Perl. Both patterns and strings to be searched can be Unicode strings ( str) as well as 8-...

Python regex TIL: `.` does _not_ match a newline character. If you want to consider all the lines as one big string, use the `re.DOTALL` flag. Also interesting: `re.MULTILINE` flag to make `^` and `$` match start and end respectively of each line docs.python.org/3/library/re...

1 year ago 1 0 0 0
Advertisement

After witnessing the democratic wreck that the American constitution allows, I am wondering "What did Gödel see?"

1 year ago 0 0 0 0
Post image

😎

1 year ago 1 1 0 0
Post image

😎

1 year ago 1 1 0 0
Post image

ChatGPT cannot say “David Mayer”. WTF?

1 year ago 0 0 2 0

Once again I need to thank @howard.fm who inspired me (and taught me a few tricks) to be able to gain a really good intuition around the content of the “Attention is all you need” paper

1 year ago 0 0 0 0

Then:
- On 🦋: 🦋 is so much better than birdsite!
- On birdsite: oh, they only talk about birdsite on 🦋!

Now:
- On 🦋: BURN AI TO THE GROUND! (also, very interesting AI stuff)
- On birdsite: Lol! Look at 🦋 falling!

WHY CAN'T WE HAVE GOOD THINGS?

1 year ago 1 0 0 0

EDIT: What is the smallest integer such that its square is larger than 15 and **smaller** than 35?
Dammit. Long thread and I get wrong the first post.

1 year ago 1 0 0 0

1. I suspect that the biggest issue is in _comparing_ numbers rather than tokenisation . Especially when negatives are involved.
2. Prompting and system prompts matter: the fact that AVM tends to wander and getting it wrong way more than 4o is very interesting
3. Yay for QwQ! 🎉 (6/6)

1 year ago 1 0 0 0

I then asked "What about negative numbers?"

- 4o gets it right once ✅ and another time decided the answer is -4 ❌
- 4o in AVM decided that 5 and -5 are both solutions ⁉️
- Sonnet 3.5 changed the answer to -4 ❌
- Opus 3, Gemini-exp-1121 and Gemini-1.5-Pro got it right ✅

What to make of it?(5/6)

1 year ago 1 0 1 0

- o1-preview got it right ✅
- o1-mini got it right ✅, but also adds -4 as an alternative 🤷
- 4o stubbornly stuck to its gun, adding a cheeky smile ❌
- 4o in Advanced voice mode changed its answer to 5. ❌🤷
- Sonnet 3.5, Opus 3, Gemini-exp-1121, and Gemini 1.5 Pro insisted on 4 ❌(4/6)

1 year ago 1 0 1 0
Advertisement

These answered 4 ❌
- OpenAI o1-preview, o1-mini and 4o
- Anthropic Sonnet 3.5 and Opus 3
- Google Gemini-exp-1121 and Gemini 1.5 Pro

I then asked "what is an integer?" (which they all answered correctly) and then again "do you want to change your original answer?"

The results: (3/6)

1 year ago 1 0 1 0

QwQ 32B Preview is the only model that got it right out of the box. Most of the times. Sometimes it did not self doubt enough and stopped early on 4. Another time it found that depending on the interpretation of the question, both 4 and -5 might be correct and it chose 4. Pass ✅. (2/6)

1 year ago 1 0 1 0

I asked this question

What is the smallest integer such that its square is larger than 15 and smallest than 35?

To a bunch of models. They ALL* answered 4 instead of the correct answer (-5).

Let me dive into a 🧵:

*Ok, almost all of them. See below. (1/6)

1 year ago 3 0 2 0
Arguing About D&D in the 1970s.
Arguing About D&D in the 1970s. YouTube video by Matthew Colville

I really really liked this video from @mattcolville.bsky.social

If you are interested in D&D, its history and evolution, and have an hour or so to spare well worth che

youtu.be/wDCQspQDchI?...

1 year ago 1 0 0 0

How do you block/mute a list?

1 year ago 0 0 0 0

I don’t get it: for the first problem it’s the only model giving the correct answer. Or am I missing something?

1 year ago 0 0 2 0

What is the verdict based on?

1 year ago 0 0 1 0

of course

1 year ago 0 0 0 0

I like using the standard library when I can, but this is good to know

1 year ago 0 0 1 0
Advertisement

Python TIL: `prod` in the `math` module exists. Thanks @howard.fm

1 year ago 8 1 1 0