
Posts by Carlo Daffara

In conclusion, I think that the real score would definitely be much lower than a pass. Not to imply that an LLM (or whatever comes after) will not be able to do it in the future, but that moment is undoubtedly not today.

1 year ago 1 0 0 0

A2 is equally flawed. It gives the right solution and a proof for n=1, but for n>=2 it basically bails out with "it's much harder to achieve a neat identity". So again, if the test requires a proof for the solution, this is definitely not it.

1 year ago 0 0 1 0

For those who heard that o1 aced the Putnam test, well - no, it did not. For A1 it does demonstrate that n=1 has infinitely many solutions and that n=2 has no solution, but it gives no real demonstration for n>2. It would probably not score as a valid pass.

1 year ago 1 0 1 0

Monday. It's another Monday.

1 year ago 1 0 0 0

I am testing QwQ-32B (using this quantized version: huggingface.co/sbeltz/QwQ-3... ) on my lowly AMD device. Oh boy, does it perform. It is very, very chatty, takes a looong time to think, but it reaches the right answer at the end.

1 year ago 2 0 0 0
Understanding Quantum Technologies on arXiv and Amazon I published the 2024 and 7th edition of the book

If you're curious about the state of #quantumcomputing, one of the resources I find myself recommending (and referencing) is Olivier Ezratty's "Understanding Quantum Technologies".

It's available for free on arXiv, or via Amazon for the hardcopy versions.

www.oezratty.net/wordpress/2...

1 year ago 7 1 0 1
Image of a paper titled "Can apparent superluminal neutrino speeds be explained as a quantum weak measurement?" The abstract says simply, "Probably not."

Love this abstract!

from arxiv.org/ftp/arxiv/pa... 🧪

1 year ago 120 29 8 2

An all new Econ of AI starter pack (by @arpitrage.bsky.social) : go.bsky.app/DfnDyqb

1 year ago 21 9 4 0

If there's one thing at which Bluesky excels, it is bringing together people who are having fun doing what they love. I can't imagine a list like this (fun economists) on Twitter anymore.

1 year ago 1 0 0 0
OSOR Open Consultation for the draft Open Source Handbook The Open Source Observatory is launching a one-month consultation to further develop and refine its recently published draft Handbook for public administrations.

As I'm finally settling in, here are two upcoming events I'm working on right now:

1๏ธโƒฃ OSOR Handbook Consultation and workshop (12 Dec) interoperable-europe.ec.europa.eu/collection/o...
2๏ธโƒฃ FOSDEM Devroom on EU Open Source Policy CFP โ€“ Deadline 1 December! softwarefreedom.net/fosdem-25-cfp

1 year ago 4 1 0 0

Ok, this is my first toe in the water here. And I will start with a question: any suggestions on whom to follow among those currently researching the economic impact of GenAI/AI as a horizontal tech? As a start, I looked at @erikbryn.bsky.social's profile here and followed most of the people he follows.

1 year ago 5 1 1 0