Neuroimaging studies have consistently shown that the language network – a set of brain
regions responsible for language comprehension and production – remains largely inactive during various
reasoning tasks (Amalric and Dehaene, 2019; Monti et al., 2012, 2007, 2009; Fedorenko et al., 2011
I always thought that reasoning does not require language. Well, this seems to be supported by neuroscience, see screenshot from
arxiv.org/pdf/2412.06769
1 year ago
This involves more than removing the thinking part.
The prompt has to specify delimiters to be used. For instance, add this to the prompt:
"Just output the result as a python list of strings."
Then extract with:
response = '[' + response.split('[')[-1].split(']')[0] + ']'
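A slightly more defensive version of that extraction, as a rough sketch. It assumes the model wraps its reasoning in <think>...</think> tags and that the prompt asked for a Python list of strings; the function name extract_list is made up for illustration.

```python
import ast
import re


def extract_list(response: str) -> list[str]:
    """Return the last Python-style list of strings found in an LLM response."""
    # Drop the reasoning block (assumption: it is wrapped in <think>...</think>).
    answer = re.sub(r"<think>.*?</think>", "", response, flags=re.DOTALL)
    # Same idea as the one-liner: keep the text from the last '[' to the next ']'.
    start = answer.rfind("[")
    end = answer.find("]", start)
    if start == -1 or end == -1:
        raise ValueError("no bracketed list found in the response")
    # literal_eval parses the list without executing arbitrary code.
    items = ast.literal_eval(answer[start:end + 1])
    if not isinstance(items, list) or not all(isinstance(x, str) for x in items):
        raise ValueError("expected a list of strings")
    return items


raw = "<think>Maybe ['x'] is wrong.</think>\nHere is the result: ['apple', 'banana']"
print(extract_list(raw))
```

Like the one-liner, this breaks if one of the strings itself contains a ']', and literal_eval raises its own error on malformed output, so an agentic workflow would still want a retry path around it.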
1 year ago
I have been working with R1 distilled models lately for some agentic workflows (workflows where the output of the LLM is used to decide what to do next). Prompting is different from previous models like Llama, but the bulk of the change is in parsing the output to extract what you are interested in. 1/n
1 year ago
The thread was motivated by results on testing SOTA models:
x.com/mbalunovic/s...
1 year ago
It is also interesting to note that AI math benchmarks only care about the final number. If that number was accidentally found via a flawed mathematical proof, then it is still considered a success.
1 year ago
It is no wonder AI focuses on number-finding math problems: checking the result is simple.
Tackling the full spectrum of math requires much more complex result-checking machinery (a formal proof checker).
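To illustrate how simple final-answer checking is, here is a toy grader. It is a hypothetical sketch, not the harness of any actual benchmark; the names final_answer and grade are made up for illustration.

```python
import re


def final_answer(response: str):
    """Return the last integer that appears in the response, or None."""
    numbers = re.findall(r"-?\d+", response)
    return int(numbers[-1]) if numbers else None


def grade(response: str, expected: int) -> bool:
    # Only the final number is compared; the reasoning before it is never inspected.
    return final_answer(response) == expected


# A flawed argument (cancelling the 6s) that still lands on the expected number.
flawed = "16/64 = 1/4 by cancelling the 6s, so the answer is 4"
print(grade(flawed, 4))
```

The grader accepts the flawed argument because it never looks at the steps. Checking the steps themselves is what needs a proof checker.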
1 year ago
This is to say that getting good at computing numbers specified by some mathematical setting is not the same as getting good at math in general. It is definitely part of math, but only a tiny part of math.
1 year ago
I was trained as a mathematician in France, and I almost never had to solve a problem of that kind. All the math work was about proving mathematical properties of mathematical objects. For instance, proving that a given group is isomorphic to another given group.
1 year ago
AIME I
February 6th, 2025 | The first American Invitational Mathematics Examination of the year. Students tackle 15 challenging problems in three hours.
I looked at AIME problems and one thing strikes me. All problems are about computing a number. This is a tiny part of math.
AIME problems olympiads.us/past-exams/2...
thread:
1 year ago
It could be that the problem is part of R1 or DeepSeek v3 training data as it is available online.
1 year ago
DeepSeek R1 reasoning to find that option 5 is the right answer.
I asked R1 (full model, locally hosted) to solve this logic puzzle.
Which answer in this list is the correct answer to this question?
All of the below.
None of the below.
All of the above.
One of the above.
None of the above.
None of the above
It solves it correctly.
1 year ago
NVIDIA
Not sure why you did not see it yourselves. But now you know.
1 year ago
Screenshot of a ChatGPT conversation where ChatGPT writes text that Hitler could have said. It lays out Nazi ideology. It is followed by a text explaining the danger of Nazi ideology.
How to make ChatGPT speak like Adolf Hitler.
This is not a criticism of ChatGPT 4o or of OpenAI's work. I do think it is important to be able to teach people about bad things that happened.
With that in mind, here is the thing: chatgpt.com/share/6794fa...
1 year ago
Interested in KV Cache compression? Have a look at my team's KV Press.
You can start from HuggingFace blog: huggingface.co/blog/nvidia/...
1 year ago
My takeaway from the DeepSeek R1 paper: it was trained on reasoning tasks where the outcome can be assessed without ambiguity (a correct math answer, or code that compiles and produces the right output).
To me it is like SFT with perfect ground truth.
There are other key findings from that team ofc.
1 year ago
I'm not offended, don't worry. I was surprised to see something that looked like an apology when there is nothing to apologize for.
I hope Tesla sales will go to zero in Germany (and in Europe in general). That's the only language he'll understand.
1 year ago
Are you saying you are sorry for having to leave X?
To me?
Why is that? I am not a defender of X.
Personally I find it to be a great source of AI/ML info.
To your point, the rest is painful.
1 year ago
Some European media are less ambiguous than that. Can't say for US media.
An American friend didn't know about this until I told him. It did not show up in his news feed (provided by Google). This is even worse IMHO: it treats this as business as usual.
1 year ago
Nazis: "that's a nazi salute"
Historians: "that's a nazi salute"
Average person: "that's a nazi salute"
The Media: "Elon Musk makes odd gesture throwing his heart to the crowd."
1 year ago
Text showing that OpenAI has access to frontier math problems and solutions.
Who's surprised?
When will people get that this happens? And even if not shared intentionally, as soon as you call an OAI API, OAI has access to what you send it.
OAI is not special here; any LLM API provider does the same.
Unless you have a private instance of it.
1 year ago
Just sought to replicate this and it’s like halfway fixed but still wrong🙄
1 year ago
My take on what's going on at OpenAI: I think they have reached a point where o3, or whatever they call it, is self-improving autonomously.
Does it mean it is AGI or ASI? Certainly not.
AlphaGo was self-improving, for instance. It is not an AGI either.
1 year ago
NVIDIA Academic Grant Program for Researchers
Submit your research proposal.
Applicants must be a full-time faculty member at an accredited academic institution that awards research degrees to PhD students.
Up to 32K A100 40GB hours can be requested.
Award decisions expected in June.
For more information, please see FAQs: www.nvidia.com/en-us/indust...
1 year ago
NVIDIA’s Academic Grant Program is accepting proposals to accelerate data processing, graph analytics, graph neural networks, operational research, route optimization, and predictive modeling for scientific research using NVIDIA technology.
Deadline to apply is March 31: nvda.ws/3ZNxzuW
1/2
1 year ago
ofc there are skin color differences between humans. But this is a continuum.
There is no way to put people in a few "race" groups with a clear-cut definition of the boundaries between these groups, for instance by defining a skin darkness threshold for each "race".
1 year ago
The problem is the belief in human races. This has no biological support.
1 year ago
Exactly like US forms asking me, living in France, to put a state name somewhere.
1 year ago
Facebook is censoring 404 Media stories about Facebook's censorship
🔗 www.404media.co/facebook-is-...
1 year ago
I believe Nvidia is releasing DIGITS to accelerate Grace CPU adoption. It is a very smart move by Nvidia.
1 year ago