Advertisement · 728 × 90

Posts by Adi Simhi

Check out our new paper, investigating phenomena (hallucination, refusal, and sycophancy) both externally and internally! Showing a high correlation between the two!

1 month ago 4 0 0 0

ManagerBench was accepted to #ICLR2026🎉
Check it out⬇️

2 months ago 1 0 0 0

Check out our new paper on evaluating LLM agents on their preference for achieving their goal and avoiding human harm, called ManagerBench👔

6 months ago 2 0 0 0

🔍Check out our paper "Trust Me, I’m Wrong: High-Certainty Hallucinations in LLMs", at arxiv.org/pdf/2502.12964 and code at github.com/technion-cs-...

1 year ago 3 0 0 0

What do you think? 🤔
Could high-certainty hallucinations be a major roadblock to safe AI deployment? Let’s discuss! 👇

1 year ago 1 0 1 0

🔮 Takeaway:
We need new approaches to understand hallucinations so we can mitigate them better.
This research moves us toward deeper insights into why LLMs hallucinate and how we can build more trustworthy AI.

1 year ago 4 0 1 0

💡Why does this matter?
- Not all hallucinations stem from uncertainty or lack of knowledge.
- High-certainty hallucinations appear systematically across models & datasets.
- This challenges existing hallucination detection & mitigation strategies that rely on uncertainty signals

1 year ago 3 0 2 0
Post image

🛠️How did we test this?
We used knowledge detection & uncertainty measurement methods to analyze when and how hallucinations occur.

1 year ago 2 0 1 0

🚨Key finding:
LLMs can produce hallucinations with high certainty—even when they possess the correct knowledge!

1 year ago 2 0 1 0

🔍The problem:
LLMs sometimes generate hallucinations - factually incorrect outputs. assuming that if the model is certain and does not lack knowledge it must be correct.

1 year ago 2 0 1 0
Advertisement
Post image

🚨New arXiv preprint!🚨
LLMs can hallucinate - but did you know they can do so with high certainty even when they know the correct answer? 🤯
We find those hallucinations in our latest work with @itay-itzhak.bsky.social, @fbarez.bsky.social, @gabistanovsky.bsky.social and Yonatan Belinkov

1 year ago 21 10 3 2