To take a step in this direction, we propose a statistical perspective of XAI, focusing on inference, uncertainty quantification and hypothesis testing. But it is only a start!
(with @maximemeloux.bsky.social , Giada Dirupo, FranΓ§ois Portet) @cnrs.fr @getalp.bsky.social @cnrsalpes.bsky.social
Posts by Maxime Peyrard
Psychology, econometrics, or neuroscience, have faced similar difficulties and reacted by adopting methodological reforms and rigorous statistical (causal) frameworks.
We believe, it is now our turn to build the methodological guardrails turning XAI into a pragmatic science.
This affects feature attribution, probing, SAEs, and even causal analyses
Taking a statistical view, we argue that most interpretability queries are non-identifiable: multiple incompatible explanations fit the same computation, leading to false positives and poor generalization
New preprint: The Dead Salmons of XAI
Standard fMRI pipelines once detected predictive brain regions in a dead salmon! A warning about poor statistical methodology
Now, XAI faces its own issues: many methods can yield plausible explanations even for randomized networks
arxiv.org/abs/2512.18792
I'm recruiting multiple PhD students for Fall 2026 in Computer Science at @hopkinsengineer.bsky.social π
Apply to work on AI for social sciences/human behavior, social NLP, and LLMs for real-world applied domains you're passionate about!
Learn more at kristinagligoric.com & help spread the word!
I'm very happy to present our work "Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?" this afternoon at #ICLR2025! Come have a chat at stand #439 :)
What can be done?
π Stricter validity criteria?
π Maybe interpretability is inherently underdetermined? and we can only get control and predictability but not "understanding"
This is a fascinating topic, and we keep investigating. If you're interested, come and chat at ICLR!
We find a lot of identifiability issues:
- Multiple explanatory algorithms exists
- Even for one algorithm, there are many localizations in the network
Identifiability problems remain across scenarios: changing levels of over-parametrization, progress in training, multi-tasks, model size.
In our work, we stress-test the identifiability of research programs of MI with small MLPs and simple boolean logic tasks.
Why? It allows us to enumerate all possible explanations and see how many pass various MI testing criteria.
This brings us to identifiability. In statistics a property is identifiable if a unique value is compatible with the data. Identifiability matters because it is a prerequisite for doing statistical and causal inference.
Interpretability is also an exercise in causal inference!
Illustration of different strategies for mechanistic interpretability
Mechanistic Interpretability aims to produce statements like: "Model M solves task T by doing X."
To do so, many causal manipulations are performed to validate an explanation. But what if (many) other, incompatible explanations also pass the causal tests?
Abstract of the paper
Our paper "Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?" will be presented at #ICLR2025!
It's also the first paper of my first PhD student, congrats @maximemeloux.bsky.social ! π
blog: melouxm.github.io/MI-identifia...
An explanatory thread π§΅:
An assembly of 18 European companies, labs, and universities have banded together to launch πͺπΊ EuroBERT!
It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.
Details in π§΅
bc i haven't done so yet, i decided to burn any remaining bridge to the land of statistics. it wasn't statisticians nor statistics but it was me. i am simply not good enough to do statistics myself.
so, @peyrardmax.bsky.social and i decided to turn statistical estimation into supervised learning.
Check out our new paper on social determinants of on-campus food choice, now out in @pnasnexus.org!
academic.oup.com/pnasnexus/ar...
Hey, thanks for making it, can you also add me
I tried to find everyone who works in the area but I certainly missed some folks so please lmk...
go.bsky.app/BYkRryU
Thanks for creating the pack, I am also working on this topic :)