Advertisement · 728 × 90

Posts by Eivinas Butkus

Post image

The Causality in Cognition Lab -- a supportive, bluesky-colored team -- is looking for a predoc to join us! Here are infos about the lab (cicl.stanford.edu) and the position (careersearch.stanford.edu/jobs/iriss-p...). The application deadline is May 1st.

Please share, thank you 🙏

2 weeks ago 62 36 2 0

How attention saves energy in vision www.biorxiv.org/content/10.64898/2026.03...

1 month ago 8 4 0 0

🚨 #CCN2026 Proceedings submissions are open!
CCN 2026 again features an 8-page Proceedings track (alongside extended abstracts). Accepted papers will appear in CCN-Proceedings (CCN‑P) with DOIs on OpenReview.

2 months ago 33 22 1 6

Thrilled to start 2026 as faculty in Psych & CS
@ualberta.bsky.social + Amii.ca Fellow! 🥳 Recruiting students to develop theories of cognition in natural & artificial systems 🤖💭🧠. Find me at #NeurIPS2025 workshops (speaking coginterp.github.io/neurips2025 & organising @dataonbrainmind.bsky.social)

4 months ago 105 27 4 1

congrats!!

4 months ago 1 0 1 0
a red building on UPENN's campus photographed during the fall

a red building on UPENN's campus photographed during the fall

the Philadelphia skyline, with clear skies and autumn trees

the Philadelphia skyline, with clear skies and autumn trees

starting fall 2026 i'll be an assistant professor at @upenn.edu 🥳

my lab will develop scalable models/theories of human behavior, focused on memory and perception

currently recruiting PhD students in psychology, neuroscience, & computer science!

reach out if you're interested 😊

4 months ago 227 45 25 3

21/ Poster #2510 (Fri 5 Dec, 4:30-7:30pm PT) at the main conference. Also at the Mechanistic Interpretability Workshop at NeurIPS (Sun 7 Dec). Come chat!

5 months ago 1 0 0 0

20/ Thanks to the reviewers and the NeurIPS 2025 community. Thanks also to @zfjoshying, Yushu Pan, @eliasbareinboim (and the CausalAI Lab at Columbia University) for helpful feedback on this work.

5 months ago 1 0 1 0
Video

19/ Open questions: Do these findings extend to real-world LLMs, more complex causal structures? Could video models like Sora be learning physics simulators internally? We’re excited to explore this in future work.

5 months ago 1 0 1 0
Advertisement

18/ Limitations: This is an existence proof in a constrained setting (linear Gaussian SCMs, artificial language). Let us know if you have ideas on how to test this in real LLMs. What may we be missing?

5 months ago 0 0 1 0

17/ Broader implications: Even when trained using “statistical” prediction, neural networks can develop sophisticated internal machinery (compositional, symbolic structures) that support genuine causal models and reasoning. The “causal parrot” framing may be too limited.

5 months ago 0 0 1 0

16/ Three lines of evidence that the model possesses genuine causal models: (1) it generalizes to novel structure-query combinations, (2) it learns decodable causal representations, and (3) representations can be causally manipulated with predictable effects.

5 months ago 0 0 1 0
Post image

15/ Result 3 (continued): When we intervene on layer activations to change weight w_12 from 0 → 1, the model's predictions flip to match the modified causal structure. This suggests we’re decoding the actual underlying representation the model uses. (See paper for more quantitative results.)

5 months ago 0 0 1 0
Post image

14/ Result 3: We can manipulate the model's internal causal representation mid-computation using gradient descent (following the technique from Li et al. 2023). Changing the SCM weights using the probe produces predictable changes in the network’s outputs.

5 months ago 0 0 1 0
Post image

13/ Result 2: We can decode the SCM weights directly from the transformer's residual stream activations using linear and MLP probes. The model builds interpretable internal representations of causal structure.

5 months ago 0 0 1 0

12/ But does the model really have internal causal models, or is this just lucky generalization? We probe inside to find out...

5 months ago 0 0 1 0
Post image

11/ Result 1: Yes! The model generalizes to counterfactual queries about D_test SCMs, reaching near-optimal performance. It must have: (1) learned a counterfactual inference engine, (2) discovered D_test structures from DATA strings, (3) composed them together.

5 months ago 0 0 1 0
Advertisement
Post image

10/ The generalization challenge: We hold out 1,000 SCMs (D_test) where the model sees *only* interventional DATA strings during training and zero INFERENCE examples. Can it still answer counterfactual queries about these SCMs?

5 months ago 0 0 1 0
Post image

9/ We train a GPT-style transformer to predict the next token in this text. The key question: does it simply memorize the training data, or does it discover causal structure and perform inference?

5 months ago 1 0 1 0
Post image

8/ Our setup: ~59k SCMs, each defined by 10 ternary weights. We generate training data using the SCMs in an artificial language with two string types: (1) DATA provide noisy interventional samples and (2) INFERENCE — counterfactual means/stds.

5 months ago 0 0 1 0
Post image

7/ Our hypothesis: next-token prediction can drive the emergence of genuine causal models and inference capabilities. We tested this in a controlled setting with linear Gaussian structural causal models (SCMs).

5 months ago 0 0 1 0
Post image

6/ Pearl (Amstat News interview, 2023) and Zečević et al. (2023) acknowledge causal info exists in text, but argue LLMs are merely "causal parrots"—they memorize and recite but do not possess actual causal models.

5 months ago 0 0 1 0
Post image

5/ But natural language used to train LLMs contains rich descriptions of interventions and causal inferences. Passive data != observational data. Text has L2/L3 information!

5 months ago 0 0 1 0
Post image

4/ Pearl’s vivid metaphor from “The Book of Why” (p. 362): “Like prisoners in Plato's cave, deep-learning systems explore shadows on the wall and learn to predict their movements. They lack understanding that shadows are mere projections of 3D objects in 3D space.”

5 months ago 1 0 1 0
Post image

3/ Pearl's argument: DNNs trained on “passive” observations using statistical prediction objectives are fundamentally limited to associations (L1) and cannot reason about interventions (L2) or counterfactuals (L3).

5 months ago 0 0 1 0

2/ Paper: “Causal Discovery and Inference through Next-Token Prediction” (Butkus & Kriegeskorte)
OpenReview: openreview.net/pdf?id=MMYTA...,
NeurIPS: neurips.cc/virtual/2025...

5 months ago 1 0 1 0
Post image

1/ Can causal models and causal inference engines emerge through next-token prediction? Judea Pearl and others (Zečević et al. 2023) have argued no. We present behavioral and mechanistic evidence that this is possible. #neurips2025 #NeurIPS

5 months ago 7 1 1 0
Advertisement
Preview
Hippocampal encoding of memories in human infants Humans lack memories for specific events from the first few years of life. We investigated the mechanistic basis of this infantile amnesia by scanning the brains of awake infants with functional magne...

Why do we not remember being a baby? One idea is that the hippocampus, which is essential for episodic memory in adults, is too immature to form individual memories in infancy. We tested this using awake infant fMRI, new in @science.org #ScienceResearch www.science.org/doi/10.1126/...

1 year ago 482 166 19 21
Preview
Key-value memory in the brain Classical models of memory in psychology and neuroscience rely on similarity-based retrieval of stored patterns, where similarity is a function of retrieval cues and the stored patterns. While parsimo...

Key-value memory is an important concept in modern machine learning (e.g., transformers). Ila Fiete, Kazuki Irie, and I have written a paper showing how key-value memory provides a way of thinking about memory organization in the brain:
arxiv.org/abs/2501.02950

1 year ago 122 31 3 1

reminds me of Dennett's take that we should be building "tools not colleagues"

1 year ago 4 0 1 0