Don't miss out on new replications with the Zotero Replication Checker!
This is the current set of >1.6K studies in the FORRT Library of Replication Attempts (#FLoRA). Many of them belong to large-scale projects and are not even cited in the final report, so how to keep track of them?
Posts by Koki Ikeda
OK folks this reads to me like a stacked pile of nothing put in a box labeled 'wow, so interesting!' and if there is something anyone thinks I am missing (I'm talking to you, bluesky-doesn't-get-AI guys) I would like to know what it is. Here's how I read it: 1/n
www.anthropic.com/research/emo...
New preprint out today (osf.io/preprints/ps...). We tested whether AI agents are actually infiltrating online surveys.
Spoiler alert: they aren't
Thread ๐งต
[1/9]
The SCORE investigation of repeatability and credibility is a lot. There are a few ways to get your head around it.
1: The Nature collection includes 3 papers from SCORE, an amazing paper from @i4replication.bsky.social and several commentaries about the work.
www.nature.com/collections/...
1/
SCORE, a collaboration of 865 researchers, is now released as three papers in Nature, six preprints, and a lot of data (cos.io/score/). SCORE examined repeatability of findings from the social-behavioral sciences and tested whether human and automated methods could predict replicability.
1/๐งต A major update to our paper: "Scaling Reproducibility" w/ Leo Yang [Cross-posted from X]
We move beyond reanalyzing a single design to (almost) full-paper replication!
Paper: bit.ly/repro-ai
I describe it as such as a historian of disability because I want to remind scientists and the world that the history of science IS the history of eugenics, scientific racism and medical racism, and more.
Iโm not comforting them with the idea that it was a โpseudoscience.โ
This is your history.
Three years ago I was wondering whether LLM could act in โsocial groupsโ of any sort, and how such networks would resemble real ones.
A brilliant student, Nicola Zomer, came to me to ask for a thesis on exactly this question.
Today, that work is published in NPJ AI!
Paper: rdcu.be/e9gRH
1/๐งต๐งช
With Eugene Koonin, we propose a concept of โthe selfish ribosomeโ, under which evolution of life is viewed as a ribosomal takeover, where the ribosome evolved to consume most of the cellโs resources, while other cellular componentry ensures the propagation of the ribosome. arxiv.org/abs/2602.23268
Bots have made their way to Prolific experiments. Our lab has stopped online testing of adults entirely now for this reason - we want to know if what we study is real. Probably data collected 2-3 years ago are ok, but moving forward we just can't know. www.pnas.org/doi/10.1073/...
How horrible to be a CS grad student under pressure to submit multiple first author papers to every conference deadline, whether they feel ready or not. This serves no oneโs best interests in long run (science included). But lots of students appear to being getting advice itโs necessary to compete
Within the topic of AI alignment, there are a million tinier, but consequential, alignment choices.
This paper looks at the willingness of AI to engage in scientific misconduct (p-hacking). The most recent AIs resist instructions to p-hack and do good analysis, but the guardrails can be breached.
New preprint: โA dynamical perspective on biological reproductionโ
hal.science/hal-05491732
The prevailing view is that an organism reproduces by building a new organism from its genomic representation, like von Neumannโs self-reproducing machine. But...
1/11
๐ โ ๐งช The Story is Not the Science.
Code is submitted but rarely executed during peer reviewโan issue likely to worsen with research agents. ๐งโ๐ฌ
We introduce ๐๐๐๐ก๐๐ฏ๐๐ฅ๐๐ ๐๐ง๐ญ, an execution-grounded evaluation of narrative + execution. ๐๐๐ซ๐ข๐๐ฒ ๐ญ๐ก๐ ๐ฌ๐๐ข๐๐ง๐๐, ๐ง๐จ๐ญ ๐ฃ๐ฎ๐ฌ๐ญ ๐ญ๐ก๐ ๐ฌ๐ญ๐จ๐ซ๐ฒ.
1/n
I have plenty of reservations about the reliability of funnel-plots and z-curves and such, as well as their interpretation.....
But holy shit look at that.
I had intended to post something about this new Google DeepMind paper that appeared yesterday in Nature, but the press coverage has added to what there is to say. So this is a long ๐งต
www.nature.com/articles/s41...
If you tell an AI to convince someone of a true vs. false claim, does truth win? In our *new* working paper, we find...
โLLMs can effectively convince people to believe conspiraciesโ
But telling the AI not to lie might help.
Details in thread
An attempt to express how I principally use LLMs.
Rotating the Space: On LLMs as a Medium for Thought
sbgeoaiphd.github.io/rotating_the...
How complex should network models be?
๐จ In our latest paper we quantify (if and) when higher-order interactions are informative versus reducible to pairwise structure without losing functional signal (e.g., diffusion behavior).
๐ www.nature.com/articles/s41...
1/
What if animals emerged by installing a new biological operating system that repurposed what already existed, much like the rise of the smartphone? Here's our new paper in @embojournal.org @ibe-barcelona.bsky.social @melisupf.bsky.social @sfiscience.bsky.social link.springer.com/article/10.1...
We keep saying: "AI will handle the boring stuff, and humans will supervise." But the problem is--as AI reliability improves, it becomes really hard to motivate a human to conscientiously monitor it.
In a new WP with Gerard Cachon, we describe the "human-AI contracting paradox."
๐ Measuring Intrinsic Dimension of Token Embeddings (2025) arxiv.org/abs/2503.02142
๐ Do We Really Need All Those Dimensions?
An Intrinsic Evaluation Framework for Compressed Embeddings (EMNLP 2025) aclanthology.org/anthology-fi...
A new, long paper on evolution - natural induction - split into 2:
royalsocietypublishing.org/rsfs/article...
royalsocietypublishing.org/rsfs/article...
@RichardWatson90 and Tim Lewens
I know some of you have strong views on LLMs and might not agree with me on this, but if you genuinely value diversity in academia, e.g., in welcoming neurodivergent researchers and non-native speakers, then I think you should acknowledge the positive influence AI can have in fostering inclusivity.
Happy to share my new paper w/ @cgershen.bsky.social, just published at @royalsocietypublishing.org Interface!
Open Access๐: royalsocietypublishing.org/rsif/article...
Instead of proposing a new theory, we offer a synthesis in theoretical biology. Want to know more? Read the full thread./1 ๐๐งต
Benchmarks from historians show that AI transcription from handwriting is now better than human, and a very cheap model is as good as people.
There are now massive troves of documents that could be made available for research that would have been impossible or prohibitive to transcribe before.
๐จ What if evolution is the โlawโโฆ and networks are the machines that do the work?
In this paper (just published) I try to formalize how living systems are non-equilibrium, information-processing, adaptive matter. With a great biological flavor! ๐งช๐๐๐งฌ๐ฆ
๐ iopscience.iop.org/article/10.1...
๐งต 1/
Centuries of ontological dualisms (even recently permeating the literature) have muddied goal-directedness as something mystical. ๐ฎ
Itโs time to naturalize this concept and unpack its relationship to agency!/1
#complexitycat ๐ผ
www.complexitycat.org/posts/goal-d...