Our paper is now out in Hormones and Behavior.
We found little evidence for group differences in 2D:4D ratios by sexual orientation when jointly modeling publication bias and heterogeneity. w/ @fbartos.bsky.social, Ben Jones & @tvpollet.bsky.social
www.sciencedirect.com/science/arti...
🧵1/9
Posts by František Bartoš
Interested in finding out more about @jaspstats.bsky.social? The Psych Methods Group at the University of Amsterdam is running four hands-on workshops (in person or online) this summer (@ejwagenmakers.bsky.social, @fbartos.bsky.social). More information at jasp-stats.org/workshops/ but to sum up:
Everything is ready for the Perspectives on Scientific Error conference that starts tomorrow in Leiden! I look forward to hanging out with the mix of metascientists, philosophers of science, and statisticians! So many old friends will be there (and hopefully some new ones)! #PSE8
Come to Amsterdam or join online for the full week of JASP workshops (24th-28th of August)! If you can't do the full week or you are only interested in meta-analysis, I will be giving the Meta-Analysis workshop on 25th of August.
jasp-stats.org/2026/02/05/h...
Diagram showing four phases of methodological research (Theory, Exploration, Systematic Comparison, Evidence Synthesis) with an arrow indicating that preregistration usefulness increases from early to late phases. Each phase lists its aim, elements, outcome, and an example from factor retention research.
Does it make sense to preregister simulation studies?
This question has sparked a lot of debate.
▶️We* work through the why, when, and how
▶️We discuss different phases of methodological research to clarify where preregistration might (or might not) add value
📝 Preprint: doi.org/10.31234/osf...
Does this mean that AI/LLMs do not help in education? I personally don't think so. I use AI every day and find it incredibly useful. It would be odd if it didn't help with learning at all. However, the current empirical base does not substantiate strong claims.
Re-analysis at the meta-analysis level further highlights the issue of publication bias: extremely overstated evidence (left) and inflated mean effect size estimates (middle), driven by a large degree of publication bias (right).
We explored several moderators and compared results of studies published before and after 2023 (to assess older AI systems and modern LLMs) but we did not find any meaningful difference.
Publication bias-adjusted estimates decrease the average effect from d = 0.63 to d = 0.20. More importantly, the between-study heterogeneity is so large that the distribution of true effects ranges from -1.52 to 1.91! With variance that extreme, the mean alone is close to meaningless.
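For the curious, the quoted range is roughly what an approximate 95% prediction interval gives. A minimal sketch, with the between-study SD (tau) back-derived from the numbers in the post rather than taken from the paper (so it is an assumption, and the uncertainty in the mean itself is ignored):

```python
# Approximate 95% prediction interval for a random-effects meta-analysis:
# mu +/- z * tau (ignoring the standard error of mu for simplicity).
mu = 0.20                      # publication bias-adjusted mean effect (d)
lo, hi = -1.52, 1.91           # range of true effects quoted in the post
tau = (hi - lo) / (2 * 1.96)   # implied between-study SD, roughly 0.87

def prediction_interval(mu, tau, z=1.96):
    """Interval expected to contain the true effect of a new study."""
    return (mu - z * tau, mu + z * tau)

print(prediction_interval(mu, tau))
```

With tau near 0.9 on Cohen's d scale, the interval swallows everything from strong harm to strong benefit, which is exactly why the mean alone says little here.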
We managed to collect 1,840 effect size estimates from 67 meta-analyses. The distribution of study-level effect sizes shows both a notable skew (funnel plot on the left) and clear selection for positive effects (z-curve plots on the right).
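As a toy illustration of how selection for significant positive results distorts a literature (simulated numbers; nothing here uses the actual dataset):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy simulation: many small studies of a modest true effect, but a study
# is "published" only if its result is positive and significant (z > 1.96).
true_d, n_per_arm, n_studies = 0.20, 30, 5000
se = np.sqrt(2 / n_per_arm)                    # approx. SE of Cohen's d
d_hat = rng.normal(true_d, se, n_studies)      # observed effect sizes
published = d_hat / se > 1.96                  # selection for significance

print(f"true effect:       {true_d:.2f}")
print(f"naive pooled mean: {d_hat[published].mean():.2f}")  # inflated
```

Even with these made-up settings, the naive mean of the "published" studies lands several times above the true effect, the same qualitative pattern the funnel and z-curve plots reveal.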
We recently criticized one meta-analysis on the effect of ChatGPT on learning for failing to adjust for publication bias (bsky.app/profile/fbar...). In a response, the original authors argued that many other meta-analyses find the same effects. So we examined them all.
We just posted a preprint with a comprehensive meta-meta-analysis of the effects of AI/LLMs on learning.
TLDR:
- 1,840 effect sizes
- extreme between-study heterogeneity
- extreme publication bias
- small average effects (three times lower than usually reported)
(osf.io/preprints/ps...)
Edgeworth proposed the alpha = .005 criterion 133 years prior to Benjamin et al. (2018). :-)
www.bayesianspectacles.org/redefine-sta...
The world's earliest recorded outlier? Let me know if you have an even older example!
www.jasp-services.com/jasp-for-qua...
Surprisingly never in the case of publication bias tests :D
"we did not find any evidence for publication bias (p=0.077)"
This is also likely to be the last update of this version of the package. Next year, I will introduce breaking changes to the interface with the 4.0 major release, which will make it much more similar to metafor's.
As such, it provides easy-to-apply, state-of-the-art Bayesian methodology for most meta-analytic settings!
See an overview of the current functionality, with a brief description of all vignettes, at fbartos.github.io/RoBMA/articl...
The Robust Bayesian Meta-Analysis package got updated with additional vignettes explaining how to perform Bayesian model-averaged publication bias-adjusted
- multilevel meta-analysis (cran.r-project.org/web/packages...)
- multilevel meta-regression (cran.r-project.org/web/packages...)
I recently bought 20 bags of potatoes that, according to the Albert Heijn supermarket, should each contain 1 kg. This turns out to be *false*.
www.jasp-services.com/at-the-alber...
Yep, it's ridiculous. Those studies should not be published...
Extracting the study-level data from existing meta-analyses is quite feasible, so there is almost no excuse not to do it.
Also, you cannot really evaluate between-study heterogeneity otherwise; see, e.g., our latest study-level meta-meta-analysis, which shows the limitations of the previous meta-analysis-level one: doi.org/10.31234/osf...
My main worry is that they might have synthesized the meta-analytic estimates rather than the study-level estimates? The manuscript wasn't super clear on that and the OSF had only meta-analysis level data?
If so, that makes the publication bias adjustment ineffective...
The suspense is building: do the measurements of 20 units indicate that the Albert Heijn underfills its 1 kg bags of potatoes? An interim post on the importance of articulating your predictions *before* seeing the results. :-)
www.jasp-services.com/do-the-1kg-a...
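For readers wondering what "articulating your predictions first" buys you statistically: committing in advance to the direction (underfilling) is what licenses a one-sided test. A minimal sketch with hypothetical weights, not the actual measurements from the blog post:

```python
import numpy as np
from scipy import stats

# Hypothetical weights in grams for 20 bags -- NOT the actual measurements.
weights = np.array([992, 1001, 987, 995, 1003, 990, 998, 985, 1002, 994,
                    996, 989, 1000, 991, 997, 988, 999, 993, 986, 1004])

# The prediction "the supermarket *underfills* the 1 kg bags" was stated in
# advance, so a directional (one-sided) test is justified; picking the
# direction after seeing the data would inflate the false-positive rate.
t, p_one_sided = stats.ttest_1samp(weights, popmean=1000, alternative="less")
print(f"t = {t:.2f}, one-sided p = {p_one_sided:.4f}")
```
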
This week's blog post features "raincloud plots", a relatively recent development in data visualization.
Will the raincloud plot gradually replace the box plot? It just might!
Check out the raincloud plot for the planets in our solar system at
www.jasp-services.com/jasp-for-qua...
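For anyone who wants to build one outside JASP: a raincloud combines three views of the same sample, a half-violin (the "cloud"), a box plot, and the jittered raw points (the "rain"). A rough matplotlib sketch on toy data (not the planets):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")                  # render off-screen
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
data = rng.lognormal(0, 0.5, 100)      # toy skewed sample

fig, ax = plt.subplots()

# cloud: a violin clipped to its left half
parts = ax.violinplot(data, positions=[0], showextrema=False)
for body in parts["bodies"]:
    verts = body.get_paths()[0].vertices
    verts[:, 0] = np.clip(verts[:, 0], -np.inf, 0)   # keep left half only

# box: a slim box plot next to the cloud
ax.boxplot(data, positions=[0.1], widths=0.08)

# rain: the raw observations, jittered horizontally
ax.scatter(0.25 + rng.uniform(-0.04, 0.04, data.size), data, s=8, alpha=0.5)

fig.savefig("raincloud.png")
```

Unlike a bare box plot, the cloud shows the full shape of the distribution and the rain shows every observation, so skew, gaps, and outliers stay visible.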
Also, this should not be a reason to stop exercising.
1) There are other benefits of exercise
2) Some populations/exercises show benefit
3) There might be wider effects on cognition; however, the literature is too heterogeneous and contaminated with publication bias to be certain
I think that the field needs to clean up the published literature a bit. Additional small studies are not going to move the needle at this point; maybe a couple of large-scale, pre-registered studies might provide more insight?
We also re-analyzed all of the original meta-analyses individually. Many of them are consistent with publication bias: both the evidence for the pooled effects and their magnitude decrease once publication bias is adjusted for.
We ran subgroup analyses for each outcome/population/intervention. We found that most results are too heterogeneous to tell (see the wide prediction intervals), but some interventions seem promising and some have substantial evidence against them. See the figures for each outcome.
First, we found notable publication bias, especially in studies on general cognition and executive function. Importantly, there was extreme between-study heterogeneity (tau ~ 0.3-0.6!). This means that the results were consistent with both large benefits and large harms.