Is there a reason that risk difference isn't used more for time to event outcomes?
For example, estimate the cumulative risk at 5 years in each group from the fitted hazards, then take the difference (e.g., using a Royston-Parmar model)
#CausalSky #StatsSky
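A minimal numeric sketch of the idea (Python, with made-up hazard rates; a real analysis would fit a flexible parametric model like Royston-Parmar, e.g. R's rstpm2 — constant hazards are assumed here purely for illustration):

```python
import numpy as np

# Hypothetical constant hazard rates per year, standing in for
# fitted hazards from a flexible parametric survival model.
lam_control = 0.10
lam_treated = 0.06
t = 5.0  # evaluate risk at 5 years

def risk(lam, t):
    # Under a constant hazard, P(event by time t) = 1 - exp(-lambda * t)
    return 1 - np.exp(-lam * t)

rd_5yr = risk(lam_treated, t) - risk(lam_control, t)
print(round(rd_5yr, 3))  # → -0.134: treated group has lower 5-year risk
```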
Posts by Ryan Batten, PhD(c)
Draw a DAG to uncover your assumptions from the trenchcoat.
Oh nice!
Fantastic initiative! Especially the search function
It'd be neat to compare the assumptions of people trying to answer the same question, to see how much consensus there is among domain experts
A very nice initiative where you can post your DAG: opencausal.org
And because they're machine readable, they're much easier to search.
I rounded up a few Claude Skills for #RStats users.
Huge thanks to the creators who developed them. They share Skills for everything from tidyverse code to brand.yml files to learning while using AI.
Hope the list is useful, and please let me know what I missed!
rworks.dev/posts/claude...
No amount of statistical gymnastics will save data that doesn't support what you're trying to do
I'd agree, because what the unmeasured confounder "does" to the results depends on its relationship with the other variables.
It's quite possible there's an unmeasured confounder, but it doesn't matter, because in the adjustment set all backdoor paths are blocked anyway
This is great.
I've given an example in class showing how priors can make some unidentified problems identifiable:
a+b=6, solve for b.
Prior: a=0~4
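A quick simulation sketch of why the prior identifies b (Python, illustrative): a + b = 6 alone admits infinitely many solutions, but a ~ Uniform(0, 4) pins b to (2, 6):

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.uniform(0, 4, size=100_000)  # prior: a ~ Uniform(0, 4)
b = 6 - a                            # the constraint a + b = 6
# Without the prior, b is unidentified; with it, b ~ Uniform(2, 6).
print(round(b.min(), 2), round(b.max(), 2))
```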
"the House of Mum and Dad" reads like Game of Thrones
Hadn't heard of the CACE estimand before!
After quick search, looks interesting. Excited to learn more about it. Thanks!
Also haven't heard about ITT as a way to describe missing data, need to learn more about that
That's a fair point about adherence. To me, it could still be useful, just estimating something different from the original intervention (especially with 20-30% non-adherence)
A+ Tom Platz reference, wasn't expecting that here
Ah interesting
I'd always thought of ITT as mostly helpful for preserving randomization (and its benefits), with per-protocol showing the effect of the intervention itself (rather than of the act of randomizing; that of course needs additional methods).
Thanks for sharing your perspective!
Admittedly, it was a made-up example, trying to highlight the issue of non-adherence
The more I learn about stats, the more I use these three things:
1/ Plots - a picture is worth a thousand words
2/ Probability can almost always guide you
3/ Simulation - when in doubt, simulate it out
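In that spirit, a tiny "simulate it out" sketch (Python, illustrative): estimating the classic birthday-collision probability by Monte Carlo instead of working through the combinatorics:

```python
import numpy as np

rng = np.random.default_rng(42)
n_sims = 50_000
# P(at least two of 23 people share a birthday), by brute-force simulation:
# draw 23 birthdays; a collision means fewer than 23 distinct days.
hits = sum(len(set(rng.integers(0, 365, size=23))) < 23 for _ in range(n_sims))
print(round(hits / n_sims, 2))  # close to the exact answer, ~0.507
```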
Yes! Specifically DAGs: "what about the possibility you forgot a variable?"
(That said, I still think it's useful for design benefits, but analyses should also include per-protocol effects, with appropriate methods)
I always find the ITT effect interpretation odd. For example: "does weightlifting cause strength increases?"
I would be interested in the effect of weightlifting, not if I intended to lift weights (but never did)
For example, say we have propensity scores for both groups. However, there is a lack of overlap.
We decide to focus on the area where there is overlap.
We do this by applying overlap weights.
The population these results apply to would be the overlap population!
2/2
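A minimal sketch of the overlap-weighting step described above (Python, with hypothetical propensity scores): treated units get weight 1 - e(x), controls get e(x), so units near e(x) = 0.5, where the groups overlap most, count the most:

```python
import numpy as np

def overlap_weights(treated, ps):
    """Overlap weights from propensity scores e(x):
    treated -> 1 - e(x), control -> e(x)."""
    treated = np.asarray(treated, dtype=bool)
    ps = np.asarray(ps, dtype=float)
    return np.where(treated, 1 - ps, ps)

# Hypothetical scores: units at e(x) = 0.5 (good overlap) get weight 0.5,
# while the extreme scores (poor overlap) are down-weighted to 0.05.
w = overlap_weights([1, 1, 0, 0], [0.95, 0.50, 0.50, 0.05])
```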
Average treatment effect in the overlap can be a tricky causal estimand. Why?
The ATO is a little different than other estimands.
Often, it's not well defined before the analysis.
This is because there are many ways to define the population.
Instead, it's based on the statistical method.
1/2
The third installment of the "how should we actually construct our causal graphs anyway" series is out now!
Nick & I ask the question: can we just get an LLM to tell us what belongs on the graph?
A few papers I think worth reading. Mostly open access.
Causal inference is hard:
www.nature.com/articles/s41...
The more obscure a statistical analysis method, the more I question the design.
Not saying it's wrong, but I'd have questions why a more "common" approach wasn't used.
Bootstrapping is sort of a semi-Bayesian approach when you think about it
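One way to see the connection (a Python sketch, not from the original post): Rubin's Bayesian bootstrap replaces with-replacement resampling by Dirichlet(1, ..., 1) weights over the observations, i.e., a posterior over the empirical distribution:

```python
import numpy as np

rng = np.random.default_rng(7)
x = rng.normal(10, 2, size=200)  # some observed data

# Bayesian bootstrap: each "posterior draw" of the mean is a
# Dirichlet-weighted average of the data, instead of a resample.
w = rng.dirichlet(np.ones(len(x)), size=4000)
post_means = w @ x  # 4000 posterior draws of the mean

print(round(post_means.mean(), 1), round(x.mean(), 1))  # nearly identical
```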
Calling bullshit - a skill that every applied statistician should master. Unfortunately, many of the younger statisticians I've worked with sometimes lack the bravery to do so. The book looks like a must-have. #Statistics #StatsSky @carlbergstrom.com @carlzimmer.bsky.social
A common critique of Bayesian methods is that priors are arbitrary. I think that's a good thing. It's an assumption, like much of science.
Better to be explicit about assumptions (i.e., DAGs, priors, etc) than implicit
ggplot2 is like electricity. I don't need it to survive, but I much prefer it