Posts by Dan
Statistical Rethinking 2026 is done: 20 new lectures emphasizing logical and critical statistical workflow, from basics of probability theory to causal inference to reliable computation to sensitivity. It's all free, made just for you. Lecture list and links: github.com/rmcelreath/s...
Finally made the time to finish all of reproducible-data-science.dev
It's great, and you should read it.
There's certainly more available to be adopted than might be right for you / your team, but the layout is very clean and the topics get good treatment.
(Use rix! It's so useful.)
#rstats
I saw this when it was a preprint? They have a unique version of the estimand-estimator-estimates workflow, will be useful for many biologists, not just ecologists. We need more case studies for ecologists to emulate and for norms to shift so "regression and storytelling" becomes unpublishable.
Fisheries Catch Forecasting by Elizabeth Holmes
#RStats
bigbookofr.com/chapters/time%20series%2...
Screenshot of webR-based Android application demonstrating inline graphics within the console window instead of having graphics being outsourced to the "Plots" tab.
Graphics are... inline!??
The cover of Hatchet by Gary Paulsen, which has a young dude who IMHO looks like Luigi and superimposed over him are the outline of a howling wolf and a hatchet and also a crashing plane
This is super neat
#databs
sqlit is hands down the best SQL database TUI I've encountered yet!
Blows harlequin out of the water!
github.com/Maxteabag/sq... #databs
Come learn about deep learning with us! The Data Science Learning Community just started reading the free third edition of Deep Learning with Python. Meetings are Tuesday at 3 pm CST.
DSLC: dslc.io
Book: deeplearningwithpython.io
#rstats #pydata
One of my aims at the time was to make sure we can show this relationship in simulated data, inspired by
@rmcelreath.bsky.social's approach to statistics. Statistical Rethinking is a life-changing book, this paper wouldn't be the way it is now without it. Very grateful to McElreath!
As the US economy screams ever higher on mostly AI bets, I feel obligated to put a marker down here and emphasize the business value of machine learning in the hope that this future AI winter will miss ML practitioners and researchers. AI winter is coming again when the bubble pops
Hudson Bay polar bears have nowhere to go (no ice of significance) & it's still much warmer than normal. When I started studying the bears here in 1984, they were back on the ice weeks earlier. At 1 kg mass loss/day, the bears would welcome some frigid weather.
“Tens or hundreds of millions of dollars of taxpayer-funded NASA property and laboratories are at risk of either being discarded, mishandled, or out-of-commission for significant time periods.” 🧪
www.gesta-goddard.org/blog/gestas-...
Screenshot of title slide that reads "Efficient File Management in R with {fs}". Image of Jadey Ryan's cat hex logo with a witch hat joined by an orange heart with the fs hex sticker.
Messy folders haunting your R projects? 👻
This Wednesday Oct 29 at 4:00 PM PDT, I'll lead a workshop on Efficient File Management in R with {fs} hosted by @r-ladies-stl.bsky.social. Let's clean and organize a spooky messy folder together!
Register at www.meetup.com/rladies-st-l...
#RStats #DataBS
You should read this wonderful little history of the #tidyverse, by @hadley.nz.
It reminded me about my early #rstats days as a PhD student (2011 - 2016), where I was constantly trying out the new things Hadley and crew were cooking up.
hadley.github.io/25-tidyverse...
A #Slurm user just confirmed that "yay it works. Pretty sick!"
Thanks to excellent feedback from several users, it'll soon be even easier to distribute #rstats code via #HPC job schedulers using future.batchtools
#parallel #futureverse
AI is powerful, but it's no free lunch - and again, it's no substitute for YOUR expertise.
R and QGIS, name a better combo
#databs
“I used R the statistical programming language to analyse each of the 3-hourly netCDFs – a file format for storing multidimensional scientific data – and create a geoJSON file where the data was greater than 35C. These files were then loaded into Qgis and styled….” - @sdbernard.bsky.social #RStats
Data science junkies, get ready! "The Test Set" #podcast trailer is here for your viewing pleasure.
Tune in July 1st and every Tuesday after for new episodes with hosts @mchow.com, @hadley.nz, and @wesmckinney.com as they welcome thought leaders in #DataScience.
Subscribe now: pos.it/thetestset
Bleeding edge update for the #tidyverse purrr package with even more seamless #rstats parallel maps.
Introducing our shiniest new adverb: `in_parallel()`. Just wrap your function to take advantage of blazing fast parallel processing via mirai.
pak::pak("tidyverse/purrr")
purrr.tidyverse.org/dev/
Being able to productionize an ML model is often the goal; however, there are many things to keep track of when you do. The orbital package lets you translate your fitted scikit-learn or tidymodels model into SQL that, when run, produces predictions.
posit.co/blog/databri... #python #rstats
Claude 4 is pretty impressive
Screenshot of the Declaration Requests in Process table
How reliable are LLMs at extracting data from pdfs? Inspired by @simonwillison.net's PyCon talk, I added extracting FEMA's daily operation briefing to my LLM evals suite.
Just one model extracted the data from the pdf correctly: Gemini 2.5 Pro Preview. Full results -> kschaul.com/llm-evals/ev...
☕ Coffee and Coding ☕
Do you have an interesting piece of code/work to showcase, an opportunity for collaboration or a code dilemma you would like help with?
We would love to hear from you at Coffee and Coding – join the NHS-R Community Slack for more info (postcard.nhsrcommunity.com)!
#rstats
Statistical Rethinking with brms, ggplot2, and the tidyverse Second edition by A Solomon Kurz
#RStats
bigbookofr.com/chapters/statistics.html
Trying something new:
A 🧵 on a topic I find many students struggle with: "why do their 📊 look more professional than my 📊?"
It's *lots* of tiny decisions that aren't the defaults in many libraries, so let's break down 1 simple graph by @jburnmurdoch.bsky.social
www.ft.com/content/73a1...