Advertisement · 728 × 90

Posts by Calle Börstell

Screenshot of grid of some of my quarto extensions in a grid layout

Screenshot of grid of some of my quarto extensions in a grid layout

If you are doing any #quarto slidecrafting, i have been trying really hard to keep this page updated with everything I have done
emilhvitfeldt.com/project/slid...

3 days ago 66 20 2 0
Preview
ChatGPT’s latest stylistic quirk is sinister, infuriating – and absolutely everywhere | Stuart Heritage Once you start noticing “it’s not X, it’s Y” as you scroll online, you can’t fail to register it. I’ve become so hypervigilant that it has seeped into my subconscious thoughts

In 2019, I defended my PhD on 'It's not X, it's Y' and similar constructions, which I call contrastive negation constructions. Never in my wildest dreams would I have thought that 'It's not X, it's Y' would become the topic of comment pieces in newspapers.
🧵

1 week ago 100 36 6 8
The caption “Neil banging out the tunes April 13, 2006” under a photo of a rat playing on a tiny plastic piano

The caption “Neil banging out the tunes April 13, 2006” under a photo of a rat playing on a tiny plastic piano

And a very happy 20 years of Neil banging out the tunes to all who observe

1 week ago 1103 350 5 7
A table top showing 10 linguistics books facing up

A table top showing 10 linguistics books facing up

The campus bookstore is changing owners, hence putting their entire stock on sale. I got this haul of #linguistics book for 18€ (original price was 500€).

1 week ago 23 0 0 0
DTW correctly predicted the left gesture pair to be similar (lower distance) and the right pair to be dissimilar (higher distance)

DTW correctly predicted the left gesture pair to be similar (lower distance) and the right pair to be dissimilar (higher distance)

💡New publication💡
In our new paper in Behavior Research Methods, we validated and demonstrated the utility of dynamic time warping (DTW) as an efficient, continuous measure of gesture form similarity.

Read the full (open access!) paper here: 🔗 link.springer.com/article/10.3...

1 week ago 12 3 1 0

📦 My first #RStats package on CRAN:

{readelan}

A package dedicated to reading all files associated with ELAN: eaf, etf, ecv. Reads annotations, metadata, controlled vocabularies. Relevant for many in #linguistics perhaps?

More info here:
borstell.github.io/misc/readelan/

2 weeks ago 57 21 5 0

🥳 #RStats

2 weeks ago 31 0 1 0
Advertisement

I love a progress bar
|==========------|

2 weeks ago 5 0 0 0
Video

I love how little code is needed to make this from a video, using MediaPipe pose estimation into #ggplot2 #ggforce and #gganimate

2 weeks ago 10 0 1 0

Allosaurus

2 weeks ago 1 0 0 0
Batman breaking a firearm and saying, "This is the journal of the enemy. We do not need it. We will not use it."

Batman breaking a firearm and saying, "This is the journal of the enemy. We do not need it. We will not use it."

how I feel when open science initiatives choose to publish in Nature

2 weeks ago 232 40 4 4

It seems something may be wrong on the CRAN side, as incoming doesn't show any updates and there are others responding who also haven't received an email confirmation

3 weeks ago 1 0 0 0

This is the one I'm waiting for, it's been >1 hour 😭

3 weeks ago 2 0 2 0

Also, this is my very first CRAN submission ever, please clap (and pray)! 🙏

3 weeks ago 14 0 2 1

OK so how long should one wait for a CRAN submission email confirmation before contacting their admin? #RStats

Submitted with devtools::submit_cran() like 30 mins ago.

3 weeks ago 6 0 5 0
Advertisement

It looks vaguely Sinhala, and at least a few of the signs seem to match up somewhat to translations I could find, but I have no experience with what the handwriting looks like.

3 weeks ago 2 0 1 0
Post image

New Paper with Tongpan Rabo Fwangwar, titled "Iconicity and expressiveness of grammatical tones targeting ideophones in Mwaghavul".

All tonally derived verbs in Mwaghavul have ideophonic bases and are derived through one of two lexically determined alternations.

doi.org/10.1007/s110...

3 weeks ago 13 6 1 0

Why is working with XML in #RStats so hard 😭

3 weeks ago 5 1 2 0

Dance like someone's watching

4 weeks ago 4 1 0 0
A circle formed by many small tiles (as pixels) in the gold to orange color spectrum, resembling a sun, with the Pi symbol vaguely visible at the center.

A circle formed by many small tiles (as pixels) in the gold to orange color spectrum, resembling a sun, with the Pi symbol vaguely visible at the center.

1 million decimals of pi
#TidyTuesday

Polar coordinates of each digit as ordered tiles, digit value corresponding to color 🟡🌞
#R4DS #DataViz #ggplot2

Code: github.com/borstell/tid...

4 weeks ago 12 0 0 0

Sharing a PhD opportunity at Stockholm University for those interested in Romance languages and sociolinguistics. Great fit for scholars working at the intersection of linguistics and social analysis. #AcademicJobs #sociology

linguistlist.org/issues/37/1161/?utm_source=dlv...

1 month ago 6 8 0 0
A split collage image showing a bridge in front of a lake and mountain background, the right image showing a zoomed-in version of a sign on the bridge reading "fartsløp"

A split collage image showing a bridge in front of a lake and mountain background, the right image showing a zoomed-in version of a sign on the bridge reading "fartsløp"

Welcome to Bergen 🇳🇴

1 month ago 9 1 2 1
A word cloud of terms from paper titles for the LREC sign language workshop. The most prominent words are "sign", "language", "corpus" and "data".

A word cloud of terms from paper titles for the LREC sign language workshop. The most prominent words are "sign", "language", "corpus" and "data".

One of my favorite #linguistics conferences, the LREC sign language workshop, has just released their list of accepted papers to the 2026 edition:
www.sign-lang.uni-hamburg.de/lrec2026/pro...

Image below shows the most frequent words in the paper titles.

1 month ago 5 2 0 0
Data Organization in Spreadsheets
Karl W. Broman
& Kara H. Woo
Pages 2-10 | Received 01 Jun 2017, Accepted author version posted online: 29 Sep 2017, Published online: 24 Apr 2018

    1. Introduction
    2. Be Consistent
    3. Choose Good Names for Things
    4. Write Dates as YYYY-MM-DD
    5. No Empty Cells
    6. Put Just One Thing in a Cell
    7. Make it a Rectangle
    8. Create a Data Dictionary
    9. No Calculations in the Raw Data Files
    10. Do Not Use Font Color or Highlighting as Data
    11. Make Backups
    12. Use Data Validation to Avoid Errors
    13. Save the Data in Plain Text Files

ABSTRACT

Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this article offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses. The basic principles are: be consistent, write dates like YYYY-MM-DD, do not leave any cells empty, put just one thing in a cell, organize the data as a single rectangle (with subjects as rows and variables as columns, and with a single header row), create a data dictionary, do not include calculations in the raw data files, do not use font color or highlighting as data, choose good names for things, make backups, use data validation to avoid data entry errors, and save the data in plain text files.

Data Organization in Spreadsheets Karl W. Broman & Kara H. Woo Pages 2-10 | Received 01 Jun 2017, Accepted author version posted online: 29 Sep 2017, Published online: 24 Apr 2018 1. Introduction 2. Be Consistent 3. Choose Good Names for Things 4. Write Dates as YYYY-MM-DD 5. No Empty Cells 6. Put Just One Thing in a Cell 7. Make it a Rectangle 8. Create a Data Dictionary 9. No Calculations in the Raw Data Files 10. Do Not Use Font Color or Highlighting as Data 11. Make Backups 12. Use Data Validation to Avoid Errors 13. Save the Data in Plain Text Files ABSTRACT Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this article offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses. The basic principles are: be consistent, write dates like YYYY-MM-DD, do not leave any cells empty, put just one thing in a cell, organize the data as a single rectangle (with subjects as rows and variables as columns, and with a single header row), create a data dictionary, do not include calculations in the raw data files, do not use font color or highlighting as data, choose good names for things, make backups, use data validation to avoid data entry errors, and save the data in plain text files.

Every day is a good day for sharing one of the most useful papers about research data ever written. PLEASE get your people to understand and follow this advice.

www.tandfonline.com/doi/full/10....

1 month ago 1050 402 31 47

The fee to publish an open access paper at Trends in Cognitive Science is now over $7,000. Seriously, @elsevierconnect.bsky.social ??? For a 4,000 word piece? Talk about a broken system.

1 month ago 43 14 3 0
Advertisement

Whoever designed the recommendation letter forms that make the assumption that I categorize my students into percentiles of how they rank among all students I've ever taught:

I rank you in the bottom 1% of all survey designers.

1 month ago 29 2 1 0
A four-panel plot each showing a black line moving in a more and more squiggly  manner as the line expands

A four-panel plot each showing a black line moving in a more and more squiggly manner as the line expands

Random walk #Rtistry with #ggplot2

1 month ago 11 1 0 0

Computational Papers (CPs) are always Annoying To Read (ATR) because of their Extensive Use of Abbreviations (EUA). After Some Time (AST) reading a CP, there are No Words (NW) left TR due to the EUA, and I Almost Give Up On Even Trying (IAGUOET) TR the CP because it is so ATR any text with EUA.

1 month ago 7 1 0 0
A black background with white/gray text with the title "Likelihood of likelihood phrases: As judged by 5000+ people in Kucharski (2026)". The proportional fill of each word corresponds to its median %-likelihood, from "Will Happen" and "Almost Certain" which are completely filled, to "Remote Chance", "Highly Unlikely" And "Almost No Chance" which are all almost completely unfilled. Data: Kucharski (2026); Packages: {tidyverse, marquee}; Visualization: C. Börstell.

A black background with white/gray text with the title "Likelihood of likelihood phrases: As judged by 5000+ people in Kucharski (2026)". The proportional fill of each word corresponds to its median %-likelihood, from "Will Happen" and "Almost Certain" which are completely filled, to "Remote Chance", "Highly Unlikely" And "Almost No Chance" which are all almost completely unfilled. Data: Kucharski (2026); Packages: {tidyverse, marquee}; Visualization: C. Börstell.

Going very minimal for this week's #TidyTuesday looking at judged likelihood of likelihood-expressing phrases in English.

Code: github.com/borstell/tid...

#R4DS #ggplot2 #DataViz

1 month ago 21 2 2 0
1 month ago 168 43 2 1