The call for papers for the East Coast Organization of Language Testers (ECOLT) 2026 is now live, and we are proud to be hosting it at Georgia State University on November 6, 2026. Check out the call and join us here in Atlanta later this year!
sites.google.com/view/ecolt
Posts by Dylan Burton
Language Testing Volume 43 Issue 2 is now available! In this issue: 2 original articles, 2 systematic reviews, 1 brief report, and 2 book reviews. Check out our spotlight on the systematic reviews in this thread ⬇️. Full issue is available here: journals.sagepub.com/toc/ltja/43/2
Journal of Psycholinguistic Research is doing some major shaping up; Brysbaert and Folk outline new editorial guidelines to promote robust, reproducible quantitative research with strong conceptual foundations:
link.springer.com/article/10.1...
Check out my new article! Here I look at nonverbal behaviors that listeners mention most often when listening to L2 speech and how they perceive their function in communication.
onlinelibrary.wiley.com/doi/10.1002/...
Check out Lynda Taylor's piece in #LanguageTesting on what it means to be a member of a professional association in our field. doi.org/10.1177/0265... (1/2)
This pioneering Virtual Special Issue by Salomé Villa Larenas on the #GlobalSouth is consequential in bringing to light regional disparities in our journal and field and in shifting the focus to underrepresented contexts. doi.org/10.1177/0265... (1/4)
screenshot of my post
Big new blogpost!
My guide to data visualization, which includes a very long table of contents, tons of charts, and more.
--> Why data visualization matters and how to make charts more effective, clear, transparent, and sometimes, beautiful.
www.scientificdiscovery.dev/p/salonis-gu...
Do you know Zoom & other companies are encouraging discrimination against employees for things like "pausing too long" or not sounding sufficiently "charismatic"? Read all about our dystopian present in my new paper, out today in JASA!
(Seriously, read it. It's important)
doi.org/10.1121/10.0...
How do untrained listeners report using nonverbal behavior in their perceptions and judgements of L2 ability? I explore this topic and more in my latest article just out in The Modern Language Journal. Read more here: onlinelibrary.wiley.com/doi/10.1111/...
Congrats to guest editors Eunice Jang, Yasuyo Sawski & all authors on this blockbuster issue of #LanguageTesting on the AI revolution! It’s a hefty issue with 7 articles, an opening Editorial & an insightful post-script by @lukeharding.bsky.social. Now out here: journals.sagepub.com/toc/LTJ/curr...
We recruited a large number of layperson listeners to complete transcriptions (one utterance at a time) and comprehensibility ratings of archived Aptis responses and compared those listener-based measures to official scores. Methodologically this is one of my favorite projects!
Our new study on intelligibility and comprehensibility in the @britishcouncil.bsky.social Aptis General exam is out now in SiLA. Intelligibility distinguishes A1/above A1, but it's generally comprehensibility that is associated with higher levels of performance.
www.altaanz.org/7_sila_14_1_...
Sycophantic LLMs rob us of the chance to practice what makes us human: thought, revision, interaction, introspection. I’m much more worried about everyone losing those skills than I am about a Terminator-style end of the world.
Accurate.
#academia #academy #philsky #philosophysky #philosophy #science
Happy to announce that our article (w/ Sandra Götz & @pekkalintunen.bsky.social) exploring L1 & L2 speech #fluency in monologue + dialogue is now out in SSLA (OA)! Part of the recently completed #FDF2 project (funded by the Research Council of Finland). #FlowLangGroup @utu.fi doi.org/10.1017/S027...
ggplot2 4.0.0 is out and the new `paper`, `ink`, `accent` theme variables look super cool! Just pick 2-3 colors 🎨 to make your plots look great! I'm excited to hook this up to brand.yml 😉
Have One On Me compite con Vulnicura por disco de desamor ❤️
Models as Prediction Machines: How to Convert Confusing Coefficients into Clear Quantities Abstract Psychological researchers usually make sense of regression models by interpreting coefficient estimates directly. This works well enough for simple linear models, but is more challenging for more complex models with, for example, categorical variables, interactions, non-linearities, and hierarchical structures. Here, we introduce an alternative approach to making sense of statistical models. The central idea is to abstract away from the mechanics of estimation, and to treat models as “counterfactual prediction machines,” which are subsequently queried to estimate quantities and conduct tests that matter substantively. This workflow is model-agnostic; it can be applied in a consistent fashion to draw causal or descriptive inference from a wide range of models. We illustrate how to implement this workflow with the marginaleffects package, which supports over 100 different classes of models in R and Python, and present two worked examples. These examples show how the workflow can be applied across designs (e.g., observational study, randomized experiment) to answer different research questions (e.g., associations, causal effects, effect heterogeneity) while facing various challenges (e.g., controlling for confounders in a flexible manner, modelling ordinal outcomes, and interpreting non-linear models).
Figure illustrating model predictions. On the X-axis the predictor, annual gross income in Euro. On the Y-axis the outcome, predicted life satisfaction. A solid line marks the curve of predictions on which individual data points are marked as model-implied outcomes at incomes of interest. Comparing two such predictions gives us a comparison. We can also fit a tangent to the line of predictions, which illustrates the slope at any given point of the curve.
A figure illustrating various ways to include age as a predictor in a model. On the x-axis age (predictor), on the y-axis the outcome (model-implied importance of friends, including confidence intervals). Illustrated are 1. age as a categorical predictor, resultings in the predictions bouncing around a lot with wide confidence intervals 2. age as a linear predictor, which forces a straight line through the data points that has a very tight confidence band and 3. age splines, which lies somewhere in between as it smoothly follows the data but has more uncertainty than the straight line.
Ever stared at a table of regression coefficients & wondered what you're doing with your life?
Very excited to share this gentle introduction to another way of making sense of statistical models (w @vincentab.bsky.social)
Preprint: doi.org/10.31234/osf...
Website: j-rohrer.github.io/marginal-psy...
cover of the journal Studies in Second Language Acquisition
New issue of Studies in Second Language Acquisition now available
📚 cup.org/47eF6ql
#SSLA #appliedlinguistics #LangSky
While I'm dispensing computer advice, to any new graduate students:
Save all your article PDFs using the same template. I like LASTNAME YEAR SHORT TITLE/DESCRIPTION, like
Labov 1963 Marthas Vineyard
Save them all in just one giant folder, not piecemeal by class. Thank me when you're tenured.
The new Cracker Barrel logo is the visual version of a 50 year old saying skibidi.
{tinytable} 0.13.0 for #RStats can now draw interactive tables with sorting, filtering & pages!
And the 📦 is still an ultra-simple, powerful, and 0-dep way to create beautiful tables in basically any format: tex, typ, docx, pdf, png, md, etc
Gallery & Docs: vincentarelbundock.github.io/tinytable/
Papers often conclude "more research is needed" without explanation. This is a missed opportunity. You are the expert. This is your time to shine. Explain what the remaining uncertainties are, and give justified recommendations on what the research needed to resolve them should look like.
I've said it before, and I will say it again:
It's astonishing how emotionally aversive revisions are, even in the best possible case (reviewers are positive and have very actionable remarks).
Or at least that's how I work; if anybody has some good recommendations for coping, do let me know 😂
I shouldn't wade into this discourse, but LANGUAGE CHANGES and old people always hate what kids are doing. This is fine, especially if the author intends it to be euphemistic. Language policing always ends in systematic discrimination against marginalized people. We don't want to go down this road!
Now available in OnlineFirst, Shungo Suzuki and colleagues propose a new approach to weakness identification for L2 speaking skills in diagnostic language assessment (DLA), with an AI-driven DLA program that uses contextualized diagnostic feedback to improve lexical use. doi.org/10.1177/0265...
At this point, I might as well --
Here's an infographic showing different ways to include age as a predictor. The top shows two extremes, just as a plain old numerical predictor (imposes linear trajectory) vs. categorical predictor (imposes nothing whatsoever). And then three solutions in between!
Call for Special Issue abstracts is now open! The Language Testing Special Issue theme of 2027 is “Language Assessment and Policy”. Abstract submission deadline: October 1, 2025. journals.sagepub.com/page/ltj/spe...
Our study of English proficiency tests in US R1 university admissions circa late 2023 has been published in #LanguageTesting. Some highlights:
- TOEFL, IELTS, DET, PTE are most widely used
- Most institutions accept around 5 different tests
- Some mismatches in cut scores across tests
...
Plot that depicts the average importance people in my data assign to their friendships (y-axis, on a scale from 1 to 5, depicted with 95% confidence intervals) by their age (x-axis, from 18 to 60). Depicted are 3 different ways to model importance of friends as a function of age. Using age as a linear predictor: this imposes a linear trajectory which comes with very tight confidence intervals (i.e., uncertainty is low). Using age as a categorical predictor: this imposes no trajectory whatsoever but instead simply reproduces the means by age. The confidence intervals are very wide, in particular for those ages not well represented in the data (i.e., uncertainty is high). Age splines: This results in a smooth trajectory that follows some of the bumps in the data, but not all of them. The confidence intervals are somewhere between the linear and the categorical case (i.e., uncertainty is medium)
Let's say you want to include age as a predictor in your model. How do you do that?
Here's an illustration of three options -- it's for a paper I'm working on (so if you feel like anything could be tweaked...).