Roger Levy (@rplevy) Bsky

Building on Kyle's and Jenn's responses – it seems to me the analogy is: grammaticality is to BLiMP and SyntaxGym as truth is to COMPS and plausibility (albeit that's not binary) is to EWoK. So, to apply our framework to those datasets, perhaps one should swap truth/plausibility for grammaticality?

5 months ago 2 0 1 0

Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."

New work to appear @ TACL!

Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.

Yet they often assign higher probability to ungrammatical strings than to grammatical strings.

How can both things be true? 🧵👇

5 months ago 92 21 2 3

thinking of calling this "The Illusion Illusion"

(more examples below)

1 year ago 1580 386 60 91

Helps to be a linguist!

1 year ago 4 0 0 0

Been listening a lot to Ella Jenkins the last couple of days. What wonderful music and performance!

1 year ago 7 1 1 0

Within the past two weeks I deleted apps for media companies owned by 67% of the world’s richest billionaires, and it felt great!

1 year ago 33 2 1 0

Not quite sure what you mean by a “complete” corpus. I do think the basic philosophical assumptions of frequentist probability are applicable to corpora, using the large-numbers-of-native-speakers thought experiment.

And productivity is a property of the asymptotic distribution, if I’m getting you.

1 year ago 0 0 0 0

If there were enough native speakers of the language living at once, you’d quickly get enough instances of the prefix for relative frequency estimation of the next token distribution. Too few humans are alive for this in practice, but that’s not a problem for theoretical validity of the construct!

1 year ago 1 0 1 0

You might be interested in this paper we did some time ago!

escholarship.org/content/qt69...

It supports your conjecture that, insofar as we think the “true distribution” is a valid theoretical construct (which I consider a highly defensible position), large-N Cloze would not give it to us.

1 year ago 7 0 1 0

Results of high stakes elections that happen only once every four years offer remarkable opportunities for overfitting theories of the electorate

1 year ago 10 1 0 0

The book The Patterns of Comics by Neil Cohn

Interior page from The Patterns of Comics by Neil Cohn

Back cover of The Patterns of Comics by Neil Cohn

It's my book's release day! The Patterns of Comics is now officially published, featuring an extended data-driven analysis of the structures used in 350+ comics from Asia, Europe, and North America analyzing diversity, regularity, and change over time www.visuallanguagelab.com/poc

2 years ago 180 59 6 2

screenshot of title and authors of paper + map with 18 colorful box callouts showing where datasets came from

GOOD MORNING BLUESKY!
Very excited about this new paper:

www.pnas.org/doi/10.1073/pnas.2300671120

Key Q: what predicts how much young kids (👶)talk?

How much 🗣 kids heard predicted how much 👶talked, but other factors, e.g. mom’s education, didn’t. #PsychSci #DevPsy 🗣💬

INCOMING SUMMARY🧵ALERT 1/14

2 years ago 157 79 5 1

#linguistics Bluesky: what are the best available quantitative measures of dialect/language mutual intelligibility? The more fine-grained, the better: I'm hoping to vividly illustrate at least one specific dialect continuum (e.g., the Romance languages of the Mediterranean coast)

2 years ago 3 1 2 0

Today in linguists are NOT KIDDING when we say that your capacity for language enables you to understand sentences that have never before been uttered in human history.

2 years ago 534 189 3 4

New postdoc opportunity to work jointly with @cantlonlab.bsky.social and me to understand cognition across species, age, and culture! cmu.wd5.myworkdayjobs.com/CMU/job/Pitt...

2 years ago 33 28 0 1

Absolutely, the Nature EiC has it completely backwards. Checking for errors and quality of data (and of math, code, and argumentation) is the most important work that reviewers can do.

2 years ago 18 1 1 0

Screenshot of portion of article linked to in post, where Nature EiC says that checking underlying data is not the job of peer review.

The quotes from Nature EiC Magdalena Skipper about whether journals should be checking for errors/data quality as part of peer review are quite surprising to me.

www.wsj.com/science/whats-wrong-with...

2 years ago 78 35 14 8

“This significant effect was found using a post hoc weighting procedure aligned with our overarching hypothesis”?!?

2 years ago 1 0 0 0

Snow geese fill the sky at sunset in Washington's Skagit Valley.

I am delighted to announce that the Department of Biology at the University of Washington is advertising for a tenure-track assistant professor position on the quantitative understanding of collective behavior.

I will be chairing the search; details are here: apply.interfolio.com/130336

2 years ago 266 151 9 6

Glushko Dissertation Prize - Cognitive Science Society

I think it’s amazing that Cognitive Science gives recent PhDs $10K in UNRESTRICTED CASH…right when folks are broke, exhausted, moving town…and need it most.

It’s almost Glushko season!

cognitivesciencesociety.org/glushko-diss...

2 years ago 12 8 1 0

LIT Lab Our goal is to understand the relationship between language and human thought. How does the language network in the brain interact with other systems to interpret meaning in the world? Can models...

First post and big news - I am starting as an Assistant Professor in Psychology at Georgia Tech in Jan 2024!

www.language-intelligence-thought.net

2 years ago 67 11 4 3

In a new TiCS article, @emaliemcmahon.bsky.social and I review a growing body of behavioral, neural, and computational evidence that social interactions are automatically extracted by the human visual system:

tinyurl.com/nhh2dhxt

#PsychSciSky #NeuroSkyence

2 years ago 63 30 3 1

While the world has its eyes on the Middle East, democratic conditions in Indonesia are looking grim. The Supreme Court has overruled the Constitution in order to allow the sitting president's son to stand as Vice Presidential candidate with a disgraced general with a stained human rights record

2 years ago 29 9 1 3

LIU LAB In the Look, Infer, and Understand (LIU) Lab at Johns Hopkins University, we are interested in how our minds and brains reason about the physical and social world. We study the developmental and neu...

I am reading PhD applications this year, with a special interest in students who would like to work on the topic of perceived danger. But open to all applicants who share some of my interests. Visit www.liulaboratory.org to see papers, lab values, and tips for application writing.

2 years ago 14 11 0 0

Supervisory Research Scientist (Interdisciplinary) This position is located at the Consumer Financial Protection Bureau (CFPB), Office of Research. The incumbent supervises and conducts independent, self-directed social/behavioral analysis on a variet...

The Consumer Financial Protection Bureau (CFPB) is hiring a section chief for the psych/”behavioral” section and I'd love to see some CogSci representation in there! My brother works there (trained as an economist) and it's an incredible gig doing research in the public interest.

2 years ago 6 9 0 0

Santa Fe Institute now has a Blue Sky account: @sfiscience.bsky.social

2 years ago 39 12 1 0

Spatial communication systems across languages reflect universal action constraints - Nature Human B... Coventry et al. show that spatial demonstratives—such as ‘this’ and ‘that’ in English—are selected on the basis of whether the speaker is able to reach the object or not, across 29 diverse...

A new cross-linguistic study on demonstratives by a team of psychologists and linguists: "Commonalities and differences across languages in spatial communication can be understood in terms of universal constraints on action shaping spatial language and cognition." www.nature.com/articles/s41...

2 years ago 16 4 0 1

PNAS Proceedings of the National Academy of Sciences (PNAS), a peer reviewed journal of the National Academy of Sciences (NAS) - an authoritative source of high-impact, original research that broadly spans...

New paper out!

"Large language models show human-like content biases in transmission chain experiments"
#CulturalEvolution #cssky 🧪

www.pnas.org/doi/10.1073/...

2 years ago 41 26 1 1

new work just dropped, see @stephan-meylan.bsky.social's "thread" below:)
#DevPsych #CogPsych #PsychSciSky #CogSciSky

2 years ago 19 3 1 0

Thrilled at publication of
@stephan-meylan.bsky.social's "How adults understand what young children say", featuring Bayesian noisy-channel inference, LLMs, & child speech datasets!

TL;DR: prior expectations of what kids *want to say* is crucial. (Knowing how kids mispronounce words is too.)

2 years ago 23 9 0 1

Posts by Roger Levy