Our PhD student Kennedy Orwa, who studies applications of AI to health care, was hastily deported to Kenya today, along with his 13-year-old son, without an opportunity to speak to legal counsel.
King 5 reports that he held a valid visa that was rescinded without explanation.
Posts by Oleg Urminsky
This was a multi-year undertaking: it started with a very Google-specific question years ago, and we kept working on it into the dawn of AI chat search, which added a whole new dimension to the paper.
Our results suggest that, at least in the context of search, confirmation bias in framing questions is the primary factor -- otherwise the two biases would cancel out and there would be no effect of broadening search.
An interesting theoretical aspect of this practical question: research in psychology has identified two potential dimensions of confirmation bias, bias in framing questions and bias in which answers we pay attention to.
When platforms adjust algorithms to return broader search results, people update their beliefs more (under some conditions). This can be a good thing, to the degree that the incremental information provided by broadening is accurate.
The paper, led by Eugina Leung, is here:
www.pnas.org/doi/10.1073/...
We document that confirmation bias in question framing (people's tendency to frame searches in terms of their prior beliefs) and algorithms that optimize for relevance combine to impede belief updating.
Very excited that this @chicagoboothreview.bsky.social interview and our paper on the "Narrow Search Effect" are both out:
www.chicagobooth.edu/review/podca...
🧵
People’s spontaneous searches are narrow, aimed at producing results in line with their prior beliefs, which then tend to persist. But when people are presented with broader information, they update their beliefs more, research by Leung & @olegurminsky.bsky.social suggests:
When you collect data online, are the results from humans or AI? In a project led by Booth PhD student Grace Zhang, we estimate the prevalence of AI agents on commonly used survey platforms:
osf.io/preprints/ps...
🧵
Yes, off-the-shelf agents fail these checks, and we do see failures on the platforms. There are also respondents who fail other checks but pass these. One limitation of our study, however, is that we bundled it with a typing check, so we don't have as clear a read on the specific checks.
Worth a look if you're running online studies. Happy to be a part of this; great work led by Grace. Check out Oleg's thread; feedback welcome!
Agreed, we did not test proctoring.
Nope, not dead at all; it's actually the primary method for collecting custom survey data in social science. My personal opinion is that online data collection is an incredibly valuable resource that we need to invest in saving from obliteration by AI; your view may differ.
Ooops. Meant to say "AI checks can also mistake non-compliant human respondents for AI."
Feedback and questions are welcome!
Some caveats. Off-the-shelf AI agents can pass survey checks with human assistance and AI agents can be purpose-built to pass checks without human assistance. Detecting AI agents is a moving target: ongoing independent testing of survey platforms is needed.
Using AI checks to screen out respondents is a bad idea: it helps AI learn to evade the checks. Better to just collect the data and pre-register exclusions (and move to a better platform when the fail rate is too high).
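For anyone curious what that workflow could look like in practice, here is a minimal sketch in Python/pandas. The file name, column names (passed_ai_checks, typing_check), and the exclusion rule are purely illustrative assumptions, not the checks from our study:

```python
import pandas as pd

# Collect everything first, then apply pre-registered exclusions at analysis time.
# File name, column names, and the rule below are hypothetical, for illustration only.
df = pd.read_csv("survey_responses.csv")

# Hypothetical pre-registered rule: drop respondents who pass fewer than 5 AI checks
# or who fail the typing check (assumed to be a boolean column).
pre_registered_exclusion = (df["passed_ai_checks"] < 5) | (~df["typing_check"])
analysis_sample = df[~pre_registered_exclusion]

# Track the overall fail rate; if it gets too high, consider switching platforms.
fail_rate = pre_registered_exclusion.mean()
print(f"Pre-registered exclusion rate: {fail_rate:.1%}")
print(f"Analysis sample size: {len(analysis_sample)} of {len(df)}")
```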
AI checks can also mistake human respondents for AI. Survey participants ignoring instructions and copy-pasting text into open-ended responses has been a problem for a long time:
marginallysignificant.com/2019/03/18/u...
Some thoughts. Relying on just one AI check is not a good idea – AI agents differ in their capabilities. Use multiple tests, varied regularly. Not checking against a human baseline may lead to over-estimating AI agents: AI and humans are bad at some of the same things.
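To illustrate the human-baseline point (a toy calculation with invented numbers, not the estimator from our paper): if verified humans fail some of the same checks, the raw platform failure rate overstates AI prevalence.

```python
# Toy numbers, invented for illustration only.
platform_fail_rate = 0.12   # share of platform respondents failing the AI-check battery
human_baseline_rate = 0.04  # share of verified in-person humans failing the same battery

# Ignoring the baseline treats every failure as an AI agent.
naive_estimate = platform_fail_rate

# Subtracting the human false-positive rate is one simple correction.
adjusted_estimate = max(0.0, platform_fail_rate - human_baseline_rate)

print(f"Naive estimate of AI prevalence: {naive_estimate:.0%}")
print(f"Baseline-adjusted estimate:      {adjusted_estimate:.0%}")
```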
This can matter for survey results. Platforms with more AI failures yielded lower estimates of disapproval of using AI to complete surveys. Excluding potential AI agents reduced those differences (vs. in-person humans at Mindworks @CDR_Booth).
The results differ substantially across platforms. @joinprolific.bsky.social and @cloudresearch.bsky.social ’s Connect panel have relatively low failure rates, while MTurk (even via @cloudresearch.bsky.social) has a high failure rate.
We use five AI checks, validating that common AI agents fail our checks but in-person human respondents do not. We then collect data on 7 online platforms.
Recent work by @seanjwestwood.bsky.social in PNAS has raised a red flag about AI agents being able to complete online surveys.
www.pnas.org/doi/10.1073/...
“I've said to a couple of my colleagues, like, man, what would it have looked like in 2024 when we're in the thick of the campaigns, if we stopped being so defensive on immigration and we went on the offensive; had we all collectively rang the alarm when we saw it start.”
From this follow simple recommendations: as a default, meta-scientific studies of published research artefacts need to include 1) a full, identifiable list of included studies, 2) the full coding instrument and decision rules, and 3) the individual ratings together with a codebook.
Our instinct to seek confirmation leads to ‘narrow’ internet search behaviour.
Chatbots, trained to be helpful, tend to go along with this, but could be trained to help us update our beliefs.
@olegurminsky.bsky.social researched this and explains what he found:
buff.ly/0eafZ78
"It's a general issue in lots of technology that the technology is designed to try to be helpful to us and around our needs, but often it takes a simple view of what those needs are and leaves out some of the needs." - @olegurminsky.bsky.social
www.chicagobooth.edu/review/podca...
This is one of the most beautiful things I have witnessed; the craft here is impeccable.