You can literally watch repression & displacement consolidate over fine-tuning: here's next-token probs across checkpoints of OLMo-3-7B-Think-SFT. Explicit words are repressed almost instantly but safer (displaced) alternatives emerge much later. It learns what not to say before what to say instead.
Posts by Wouter Haverals
and 🙏 to @remarquenyu.bsky.social / Digital Theory Lab for hosting!
Cultural AI felt very real this week at NYU. Outstanding conference; wide-ranging and argumentative in the best way, and full of people trying to build vocab and practices for this emerging space. Huge thanks to @leifw.bsky.social and @t-shoemaker.bsky.social for organizing it!
and 🙏 to the @remarquenyu.bsky.social / Digital Theory Lab for hosting!
The first research paper from WashU's AI Humanities Lab, which I co-direct with Gabi Kirilloff, is available now in the Harvard Data Science Review! Read to learn more about how (badly) current LLMs are at replicating literary style: doi.org/10.1162/9960...
“Tahnk you for reaching put! I will be OOOO four the holiday brake and look forward too connection when I return!”
There isn't a single problem "solved" by edtech that couldn't be fixed with smaller classes led by well-paid teachers given real academic freedom
What an absolute pleasure it was to attend the 6th edition of CHR! Huge thanks to everyone who helped make this edition even better than the last #CHR2025 Onward to Manchester, home of some wonderful computing history:
www.atlasobscura.com/articles/com...
Very happy to present the paper that @joeymccollum.bsky.social and I wrote on applying Bayesian phylogenetics to the medieval Dutch Martijn Trilogy at #CHR2025 :)
Presented my paper "The One and Only? Authorship Verification on Jan van Boendale and the Middle Dutch Antwerp School" on #CHR2025 and it won the ERC Best Long Paper Award! Delighted and grateful :) @comphumresearch.bsky.social
What do we reveal about ourselves when we talk to AI? 🤔💭
In our new WiAIR – Women in AI Research episode, we speak with Maria Antoniak about personal disclosures in human–LLM conversations — and what they mean for ethical AI development. (1/8🧵)
Elementary, my dear BERT!
improving authorship attribution using phonetic features — fascinating work by Simon Gabay, @floriancafiero.bsky.social, and Jean-Luc Falcone at #CHR2025
@wenyishang.bsky.social and @emilyue.bsky.social built a remark-based classifier, checking the consistency between 6th century Chinese poetry criticism and “grades” assigned to classical Chinese poetry #CHR2025
the original masking objective, or: filling in the black d•t in EEBO-TCP transcriptions, presented at #CHR2025 by Kiara M.H. Liu, Martin Mueller and @mattwilkens.bsky.social
@jbarre.bsky.social, @oseminck.bsky.social, Antoine Bourgois, and @tpoibeau.bsky.social built a detective detector, tracing the different archetypes in French detective fiction #CHR2025
cool work at #CHR2025 by @julianeugarten.bsky.social on the sentiment trajectories in fanfiction! @comphumresearch.bsky.social
Graphs showing authors from the Norton Anthology receiving checkout spikes after their death.
Excited to be in Luxembourg at CHR 2025 to hear about everyone’s amazing work and to share my project with @mellymeldubs.bsky.social and our team. We tracked canonical authors and texts in Seattle Public Library circulation data.
I made a feed for #CHR2025 @comphumresearch.bsky.social
bsky.app/profile/did:...
Incredible keynote by @miguelev.bsky.social at @comphumresearch.bsky.social #CHR2025 on what he calls “exploratory finetuning”! Drawing on EDA, art, STS and DH to develop innovative ways to OCR right to left languages (ex Malaysian) and theorizing the method. Such exciting cutting edge work!
I just presented my podcast dataset at #CHR2025. Interested in analysing 412 days worth of podcast episodes? You can find the fully transcribed dataset here: zenodo.org/records/1746... 🎙️
I added a new post on my research blog last week. I wanted to react to a post from Dan Cohen that I've seen circulating on BlueSky last week about Gemini 3, and figured I would add my critical 2 cents to the mix!
alix-tz.github.io/phd/posts/025/
📢 Deadline Extended! #DH2026 @dh2026daejeon.bsky.social proposal submissions now open until December 15, 2025 (KST). Please share! 🔂
📍 Daejeon, South Korea | July 27–31, 2026 🎯 Theme: "Engagement"
Submit your long/short papers, posters, workshops & mini-conferences!
🔗 dh2026.adho.org/cfp
Read my🧵about our new #AIstories publication, by @annesigrid.bsky.social - read on for floating motifs, cannibalism and more! And honestly just to understand more about what actually characterises LLM-generated storytelling.
“The Software Paper fills a gap for the computational and digital humanities communities...” Thank you to research software engineer extraordinaire @suttonkoeser.bsky.social for leading this initiative for Computational Humanities Research journal. Please share!
Editors' Choice: New research shows everyone prefers human writers, including AI! – CDH@Princeton cdh.princeton.edu/news/2025/11...
🗓️ The #CHR2025 programme is online! Browse what’s on the menu here: 2025.computational-humanities-research.org/programme/
Proceedings are coming soon as well. Don’t forget: registration closes on 20 November! #computationalhumanitiesresearch
I curated some readings for class on "data tensions" and the list felt worth sharing. Come on a tour of datasets, books, the web, and AI with me...
We'll start with this piece on the Google Books project: the hopes, dreams, disasters, and aftermath of building a public library on the internet.
1/n
that'd be so useful! reminds me of the Data Provenance Initiative; they audited 1.8K datasets and found 70% had unspecified/unreliable licenses! www.nature.com/articles/s42...