I'll be at #ICML2026 on July 10-11 in Seoul to speak at the Workshop on Culture x AI: Evaluating AI as a Cultural Technology.
The workshop is currently accepting submissions, with humanities, ML, HCI, and social/cognitive sciences all welcome.
Submit papers by May 1! Join us in Seoul! 🇰🇷
Posts by David Mimno
DH Awards apparently organized by Eeyore? Congrats to the winners, runners up, and all nominees!
I assume this is why all of the people I know who study robots don’t want to put anyone anywhere near a humanoid robot
Looking forward to the open source release, but it sounds like they’re treating model weights as static variables in code rather than data in an ML library, which means the Rust compiler can optimize 1000x better
CU Boulder is hiring!!
What a joy + honor to work w @walshbr.bsky.social to develop a new virtual #DigitalHumanities workshop track for #ACH2026! We'll be offering 3 fab classes (I wish I could take them all!). Whether you're new to the field or a seasoned DHer, we've got something for you, rooted in care + community
Today I scrolled through a chat I used to ponder...
Is that the first pelican riding right to left?
It’s great to see Whitaker’s legacy live on!
The Library of Virginia in Richmond seeks a data engineer ($100k-$125k) to transform data practices at a 200-year-old cultural heritage org with an eye towards the future.
Looking for someone to imagine & collaboratively implement tomorrow's data infrastructure.
Apply by May 1! Tell your friends!
Cornell started 20+ years ago. It's been great for InfoSci. We are part of a larger unit than if we were a separate iSchool, but a smaller and more focused unit than an A&S or Engin college.
Google dropped 4 different Gemma open-weight models! I'm most excited that they're finally adopting a standard Apache 2.0 open source license.
huggingface.co/collections/...
As AI gets more advanced, it’s becoming harder for publishers to tell whether a book was written by a human or AI-generated.
@theguardian.com spoke with professor @informor.bsky.social and assistant professor @nkgarg.bsky.social to unpack why AI is becoming harder to detect:
https://bit.ly/4sTcOcY
Today is the 190th anniversary of the publication of the first installment of The Pickwick Papers. I thought it would be interesting to read at the original cadence. This page reveals each section on the original timeframe, offset by 190 years:
mimno.infosci.cornell.edu/Pickwick/
You see a handy paper plate offering access to all (!) Popes, Roman Emperors, Emperors of the Holy Roman Empire, a lot of Christian kings, and more minor rulers. You need to move the paper pointer, find the correct ring, and read the information. #skystorians: it's an #earlymodern paper tool.
1/2
Submission deadline extended to April 15th!
ASA Pre-Conference: Science, Knowledge and Technology Today, 8/7 in NYC: asaskat.com/SKATtoday/
Thanks for making time for us!
English version: archive.org/details/theo...
Join us today
@cudanlab.bsky.social
for a lecture & discussion with
Axel Bax & David Mimno @dmimno.bsky.social
on "Extracting Character Movements"
📕💃🔬
16:00-18:00 Tallinn time UTC+2 🇪🇪🇪🇺🌍
(55 minutes from this post).
Details & zoom:
cudan.tlu.ee/events/2026-...
I regret missing #atmosphereconf but would love attendees to check out and give feedback on our labeler at @stechlab-labels.bsky.social -- a first-pass implementation looking at how labelers can improve the surfacing of transparent, objective, useful ATProto information about account behavior.
If you're reviewing ARR papers and want a tool to help you spot potential hallucinated references, I cooked this up for the ACL SACs and thought I would share it with the broader community github.com/davidjurgens...
This is a challenging legal problem for NeurIPS (and other conference participants)! You might be wondering how this is possible given the First Amendment?
I wrote a quick explainer on the current status quo of relevant First Amendment cases & law to get you up to speed.
🔗👇
Like every year around PhD decision time, here is a PhD syllabus I wrote to demystify and clarify what to expect from the PhD for students in my lab. It may be helpful for others making their decisions: it's not the same everywhere, but can tell you what q's to ask.
s.tech.cornell.edu/phd-syllabus/
"Album" cover for the LatinCy v3.9 pipelines with "catus" from Gesner's 1551 Historia Animalium
✨ LatinCy v3.9 sm/md/lg/trf pipelines for SpaCy available ✨
- Improved tokenization and u/v norm
- New custom Latin-specific XPOS tags
- Better, more consistent lemma/morph coverage
huggingface.co/latincy/la_c...
#digiclass #nlproc
Nope, it's just the only kind of argument I would find compelling
I was thinking about how Fahrenheit temperatures sound ridiculously hot to someone used to Celsius, so I had Claude make a translation table going both ways. e.g. saying it's 70° sounds like saying it's 160° to a F person.
mimno.infosci.cornell.edu/temperature-...
When the closure of HTRC was announced, more than two years ago, Hathi said it was going to develop a plan for continued research access to their collections. I hope that plan has been developed and is close to being released.
Join @taylor-arnold.bsky.social and me for a postdoc to expand distant viewing! 2 years. Awesome liberal
arts college. Great city. Option to teach 1-2 courses a year for extra pay. Thanks to Mellon for the opportunity! Happy to answer any questions.
richmond.wd5.myworkdayjobs.com/en-US/staff_...
We got tired of bolting accessibility onto R charts after the fact, so we built an R/d3.js viz library that starts from 508 compliance and works outward. 20 chart types, SVG, ARIA, keyboard nav. Still in beta — demo at morton-analytics.com/myio/
For example, iterated subspaces for eigenvectors is matrix-multiply, normalize, orthogonalize, repeat. You could unroll this as a deep neural network with a dense layer (weights tied between layers), a layernorm, and a QR non-linearity.