#IA "Why I am uncomfortable with the use of generative AI among archivists" by Julien Benedetti, archivist. macgraveur.github.io/mapage/blocn... #archives #IST #IAg
Posts by Alix Chagué 🌈
Turning a new page with eScriptorium... and we wanted to talk about it with you.
In a blog post written with Mathew Barber, we explain what 1.0 is, what it represents for our team, how we are trying to improve our transparency, and how "our team" works.
escriptorium.eu/blog/2026-04...
We are looking for a full-stack developer to join our team for eScriptorium!
Application deadline March 31st:
www.euraxess.fr/jobs/417083
If you use GitHub (especially if you pay for it!!) consider doing this *immediately*
Settings -> Privacy -> Disallow GitHub to train their models on your code.
GitHub opted *everyone* into training. No matter if you pay for the service (like I do). WTH
github.com/settings/cop...
Poster advertising lectures on "Raisonnement Philologique et Modèles Informatiques" starting at 4pm, Thursday, March 12, at 54 Boulevard Raspail, Paris.
Paris friends! Amis parisiens ! This Thursday is the first of four public lectures I'm giving on AI and philology, broadly defined: "Philological Reasoning and Computational Models." The advertisement is in French, but the lectures are in English. I'd also love to meet while I'm here in March! 1/
Workshop (Call for proposals) – Analysing Cultural Heritage Documents: HTR/OCR, Information Extraction, and Textual Variation prima.hypotheses.org/3737
This week saw the release of eScriptorium 1.0.0, the culmination of years of work and a final push by H. Aguili at @inriaparisnlp.bsky.social
The new UI is much better, you can discover it at youtu.be/WEf42RlweZA
The deadline for submissions to the upcoming Digital Humanities Summer Institute aligned conference in Montreal is this Friday, January 30, 2026.
The deadline for submitting proposals to the parallel conference of the Digital Humanities Summer Institute is this Friday, January 30, 2026.
Excellent! I share many of your questions, including the one about the balance between available time and polished writing...
'“We received a total of 32,763 manuscripts, mostly in Old French and Latin, which we transcribed in four months”, explains Thibault Clérice - infinitely quicker than it would have taken to complete such a task manually.'
I wanted dinner recommendations so I scraped 13,000+ London restaurants and accidentally discovered Google Maps is running a shadow economy. Anyway here's a dashboard and a political economy thesis: open.substack.com/pub/laurenle...
Want to keep up with everything happening around #CHR2025 on Bluesky? Follow the feed we created to catch all related posts in one place: bsky.app/profile/did:...
I am very happy to not be repeating the AI class I did in fall 2023, but this is the first piece of AI-themed writing that made me laugh so hard I almost wish I could put it on the syllabus.
Now translated! 🙌
Thanks! (I'll take the time to translate it once I finish writing my current chapter 😉)
I added a new post to my research blog last week. I wanted to react to a post from Dan Cohen about Gemini 3 that I saw circulating on Bluesky, and figured I would add my critical 2 cents to the mix!
alix-tz.github.io/phd/posts/025/
Excellent! This morning, I was adding to my latest post the idea of a "presti-générateur" to describe models that perform sleight-of-hand tricks to hide their errors. It resonates well with this notion of bullshitter models 👍
...and of discoveries that end up echoing one another. If we only publish what has been thoroughly researched and thought through, we lose that side of the research notebook
I think it is very easy to come back later, in another post, and go deeper into a topic already covered, if in the end you have the time and/or the inclination. It seems to me that research notebooks are precisely what let you follow a line of thought, connections that germinate over the long term, from one reading to the next...
Yes! For several months now I have really been rediscovering the pleasure of reading research notebooks / blog posts, even when they go in all directions or the subject is treated superficially. A blog is not the hot take of social media, but it still gives a look behind the scenes!
Does anyone have a dataset of 1,000+ pages of handwritten text on Transkribus that they want to use for fine-tuning a VLM? If so, please let me know. This would be for any language and any script.
It's been brewing for months: @inriaparisnlp.bsky.social releases CoMMA (Corpus of Multilingual Medieval Archives)!
📚 2.5bn tokens of mostly Latin and French texts
🕰️ 800→1600 CE
📜 23k manuscripts
🖥️ 18k on the reading interface: comma.inria.fr
🔍 Paper: inria.hal.science/hal-05299220v1
(1/🧵)
For multiple-choice quizzes, Moodle sometimes lets you export to XML, which makes it quicker to edit the questions by hand and then re-import them into another course... And it saves a huge amount of time, given how slow it is to go through the graphical interface!
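A minimal Python sketch of the kind of batch edit a Moodle XML export makes possible, for anyone who would rather script it than edit by hand. The sample export, the `rename_questions` helper and the "2026 - " prefix are all hypothetical, and the sample only assumes the usual `<quiz>` / `<question>` / `<name><text>` nesting; a real export will contain many more fields.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified Moodle XML export with a single
# multiple-choice question (a real export has many more elements).
SAMPLE_EXPORT = """<?xml version="1.0" encoding="UTF-8"?>
<quiz>
  <question type="multichoice">
    <name><text>Q1</text></name>
    <questiontext format="html"><text>What does HTR stand for?</text></questiontext>
    <answer fraction="100"><text>Handwritten Text Recognition</text></answer>
    <answer fraction="0"><text>Hyper Text Rendering</text></answer>
  </question>
</quiz>
"""


def rename_questions(xml_text: str, prefix: str) -> str:
    """Prepend `prefix` to the name of every question that has one."""
    root = ET.fromstring(xml_text.encode("utf-8"))
    for question in root.iter("question"):
        name_text = question.find("name/text")
        # Some entries (for example category markers) have no name: skip them.
        if name_text is not None and name_text.text:
            name_text.text = f"{prefix}{name_text.text}"
    return ET.tostring(root, encoding="unicode")


# Hypothetical usage: tag every question before re-importing into a new course.
edited = rename_questions(SAMPLE_EXPORT, "2026 - ")
print(edited)
```

The edited string can then be saved to a file and re-imported; it is worth checking the result on a copy of the course first.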
🚨Job ALERT🚨! My old postdoc is available!
I cannot emphasize enough how life-altering this position was for me. It gave me the experience that I needed for my current role. As a postdoc, I was able to define my projects and acquire a lot of new skills as well as refine some I already had.
By creating an Account with Academia.edu, you grant us a worldwide, irrevocable, non-exclusive, transferable license, permission, and consent for Academia.edu to use your Member Content and your personal information (including, but not limited to, your name, voice, signature, photograph, likeness, city, institutional affiliations, citations, mentions, publications, and areas of interest) in any manner, including for the purpose of advertising, selling, or soliciting the use or purchase of Academia.edu's Services.
I'm sorry, worldwide, irrevocable, non-exclusive, transferable permission to my voice and likeness? For what now? In any manner for any purpose???
This is in academia/.edu's new ToS, which you're prompted to agree to on login. Anyway I'll be jumping ship. You can find my stuff at hcommons.org.
Coming up on Oct 3-4, 2025 at Central European University: OCR/HTR Workshop for Under-resourced & Under-represented Languages in DH, funded by the Cluster of Excellence EurAsian Transformations & CLARIAH-AT! (Main organizer: yours truly) #digitalhumanities #multilingualdh #textrecognition #ocr #htr
Lately, I have gotten into the habit of restricting web searches to pages published before 2019, especially when I am looking for a definition of the difference between two terms or concepts. Otherwise, 90% of the time, I get web pages that I think are AI slop and that don't help me at all!
What is the consistency score in your chart?