On our way to Tartu, Estonia, to present the #warc2corpus pipeline! Look forward to discuss how web archive collections can be turned into corpus data for distant reading.
#DHNB2025 #DigitalHumanities
#OpenScience #WebArchiving
#ComputationalSocialSciences
@dhnb.bsky.social @netpreserve.bsky.social
8
3
1
0